Vectara
AI

Machine Learning (ML) Engineer

Vectara ·

Actively hiring Posted 6 months ago

Vectara provides a scalable platform to deploy your Enterprise AI Agents and AI Assistants with
Accuracy, Security, and Explainability
like no other solution. Our enterprise RAG Platform offers unparalleled
Accuracy, Security, and Explainability
by leveraging the strongest models for
retrieval, embedding, reranking
, a optimized
LLM trained for quality
, and advanced
Hallucination Mitigation
. We are the developers of the Hughes Hallucination Evaluation Model and Correction model, core to ensuring accuracy, quality, and responsible AI that is
production ready
. These innovations have been cited in the
New York Times, Visual Capitalist
, and many other leading publications. This platform has allowed us to be very successful with over 100 Enterprise clients including the likes of large US military organizations, Financial services, Healthcare, and Manufacturing.

Our founding team includes industry veterans and experts in neural information retrieval and distributed systems from Google. Join us as we pursue our mission to help the world find meaning. People at Vectara are passionate about ensuring customers take advantage of breakthroughs in applied Artificial Intelligence (AI) to solve real-world technology and business problems today. Our team is a group of unquestionable all-stars in their respective fields of computer science and business from Google, Cloudera, Splunk, MongoDB, Elastic, and more.

Job responsibilities

  • Design, prototype, research and build AI systems for Vectara.
  • Train, evaluate and deploy ML models in the domains of Natural Language Processing, Information Retrieval, AI Agents, Large Language Models (LLMs) and Multimodal Large Models (MLMs).
  • Improve the quality of Vectara’s RAG-as-a-service platform, working on features like multilinguality, self-supervised learning, agentic behavior and hallucination reduction.
  • Publish technical blogs, papers, and patents.

Requirements:

  • BS/MS in Computer Science, Statistics, Electrical/Computer Engineering, Mathematics, or a related field.
  • 5+/4+ years of experience after BS/MS.
  • Strong software engineering basics, we work on research but also write production code.
  • Knowledge of common challenges in training ML models and solutions to them.
  • Familiarity with the technical details of deep learning concepts, such as Transformers, Retrieval-Augmented Generation (RAG), mixture of experts (MoE).
  • Proficiency in data/ML libraries such as pandas, transformers, and torch.
  • Hands-on experience in training ML systems end-to-end from data curation to evaluation and deployment.
  • Ability to collaborate with cross-functional teams.

Preferred requirements:

  • PhD in Computer Science/Engineering with 1+ years of industry experience.
  • Publications in prestigious venues such as ACL, NAACL, EMNLP, NeurIPS, ICML, ICLR as a key author.
  • Experience as an ML engineer in an early-stage, high growth environment.
  • Expertise in the following areas:

  • Embedding models, rerankers

  • Multimodal retrieval, question answering, and reasoning

  • Vector databases, BM25

  • Planning and reasoning in LLMs

  • Multilinguality in LLMs

  • NLG Evaluation such as hallucination detection

Location requirements:
We support remote applicants from all over the US but candidates who can come to the office 2-3 days a week in our Palo Alto office are preferred.

Equity and Salary Range:
Salary is just one component of Vectara’s employee compensation. Our full-time employees are also equity owners in the company, which although not an immediate cash component, can have positive impacts on long-term total compensation for each participating employee. We would be remiss if we didn’t highlight and celebrate our focus on engaging many of our employees in being economic co-owners of the business.

Vectara welcomes all. We value the collective wisdom of people from different backgrounds, experiences, abilities and perspectives. We never discriminate on the basis of race, religion, national origin, gender identity or expression, sexual orientation, age, or marital, veteran, or disability status. Vectara has a positive and supportive culture—we look for people who are inventive and work to be a little better every single day. We seek to be smart, humble, hardworking and, above all, curious. After all, we are on a mission to find meaning.

Perks and Benefits:
100% paid Medical, Dental, Vision begins on your first day! Option of Health Savings Account (HSA) or Flexible Savings Account (FSA). Generous paid time off (PTO) plus paid sick time, holidays, and company rest days. Professional development and training opportunities. Company virtual happy hours and fun team building activities and more.

Tags & focus areas

Used for matching and alerts on DevFound
Fulltime Remote Ai Machine Learning Generative Ai
Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.