Waymo
AI

Research Scientist, RL for Autonomous Planning World Modeling

Waymo · Remote, US · $204k - $259k

Actively hiring Posted 5 months ago

Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building the Waymo Driver—The World's Most Experienced Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The Waymo Driver powers Waymo’s fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The Waymo Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states.

The mission of the Waymo AI Foundations team is to develop machine learning solutions addressing open problems in autonomous driving, towards the goal of safely operating Waymo vehicles in dozens of cities and under all driving conditions. As part of our work, we also initiate and foster collaborations with other research teams in Alphabet. AI Foundations areas that we are currently focusing on include reinforcement learning, learning from demonstration, generative modeling, Bayesian inference, hierarchical learning, and robust evaluation.

In this hybrid role, you will report to a Principal Scientist.

You will:

  • Participate in Waymo’s Foundation World Model post-training and evaluation
  • Research and develop cutting edge RL and Distillation techniques for Autonomous Vehicle Trajectory Planning
  • Integrate emerging research from the broader AI community into Waymo’s internal RL infrastructure, conducting rigorous ablations to identify and scale the most promising methods
  • Partner with engineering and research teams across Waymo to share recipes, techniques, and post-training best practices to accelerate our collective know-how

You have:

  • PhD or Masters in Computer Science, Machine Learning, Robotics, or a similar technical field; with 3+ years of industry or post-doc research experience in Reinforcement Learning or Foundation Models
  • Demonstration of original contributions to the field through high-impact publications (ArXiv, peer-reviewed conferences like NeurIPS/ICLR/CVPR), technical blog posts, or significant open-source contributions
  • Proficiency in implementing model training flows in a scalable, distributed and performant manner such as Data parallel, FSDP and other sharding approaches
  • A willingness to work with complexity of globally distributed inference infrastructure

We prefer:

  • PhD in Computer Science, Machine Learning, or Robotics, with a research focus on Reinforcement Learning, Foundation Models, or Multi-Modal learning
  • Extensive experience designing and deploying Reinforcement Learning infrastructure, specifically for on-policy learning or alignment with human preferences
  • A consistent history of original contributions to the AI community, evidenced by first-author publications at top-tier venues (e.g., NeurIPS, ICLR, ICRA) or maintaining significant open-source ML projects
  • Experience with large scale (many-machine) training infrastructure and techniques for inference with large models such as model sharding/tensor-parallel

((Remote jobs only - Please note that Waymo may not be able to employ remotely in all locations. Please speak with your recruiter about your preferred location for remote work when you begin the interview process.))

The expected base salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-related factors, including exact work location, experience, relevant training and education, and skill level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process.

Waymo employees are also eligible to participate in Waymo’s discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements.

Salary Range

$204,000—$259,000 USD

Tags & focus areas

Used for matching and alerts on DevFound
Fulltime Remote Ai Machine Learning Robotics
Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.