DFINITY
AI

Senior AI Engineer, Reinforcement Learning (Post-training)

DFINITY · ZH Zürich, Zürich, Switzerland · $105k - $112k

Actively hiring Posted over 1 year ago

Employment Type: 6 Month Contract

We are seeking a highly skilled Senior AI Engineer to accelerate the deployment of improvements to our models. You will collaborate with diverse teams handling various facets of the system, including core capabilities, multimodal integration (code, text, and images), tools, and more. This role offers a unique opportunity to shape the future of the Internet Computer, working across the technology stack, from optimizing low-level components like GPU kernels to mastering the intricacies of reinforcement learning post-training.

 

The ideal candidate has a robust technical background in areas such as data technologies, reliable software engineering, production ML model development, and cross-functional collaboration. While research experience is not required, a deep understanding of ML fundamentals and large-scale deep learning is essential for troubleshooting and analyzing complex system and ML issues. Excellent verbal and written communication skills, along with strong project management abilities, are crucial as you will collaborate with both technical engineering and research teams and non-technical product teams across the company.

 

Responsibilities

  • Ownership of Post-Training Pipeline: Lead the design, implementation, and optimization of the post-training pipeline to ensure efficient model deployment and scalability.
  • Coordination of Data Development: Oversee the development of high-quality training datasets, including managing the creation and use of synthetic data.
  • Model Training: Conduct advanced model training, ensuring continuous improvement in accuracy and performance.
  • Collaboration: Work closely with cross-functional teams including data engineers, software engineers, and product teams to integrate AI models into production.
  • Performance Monitoring: Analyze and monitor the performance of models in production, iterating on training pipelines to enhance outcomes.

 

Requirements

  • Experience: Minimum 5 years of experience in AI/ML engineering with a focus on model training and deployment.
  • Post-Training Expertise: Demonstrated ability to build and optimize post-training pipelines at scale.
  • Data Coordination: Experience in managing the development and annotation of synthetic and real-world datasets.
  • Technical Skills: Proficiency in Python, TensorFlow/PyTorch, and experience with cloud platforms like AWS, GCP, or Azure.
  • Team Leadership: Proven track record of coordinating complex engineering projects with cross-functional teams.
  • Analytical Skills: Strong problem-solving skills with a focus on performance optimization and automation.

 

Bonus Points

  • Prior experience with distributed AI systems.
  • Hands-on experience with synthetic data generation and augmentation techniques.
  • Familiarity with tools for data pipeline automation and orchestration.

 

Tags & focus areas

Used for matching and alerts on DevFound
Ai Engineer Senior Education Aws Tensorflow Pytorch Python Gcp Azure
Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.