Role overview
- Co-own the training pipeline end-to-end. Design, build, and maintain the infrastructure and components that let us iterate fast on experiments.
- Build high-quality tooling. Model training is a continuous effort, and we deliberately invest in our tooling and infrastructure to stay successful long term.
- Collaborate across disciplines. We believe in cross-functional teams. Engineers and researchers work closely so we can learn from each other and iterate faster together.
- Champion good engineering practices. Working incrementally, maintaining fast feedback loops, and refactoring continuously keep a team successful long-term, especially when moving fast.
- Shape the direction of the team. Our culture empowers individuals to take ownership. If you see that we'll need more GPUs, a different storage system, or a change to how the team is set up, you should drive this change.
Basic qualifications
- A track record of taking initiative to deliver high-impact work.
- Experience contributing in high-performing teams.
- Degree in computer science, engineering, or a related field.
- Willingness to relocate to Germany. Our primary working locations are Heidelberg (preferred) and Berlin, although there is some flexibility to work from other locations in Germany with regular travel to Heidelberg (potentially weekly).
- Ability to write software that other strong engineers want to read and build on.
- Desire to take ownership of problems and collaborate with other teams to solve them.
- Deep interest in how state-of-the-art foundation models work.
- Strong communication skills, with the ability to convey technical solutions to diverse audiences.
- Experience working with distributed systems.
- Experience working with Kubernetes.
- Experience bringing AI research innovations into production.
- Experience in areas such as large-scale data processing or distributed computation for foundation model training or inference.
- Experience with performance engineering: profiling, benchmarking, and optimizing code for throughput, latency, or memory.
Benefits
- Become part of an AI revolution!
- 30 days of paid vacation
- Access to a variety of fitness & wellness offerings via Wellhub
- Mental health support through nilo.health
- Substantially subsidized company pension plan for your future security
- Subsidized Germany-wide transportation ticket
- Budget for additional technical equipment
- Flexible working hours for better work-life balance and hybrid working model
- Virtual Stock Option Plan
- JobRad Bike Lease
Tags & focus areas
Used for matching and alerts on DevFound Fulltime Ai