Nebius
AI

Senior ML Engineer (Token Factory)

Nebius · Praha, A, CZ

Actively hiring Posted 4 months ago

Role overview

  • Advanced Fine-Tuning: Enhancing fine-tuning methodologies - both LoRA-based and full-parameter - for cutting-edge LLMs (e.g., GPT-OSS, Kimi K2.5, DeepSeek V3.1/V3.2, GLM-4.7), focusing on both model quality and training efficiency.
  • Inference Optimization: Identifying LLM inference bottlenecks to drive production speedups. This involves building model training and evaluation pipelines in JAX for speculative decoding, experimenting with architectures (dense/MoE, auto-regressive/parallel), and deriving scaling laws to guide resource allocation.
  • Low Precision Training & Inference: Investigating low-precision (FP8, NVFP4/MXFP4) methodologies for supervised fine-tuning and reinforcement learning - spanning both inference and training - optimized for modern hardware
  • A profound understanding of theoretical foundations of machine learning and reinforcement learning.
  • Deep expertise in modern deep learning for language processing and generation
  • Experience with training large models on multiple computational nodes
  • Reasonable understanding of performance aspects of large neural network training (sharding strategies, custom kernels, hardware features etc.)
  • Strong software engineering skills (we mostly use Python)
  • Deep experience with modern deep learning frameworks (we use JAX)
  • Proficiency in contemporary software engineering approaches, including CI/CD, version control and unit testing
  • Strong communication and leadership abilities

Preferred qualifications

  • Previous experience working with language models or other similar NLP technologies.
  • Familiarity with important ideas in LLM space, such as MHA, RoPE, ZeRO/FSDP, Flash Attention, quantization
  • A track record of building and delivering products (not necessarily ML-related) in a dynamic startup-like environment.
  • Strong engineering skills, including experience in developing large distributed systems or high-load web services.
  • Open-source projects that showcase your engineering prowess
  • Excellent command of the English language, alongside superior writing, articulation, and communication skills.

Benefits

  • Competitive salary and comprehensive benefits package.
  • Opportunities for professional growth within Nebius.
  • Flexible working arrangements.
  • A dynamic and collaborative work environment that values initiative and innovation.

Tags & focus areas

Used for matching and alerts on DevFound
Remote Ai Machine Learning