You're a hands-on ML engineer with 4-6 years of experience building and fine-tuning large language models (LLMs) and other transformer-based models.
Implement RLHF (Reinforcement Learning from Human Feedback) pipelines for model alignment and preference optimization. Design experiments for automated hyperparameter tuning, training strategies, and model selection.
Tags & focus areas
Used for matching and alerts on DevFound: Training, Version Control, Managed Services, GCP, Machine Learning, Technical Leadership, Data Quality, Monitoring, Generative AI, AI