Flo Health
AI

Senior AI/ML engineer

Flo Health · London, ENG, GB

Actively hiring Posted 4 months ago

500M+ downloads. 80M+ monthly users. A decade of building – and we're still accelerating.

Flo is the world's #1 health & fitness app worldwide on a mission to build a better future for female health. Backed by a $200M investment led by General Atlantic, we became the first product of our kind to reach a $1B valuation in 2024 – and we're not slowing down.

With 7M paid subscribers and the highest-rated experience in the App Store's health category, we've spent 10 years earning trust at scale. Now, we're building the next generation of digital health – AI-powered, privacy-first, clinically backed – to help our users know their body better.

The job

We are looking for a Senior Software Engineer with deep expertise in AI/ML infrastructure to join our AI Platform team and help build the GenAI platform that powers every AI feature at Flo.

You will bridge core infrastructure, data engineering, and LLM development to deliver production-grade medical safety judges, fine-tuning pipelines, evaluation frameworks, and real-time personalisation. The team operates 60+ LLM-based evaluation judges, develops proprietary fine-tuned health models, and maintains active partnerships with Databricks, Google, OpenAI, Anthropic, and AWS.

What you'll do

  • LLM Judge Ecosystem: build and scale Judge-as-a-Service, prompt registries, calibration pipelines, and evaluation orchestration using MLflow 3.x
  • Fine-Tuning and Serving: develop LoRA/SFT/preference optimisation pipelines for health-domain models (Llama, Gemma, MedGemma) and manage model serving at scale on Databricks
  • Data and Evaluation Pipelines: build synthetic Q&A generation, golden test sets, reward function engineering, and Delta table schemas in Unity Catalog for reliable, reproducible evaluation data
  • Infrastructure: maintain Terraform-managed AWS infrastructure (EKS, S3, IAM), Databricks AI Gateway, and CI/CD pipelines (GitHub Actions) with evaluation gates and progressive rollout
  • Cross-Functional Impact: collaborate with Product, Security, Analytics, and Medical teams, develop internal SDKs and APIs consumed by 5+ teams, and engage directly with technology partners on pre-release capabilities

**Experience and skills

Must have:**

  • Engineering maturity: 7+ years of software engineering, 4+ years focused on ML/AI platforms
  • LLM experience: recent hands-on work with at least one of: fine-tuning, prompt engineering, LLM evaluation, or model serving
  • Technical stack: strong Python across production services and data pipelines, data engineering fundamentals (Spark, Delta tables, Parquet)
  • Platform and infrastructure: Databricks (MLflow, Unity Catalog, Model Serving), AWS (EKS/Kubernetes, IAM), Terraform, GitHub Actions
  • Cross-domain flexibility: comfort working across ML, data engineering, and infrastructure. You don't need to be expert in all three, but you contribute wherever the team needs it

Nice to have:

  • LLM evaluation frameworks (judges, graders, calibration methodology) or fine-tuning techniques (LoRA, RLHF/DPO, model distillation)
  • ML data engineering: synthetic data generation, evaluation dataset design, annotation pipelines
  • Healthcare, regulated industry, or safety-critical AI systems experience
  • Prompt optimisation frameworks (DSPy or similar), feature stores (Tecton)

#LI-KP1 #LI-Hybrid

How we work

We're a mission-led, product-driven team. We move fast, stay focused and take ownership – from brief to build to impact. Debate is encouraged. Decisions are shared. We care about craft, ship with purpose, and always raise the bar.

You'll be working with people who take their work seriously, not themselves. It takes commitment, resilience, and the drive to keep going when things get tough. Because better health outcomes are worth it.

What you'll get

We support impact with meaningful reward. Here's what that looks like:

  • Competitive salary and annual reviews
  • Opportunity to participate in Flo's performance incentive scheme
  • Paid holiday, sick leave, and female health leave
  • Enhanced parental leave and pay for maternity, paternity, same-sex and adoptive parents
  • Accelerated professional growth through world-changing work and learning support
  • In-person collaboration and work in a hybrid model, with 3 days per week spent in the office
  • 5-week fully paid sabbatical at 5-year Floversary
  • Flo Premium for friends & family, plus more health, pension and wellbeing perks

Diversity, equity and inclusion

Our strength is in our differences. At Flo, hiring is based on merit, skill and what you bring to the role – nothing else. We're proud to be an equal opportunity employer, and we welcome applicants from all backgrounds, communities and identities. Read our privacy notice for job applicants.

Tags & focus areas

Used for matching and alerts on DevFound
Ai Ai Engineer Machine Learning Data Engineer Generative Ai
Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.