DeWinter Group

Senior AI Platform Engineer (Generative AI)

Actively hiring · Posted 5 months ago

**Boston, MA (hybrid preferred; open to remote)**

**12+ month contract (or contract-to-hire, if desired)**

We are looking for a senior AI Platform Engineer to build, operate, and scale generative AI systems in production. This role sits at the intersection of ML systems, platform engineering, and SRE, supporting large-scale inference workloads in regulated, enterprise environments.

This is not a research-only role — candidates must have real production experience supporting AI systems end-to-end.

Core Responsibilities

  • Design, deploy, and operate production-grade generative AI platforms
  • Serve and scale large language models using modern inference frameworks (e.g., vLLM, SGLang, TensorRT-LLM)
  • Own platform reliability, including on-call responsibilities, incident response, and performance tuning
  • Build and operate Kubernetes-based infrastructure for AI workloads
  • Partner with application teams consuming the AI platform and support production use cases
  • Evaluate and integrate developer productivity tools (Copilot agents, Cursor, etc.) responsibly

Required Experience

  • 5+ years of professional software engineering experience
  • Hands-on experience in AI/ML systems (not just API consumption)
  • Proven experience operating production platforms with:
      • Kubernetes
      • Cloud infrastructure (AWS, GCP, or Azure)
      • Monitoring, alerting, and incident response
  • Prior experience with platform engineering or SRE concepts
  • Strong fundamentals in algorithms, systems design, and debugging

Strongly Preferred

  • Experience serving LLMs at scale using raw or managed GPUs
  • Familiarity with inference optimization and cost-performance tradeoffs
  • Background in:
      • Large tech companies
      • Research labs or higher-ed research environments
      • Enterprise platforms with strict reliability requirements
  • Daily, practical use of GenAI developer tools (Copilot, Cursor, Windsurf, etc.)

What Success Looks Like

  • You can reason about model serving, infrastructure, and failure modes, not just model outputs
  • You are comfortable being on-call for systems you build
  • You understand when to use AI tools — and when not to
  • You can support teams consuming AI platforms without hand-holding

What This Role Is *Not*

  • Not a junior ML role
  • Not a pure research position
  • Not a “prompt engineer” role
  • Not a platform consumer-only position
