PeopleCaddie
AI

LLM Engineer

PeopleCaddie · · $187k

Actively hiring Posted 7 months ago

Job Title:
LLM Engineer (Contract)

Company:
Big Four Client

Location (hybrid):
San Jose, CA (Bay Area) - 2x/wk at client site

Work Authorization:
U.S. Citizen or Green Card Holder

Pay Rate:
Up to $90 per hour (W2), depending on experience

Duration:
1 Month (with possible extension)

Overview:

We are seeking a highly experienced
LLM Engineer Contractor
to support short-term, high-impact work focused on building, optimizing, and deploying large language models at one of our Big Four clients. The ideal candidate has deep technical expertise across modern LLM architectures, inference systems, and applied machine-learning workflows. This role is fast-paced and hands-on, requiring strong research skills, engineering excellence, and the ability to collaborate across product, data, and ML operations teams. The contractor will contribute directly to model development, performance optimization, and the deployment of LLM-backed features into production environments.

Key Responsibilities:

  • Model Development & Optimization:
  • Design, train, fine-tune, and evaluate LLMs for performance, efficiency, safety, and reliability.
  • Optimize models through techniques such as transfer learning, RLHF, low-rank adaptation (LoRA), quantization-aware training, and distillation.
  • Conduct rigorous benchmarking and ensure alignment with product or research objectives.
  • Systems Integration & Deployment:
  • Build scalable inference pipelines that support high-volume, low-latency LLM serving.
  • Implement infrastructure optimizations including quantization, caching, sharding, and model distillation.
  • Integrate models into applications, APIs, or microservices and collaborate with ML Ops to ensure robust deployment.
  • Research & Cross-Functional Collaboration:
  • Lead experimentation on new model architectures, prompting strategies, retrieval-augmented generation (RAG), and hybrid search pipelines.
  • Work closely with product managers, data engineers, ML ops, and research teams to convert experimental insights into production features.
  • Document findings, communicate results, and contribute to technical roadmaps.

Requirement Skills & Qualifications:

  • Bachelor’s degree in Computer Science, Machine Learning, Engineering, or related field.
  • Minimum of 6 years of experience in machine learning engineering, deep learning, or related fields.
  • Proven hands-on experience training, fine-tuning, and deploying LLMs (e.g., GPT-style, LLaMA-based, or transformer architectures).
  • Strong proficiency in Python, deep learning frameworks (e.g., PyTorch, TensorFlow), and distributed training systems.
  • Experience building scalable ML pipelines and working with modern inference stacks (e.g., Triton, Ray Serve, Hugging Face, ONNX Runtime).
  • Strong understanding of GPU acceleration, optimization, and cloud-native deployment workflows.
  • Excellent communication skills with the ability to work autonomously in a fast-moving environment.

Preferred Skills & Qualifications:

  • Experience with RAG systems, vector databases, and search frameworks (e.g., FAISS, Milvus, Pinecone).
  • Familiarity with model evaluation for alignment, safety, hallucination reduction, and adversarial testing.
  • Prior experience in a research-oriented or applied AI lab environment.
  • Master’s degree or Ph.D. in a relevant technical field.

Tags & focus areas

Used for matching and alerts on DevFound
Contract Machine Learning
Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.