Brooksource
AI

LLM Engineer

Brooksource ·

Actively hiring Posted 7 months ago

Role overview

We are seeking a highly skilled LLM Engineer to assist in the development of a multi-modal Large Language Model (LLM) pipeline for digitizing geotechnical bore log data. This role is critical to transforming unstructured PDF documents into structured, machine-readable JSON outputs that support downstream analytics, GIS integration, and AI-powered search.

You will work closely with a Project Manager and technical stakeholders at our customer to build, fine-tune, and evaluate a custom LLM solution capable of interpreting complex geotechnical documents across multiple vendors.

Responsibilities

  • Fine-tune a multi-modal LLM (e.g., Pixtral-12B, PaliGemma, Gemma 3) using annotated bore log PDFs and JSON samples.
  • Build preprocessing pipelines for: Page segmentation, Figure isolation, Normalization of units and soil classification.
  • Develop and implement an evaluation framework including Precision/Recall/F1, domain-specific metrics, and JSON schema conformance.
  • Test model generalization on bore logs from 3 additional vendors.
  • Identify and categorize failure cases.
  • Compare performance across vendors and recommend strategies for scaling.
  • Package preprocessing scripts, model artifacts, and evaluation dashboards into a reproducible workflow.
  • Deliver structured JSON outputs and final benchmark reports.
  • Provide all source code and documentation for handoff.

Basic qualifications

  • Proven experience fine-tuning and deploying multi-modal LLMs (e.g., Pixtral, LLaMA, Gemma, etc.)
  • Ollama/llama.ccp, mongodb/non-relational dbs, and ai coding tools (cursor/windsurf/co-pilot.) experience.
  • Experience using OSS models
  • Strong proficiency in Python and ML frameworks (e.g., PyTorch, TensorFlow)
  • Experience with OCR, image preprocessing (OpenCV), and document parsing
  • Familiarity with geospatial data and JSON schema design
  • Ability to work with GPU environments (e.g., A100s) and cloud-based training setups
  • Strong understanding of evaluation metrics and model benchmarking
  • Excellent communication and documentation skills
  • Experience with geotechnical or engineering datasets
  • Familiarity with MongoDB, vector search, and embedding-based retrieval
  • Exposure to MLOps practices and CI/CD for ML pipelines
  • Prior work in AI document ingestion or enterprise-scale data transformation

Tags & focus areas

Used for matching and alerts on DevFound
Ai Machine Learning Mlops Generative Ai Pytorch Tensorflow Fulltime
Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.