Diverse Consulting Group
AI

AI Engineer (RAG On Prem LLMs)

Diverse Consulting Group · Warszawa, MZ, PL

Actively hiring Posted 17 days ago

As a recruitment company, DCG understands that every business is powered by experienced professionals. Our management style and partnership approach enable us to meet your needs and provide continuous support. Due to our ongoing growth and the large number of recruitment projects we undertake for our partners, we are currently looking for:

AI Engineer (RAG & On Prem LLMs)

Responsibilities:

  • Architect, implement, and optimize end-to-end Retrieval Augmented Generation (RAG) pipelines for enterprise use cases in on-premises environments
  • Design and integrate retrieval mechanisms (e.g. vector databases such as Neo4j) with generative models (e.g. LLAMA 3.2, Mistral)
  • Fine-tune and optimize retrieval and generation components to achieve high accuracy and low latency
  • Implement and customize inference servers using vLLM and LiteLLM for efficient and scalable LLM serving
  • Integrate open-source large language models with proprietary data sources and enterprise APIs
  • Design GPU-optimized, scalable on-prem infrastructure for model training and inference, ensuring security and data governance compliance
  • Collaborate with DevOps teams to containerize workflows using Docker and Kubernetes and automate MLOps pipelines
  • Apply performance optimization techniques such as quantization, pruning, and dynamic batching
  • Monitor system performance, troubleshoot bottlenecks, and ensure high availability
  • Work closely with data engineers and business stakeholders to translate business requirements into technical AI solutions in telco environments

Requirements:

  • At least 3 years of professional experience in ML/NLP roles, including 2+ years working with RAG systems
  • Proven experience deploying and operating LLM‑based solutions in on‑prem or hybrid environments
  • Hands‑on experience with vLLM, LiteLLM, and open‑source LLMs such as LLAMA 3.2, DeepSeek, or Mistral
  • Strong Python skills and experience with frameworks such as PyTorch, Hugging Face Transformers, and LangChain
  • Experience with vector databases (e.g. Neo4j)
  • Familiarity with Linux‑based systems and Red Hat OpenShift
  • Strong problem‑solving and analytical skills
  • Ability to clearly communicate complex AI concepts to non‑technical stakeholders
  • Bachelor's, Master's, or PhD degree in Computer Science, Artificial Intelligence, or a related field
  • Knowledge of English (B2+/C1)

Offer:

  • Private medical care co-financing
  • Sports card
  • Training & learning opportunities
  • Life insurance co-financing

Tags & focus areas

Used for matching and alerts on DevFound
Ai Engineer Generative Ai Ai

Next step

Ready to Join the Team?

Apply once with DevFound. We'll route your profile to Diverse Consulting Group and keep you informed when matching AI roles go live.

  • Single profile, multiple curated AI opportunities
  • No spam roles — only vetted AI positions
  • You choose which roles to apply to
Sign up to apply

No CV uploads. We never share your profile without your consent.

Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.