Role overview

Own and manage our heavy-compute AI hardware, specifically optimizing workloads for our Nvidia HGX H200 infrastructure.
Deploy, fine-tune, and maintain open-source LLMs, ensuring maximum throughput and minimal latency.
Manage inference engines (e.g., vLLM, TensorRT-LLM) and handle dynamic GPU memory allocation for hundreds of concurrent agent requests.
Provide a flawless, millisecond-response API layer for our "OpenClaw" agent farm (a fleet of 200+ bare-metal Apple Silicon nodes).
Monitor model performance, detect hallucinations, and build synthetic training data pipelines to continuously improve agent accuracy.
Design, scale, and maintain a high-performance, on-premise RAG service and vector database (e.g., Qdrant, Milvus, Milvus/Chroma) to seamlessly serve internal documentation to our LLMs.
Build robust data ingestion and embedding pipelines to ensure internal knowledge bases and documents are updated in real-time for the RAG service.
Work tightly with Product, Operations, and the Growth team to ensure alignment.
Provide the technical foundation for the Growth team's AI-assisted content creation, verification, and contextual validation pipelines.
Build repeatable AI engines that can guarantee linguistic, cultural, and regulatory correctness across dozens of markets simultaneously.
A senior AI/MLOps engineer with proven experience scaling self-hosted LLM infrastructure in a production environment.
Experienced with semantic search, vector databases, and information retrieval techniques (RAG) at scale.
Deeply experienced with Python, PyTorch, CUDA, and modern inference serving frameworks.
Experienced in using AI as a production and verification tool, not a gimmick.
Comfortable working closely with networking and storage architects to eliminate I/O bottlenecks in a ZFS/Linux ecosystem.
Highly structured, data-driven, and execution-focused.
Motivated by building systems that scale, not campaigns that win awards.

Benefits

A senior tech role with direct ownership over a state-of-the-art AI hardware stack.
A fast-scaling international fintech with real infrastructure, licenses, and products.
Competitive salary and employment conditions.
A modern office in Amsterdam Houthavens overlooking ‘Het IJ’.
Daily healthy lunches prepared by our in-house chef.
A culture that values execution, ownership, and long-term thinking.

About the company

At Yoursafe, we believe that everyone deserves access to safe and easy financial services - wherever they are. Our mission is to build financial tools that empower people, especially those who are new to a country or outside the traditional banking system. We combine solid financial expertise with smart technology to make everyday money management simple, secure, and fair.

Tags & focus areas

Used for matching and alerts on DevFound

Fulltime Ai Ai Engineer Generative Ai

AI Engineer (LLM Infrastructure)

Role overview

Benefits

About the company

Tags & focus areas

Ready to Join the Team?