Role overview
- Own and manage our heavy-compute AI hardware, specifically optimizing workloads for our Nvidia HGX H200 infrastructure.
- Deploy, fine-tune, and maintain open-source LLMs, ensuring maximum throughput and minimal latency.
- Manage inference engines (e.g., vLLM, TensorRT-LLM) and handle dynamic GPU memory allocation for hundreds of concurrent agent requests.
- Provide a flawless, millisecond-response API layer for our "OpenClaw" agent farm (a fleet of 200+ bare-metal Apple Silicon nodes).
- Monitor model performance, detect hallucinations, and build synthetic training data pipelines to continuously improve agent accuracy.
- Design, scale, and maintain a high-performance, on-premise RAG service and vector database (e.g., Qdrant, Milvus, Milvus/Chroma) to seamlessly serve internal documentation to our LLMs.
- Build robust data ingestion and embedding pipelines to ensure internal knowledge bases and documents are updated in real-time for the RAG service.
- Work tightly with Product, Operations, and the Growth team to ensure alignment.
- Provide the technical foundation for the Growth team's AI-assisted content creation, verification, and contextual validation pipelines.
- Build repeatable AI engines that can guarantee linguistic, cultural, and regulatory correctness across dozens of markets simultaneously.
- A senior AI/MLOps engineer with proven experience scaling self-hosted LLM infrastructure in a production environment.
- Experienced with semantic search, vector databases, and information retrieval techniques (RAG) at scale.
- Deeply experienced with Python, PyTorch, CUDA, and modern inference serving frameworks.
- Experienced in using AI as a production and verification tool, not a gimmick.
- Comfortable working closely with networking and storage architects to eliminate I/O bottlenecks in a ZFS/Linux ecosystem.
- Highly structured, data-driven, and execution-focused.
- Motivated by building systems that scale, not campaigns that win awards.
Benefits
- A senior tech role with direct ownership over a state-of-the-art AI hardware stack.
- A fast-scaling international fintech with real infrastructure, licenses, and products.
- Competitive salary and employment conditions.
- A modern office in Amsterdam Houthavens overlooking ‘Het IJ’.
- Daily healthy lunches prepared by our in-house chef.
- A culture that values execution, ownership, and long-term thinking.
About the company
At Yoursafe, we believe that everyone deserves access to safe and easy financial services - wherever they are. Our mission is to build financial tools that empower people, especially those who are new to a country or outside the traditional banking system. We combine solid financial expertise with smart technology to make everyday money management simple, secure, and fair.
Tags & focus areas
Used for matching and alerts on DevFound Fulltime Ai Ai Engineer Generative Ai