Generative AI Operations Engineer

Ofinno · Reston, VA, US

Actively hiring · Posted 3 months ago

About Ofinno:

Ofinno is a leading research and development lab headquartered in Reston, Virginia, specializing in advancing communication and media standards. Our team’s innovative work has led to significant contributions to technologies such as 5G cellular, Wi-Fi, and media compression. Ofinno holds strategic partnerships and licensing agreements with several of the world’s leading technology companies that use such technologies. At Ofinno, we foster an environment of collaboration and excellence, where researchers can focus on delivering breakthroughs that shape the future of technology.

Position Overview:

At Ofinno, we are committed to pushing the boundaries of innovation in 6G and beyond. Our 6G Innovation Lab is at the forefront of research and development, exploring transformative technologies that shape the future of wireless communication. We are seeking a Generative AI Operations Engineer to spearhead efforts in leveraging Large Language Models (LLMs) and Retrieval-Augmented Generation (RAG) technologies to support our researchers and drive continual improvements in AI architecture.

As a Generative AI Operations Engineer, you will play a pivotal role in empowering the 6G Innovation Lab team by bridging the gap between experimental AI research and production-grade reliability. Your focus will be on architecting resilient LLM infrastructure and automated data ingestion pipelines to ensure that RAG-driven solutions achieve the highest standards of availability, scalability, and long-term maintainability.

Key Responsibilities:

As a Generative AI Operations Engineer, you will:

  • Design and operate scalable infrastructure for LLM-based applications and RAG pipelines supporting research in our 6G Innovation Lab.
  • Build and maintain CI/CD and LLMOps pipelines for model deployment, experimentation, and lifecycle management.
  • Develop and maintain ETL pipelines to ingest, process, and prepare research data for generative AI workflows.
  • Collaborate with researchers and engineers to translate experimental AI models into reliable, production-ready services.
  • Develop and support generative AI workflows using frameworks such as LangChain and implement monitoring and performance optimization for deployed systems.

Qualifications:

  • Bachelor’s degree in Computer Science, Data Science, Artificial Intelligence, or a related discipline.
  • 4+ years of experience in DevOps, cloud infrastructure, data engineering, or MLOps.
  • Experience supporting or deploying LLM or generative AI applications (e.g., RAG, embeddings, vector databases).
  • Experience building AI workflows using frameworks such as LangChain or similar orchestration tools.
  • Familiarity with GCP, Terraform, and container technologies such as Docker or Kubernetes.
  • Proficiency in Python, including experience developing APIs or AI services using frameworks such as FastAPI.

What Else You Should Know:

Our people are our business. We know you have to see it to believe it, but here are some of the perks you can count on:
  • 401(K) matching - We help you plan and save for retirement with a 401(K) matching program that’s available on day one.
  • Free healthcare plans - Ofinno covers full premiums for you and your family on select healthcare plans, including employer HSA contributions if applicable.
  • Free food - Our kitchen is always fully stocked, including lunch, protein bars, fruit, sodas, coffee, and tea.
  • Unlimited paid time off - Our lives are enriched by family time, vacations, and personal time. We offer unlimited paid time off and sick leave.
  • On-campus gym - Unwind, reduce stress and feel great – even when you’re at work.
  • Other benefits, too long to list - Please ask our great People Ops team about additional benefits offered.

What Now?

What are you waiting for? We hope you will click on the link and forward your credentials to us today. All your information will be kept confidential according to EEO guidelines.
