Unifonic
AI

AI Engineering Lead

Unifonic · القاهرة, C, EG

Actively hiring Posted 3 days ago

Proudly voted a Great Place to Work®, we are a dynamic startup in the SaaS space that is revolutionizing the way businesses communicate. Our team is made up of 500 energetic and passionate Unifones who are dedicated to delivering the best possible experience to 5000+ customer-centric companies.

We pride ourselves on our fun and collaborative work environment, where creativity and new ideas are constantly encouraged. As shareholders in the business, we’re so much more than a group of passionate communicators. We are Unifones. Join our team and be a part of something big!

Meet the team!

Our Engineering team is responsible for designing, developing, and maintaining the systems and technologies that drive Unifonic’s solutions. We work closely with other departments to ensure our products and services meet the needs of our customers. If you are passionate about technology and are excited about working on cutting-edge communication and engagement solutions, we want you on our team.

Our Customer Care Squad transforms customer support from reactive to predictive leveraging state-of-the-art AI, Retrieval-Augmented Generation (RAG), and Large Language Models (LLMs) to provide accurate, real-time, personalized assistance at a massive scale.

Our Customer Care Squad transforms customer support from reactive to predictive leveraging state-of-the-art AI, Agentic AI, Retrieval-Augmented Generation (RAG), and Large Language Models (LLMs) to provide accurate, real-time, personalized assistance at a massive scale.

As an AI Engineering Lead - Conversational, you will draw on deep, hands-on experience in building and delivering large-scale, production-grade conversational AI and Retrieval-Augmented Generation (RAG) solutions. This role is for an AI expert who has genuinely "been there and done that", someone ready to architect, build, and operate a real-time AI customer support platform with a relentless focus on accuracy, reliability, and ultra-low latency. You'll lead a lean, high-impact team, driving the execution and innovation while ensuring production excellence at every layer of the stack

Help us shape the future of communication by:

  • Owning the design and implementation of the AI-driven customer care systems and autonomous multi-agent orchestration workflows.
  • Designing, developing, and scaling state-of-the-art cyclic graph agent networks and multi-agent systems using frameworks like LangGraph, CrewAI, or AutoGen.
  • Optimizing LLM & Agent execution utilizing advanced runtime techniques such as quantization, pruning, batching, token streaming, and semantic caching to ensure ultra-low latency.
  • Owning the solutions alignment of dependencies and service contracts with other teams.
  • Designing, developing, and scaling real-time Retrieval-Augmented Generation (RAG) pipelines integrating state-of-the-art open-source LLMs (Llama 3, Mistral, Falcon, or similar).
  • Implementing scalable, high-performance vector search (Qdrant, Weaviate, Milvus) for robust knowledge retrieval and semantic search.
  • Having awareness of techniques such as quantization, pruning, distillation, batching, and caching for optimizing LLM inference with the minimum response times.
  • Developing and exposing secure, performant APIs via FastAPI/gRPC or others, containerized (Docker), orchestrated (Kubernetes), and fully integrated into automated CI/CD pipelines.
  • Embedding comprehensive monitoring and evaluation (e.g. MRR, Recall@k, NDCG, Faithfulness, latency metrics) and implementing automated regression testing for continuous improvement.
  • Championing and enforcing best practices for data security, compliance (GDPR, Saudi PDPL is a plus), and responsible AI, including PII redaction and end-to-end encryption.
  • Demonstrating mastery of foundational software engineering by writing clean code and architecture, maintainable and testable code, designing robust, modular, and scalable systems; leveraging version control, and implementing comprehensive continuous integration, automated testing, and deployment practices.
  • Leading rigorous design and code reviews, mentoring engineers, and fostering an innovative engineering culture grounded in clean architecture, SOLID principles, and proactive best practices to ensure system reliability, security, and agility.

What you’ll bring:

  • Bachelor’s degree in Computer Science, Data Engineering, Information Systems, or a related field.
  • 5+ years delivering production AI/NLP systems, including 2+ years as a technical lead or senior staff engineer.
  • Proven experience owning real-time conversational AI/RAG platforms at massive scale, serving thousands of concurrent users.
  • Expert proficiency in Java or Python with strong software engineering fundamentals and system-design capabilities.
  • Deep knowledge and hands-on experience with frameworks and technologies: PyTorch, Scikit-learn, Hugging Face, LangChain, LlamaIndex, SpringAI (Optional), vector databases (Pinecone, Weaviate, Milvus), and embedding models.
  • Strong knowledge of Agentic AI design and tools, e.g. LangGraph, CrewAI, tool calling, and reasoning/thinking models.
  • Strong knowledge about context-engineering, and how to design a RAG/chat system memory (long, short, summarized, ...)
  • Strong expertise in low-latency inference optimization and GPU resource management.
  • Solid experience building large-scale data ingestion and processing pipelines (Spark, Flink, Kafka, RabbitMQ).
  • Robust MLOps and deployment expertise (Docker, Kubernetes, MLflow, Kubeflow, Git-based prompt versioning, automated CI/CD).
  • Clear communicator capable of translating complex technical concepts into strategic business value.
  • Expertise in red-teaming practices and machine learning security research, including developing and reinforcing robust defenses against adversarial threats.
  • Arabic & English language proficiency.

As a Unifone you’ll receive a range of benefits:

  • Competitive salary and bonus
  • Unifonic share scheme (we are all owners!)
  • 30 holiday days after the first anniversary
  • Your Birthday off!
  • Spend up to 25 days per year working from anywhere in the world!
  • Paid leave for new parents

Tags & focus areas

Used for matching and alerts on DevFound
Fulltime Ai Engineer Ai

Next step

Ready to Join the Team?

Apply once with DevFound. We'll route your profile to Unifonic and keep you informed when matching AI roles go live.

  • Single profile, multiple curated AI opportunities
  • No spam roles — only vetted AI positions
  • You choose which roles to apply to
Sign up to apply

No CV uploads. We never share your profile without your consent.

Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.