Nile Bits
AI

Senior Applied ML Engineer (Speech Audio)

Nile Bits · القاهرة, C, EG

Actively hiring Posted 28 days ago

Responsibilities

  • Benchmark and evaluate TTS and ASR models using Arabic-specific test sets, measuring metrics such as Word Error Rate (WER), naturalness, and dialect coverage.
  • Fine-tune generative models for voice cloning, zero-shot speaker adaptation, and speech synthesis.
  • Build and maintain Arabic-focused data pipelines, including: Audio collection and preprocessing Diacritization (Tashkil) Data cleaning and augmentation
  • Audio collection and preprocessing
  • Diacritization (Tashkil)
  • Data cleaning and augmentation
  • Optimize model inference for production environments using: Quantization KV-cache tuning Streaming inference techniques
  • Quantization
  • KV-cache tuning
  • Streaming inference techniques
  • Integrate and evaluate complete speech-to-speech conversational pipelines.
  • Conduct experiments based on recent research papers and convert findings into production-ready solutions.
  • Collaborate with engineering and product teams to deploy robust and scalable speech systems.

Basic qualifications

  • 5+ years of experience in Machine Learning, Applied AI, or AI Research.
  • Strong programming skills in Python.
  • Extensive hands-on experience with PyTorch and the Hugging Face ecosystem.
  • Proven experience training and fine-tuning neural models for: Text-to-Speech (TTS) Automatic Speech Recognition (ASR) Audio codecs
  • Text-to-Speech (TTS)
  • Automatic Speech Recognition (ASR)
  • Audio codecs
  • Deep understanding of modern speech architectures such as: Whisper Conformer HiFi-GAN Diffusion-based models
  • Whisper
  • Conformer
  • HiFi-GAN
  • Diffusion-based models
  • Experience with audio processing techniques including: Voice Activity Detection (VAD) Speaker Diarization Neural Vocoders
  • Voice Activity Detection (VAD)
  • Speaker Diarization
  • Neural Vocoders
  • Demonstrated ability to implement and adapt research papers into practical production experiments.
  • Strong understanding of Arabic language challenges, including: Diacritization (Tashkil) Dialectal variations Code-switching
  • Diacritization (Tashkil)
  • Dialectal variations
  • Code-switching
  • Experience with inference optimization techniques such as: Quantization Streaming inference NVIDIA TensorRT
  • Quantization
  • Streaming inference
  • NVIDIA TensorRT

Preferred qualifications

  • Experience developing custom NVIDIA CUDA kernels for high-performance model inference.
  • Familiarity with speculative decoding and other advanced acceleration techniques.
  • Experience deploying models at scale in cloud or GPU-based production environments.
  • Contributions to open-source speech or machine learning projects.

Benefits

  • All employees benefits for free (our famous games room, daily breakfast, fruits, coffee and other hot drinks, soft drinks and juices, company days out and parties…).
  • Flexible and comfortable schedule.
  • Social insurance.
  • Paid annual and national vacation.
  • Working remotely.
  • Competitive salaries.
  • Monetary rewards and incentives.
  • Career possibilities with growing team.
  • Open-door management policy.
  • Full Medical insurance.
  • Accommodation and transportation allowance.
  • Friendly environment that values innovation and efficiency.
  • Exciting opportunities for career growth and talent development.
  • Feedback encouragement.
  • Recognition and reward programs.
  • Friendly environment.
  • Fun committees.
  • Fun, smart and creative people.
  • Social benefits.
  • Natural Text-to-Speech (TTS)
  • Real-Time Automatic Speech Recognition (ASR)
  • End-to-End Speech-to-Speech Conversational Systems

Tags & focus areas

Used for matching and alerts on DevFound
Fulltime Remote Machine Learning Ai

Next step

Ready to Join the Team?

Apply once with DevFound. We'll route your profile to Nile Bits and keep you informed when matching AI roles go live.

  • Single profile, multiple curated AI opportunities
  • No spam roles — only vetted AI positions
  • You choose which roles to apply to
Sign up to apply

No CV uploads. We never share your profile without your consent.

Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.