M
AI

Vision Deep Learning Engineer - Pytorch

Mentium Technologies Inc. · Anywhere · $115k - $184k

Actively hiring Posted 8 months ago

As a Vision Machine Learning / Software Engineer, you will play a key role in enhancing Mentium's deep learning library for vision AI models on our in-memory compute ultra-low power hardware. Collaborating closely with a small team of software and hardware experts, you will design and refine a client-facing codebase that enables seamless compilation and deployment of vision models for real-time applications. Additionally, you will support the training and testing of models for demonstrations.

A strong understanding of vision model components is essential, including convolutional layers, transformers, activation functions, pooling, normalization, and non-maximum suppression. Familiarity with the structure, training and quantization of models such as YOLO, Mask R-CNN, MobileNet, ConvNeXt, EfficientNet, and Vision Transformers is required. You will work with libraries like Torchvision, Ultralytics, MMCV, Keras, ONNX, and OpenCV, adapting to client needs. Awareness of hardware fundamentals—such as memory transfer, inference time, model size, and precision—is critical for real-time edge deployments.

Proficiency in Python is essential, including expertise in OOP principles, decorators, context managers, and numerical computing libraries like NumPy and pandas. You should demonstrate strong skills in modular development, collaborative coding using GitHub, and producing clear, well-structured documentation to enhance the customer experience.

This role offers an exciting opportunity to work with cutting-edge vision AI technology and make a meaningful impact on the development of a robust, client-facing software library.

Requirements and Qualifications:

  • Educational Background: Master’s or PhD degree in Computer Science, Electrical Engineering, or a related field.
  • Programming Expertise: Proficiency in Python and PyTorch, including object-oriented programming principles, decorators, context managers, and experience with numerical computing libraries such as NumPy and pandas.
  • Vision Model Proficiency: Strong understanding of components like convolutional layers, transformers, activation functions, pooling, normalization, and non-maximum suppression.
  • Model Experience: Familiarity with the structure, training, and quantization of models such as YOLO, Mask R-CNN, MobileNet, ConvNeXt, EfficientNet, and Vision Transformers.
  • Frameworks and Libraries: Experience with libraries like Torchvision, Ultralytics, MMCV, Keras, ONNX, and OpenCV.
  • Hardware Awareness: Understanding of hardware fundamentals, including memory transfer, inference time, model size, and precision, crucial for real-time edge deployments.
  • Development Practices: Proficiency in modular development, collaborative coding using GitHub, and producing clear, well-structured documentation.
  • Soft Skills: Strong problem-solving and analytical skills, excellent communication abilities, and a collaborative mindset to work effectively within a multidisciplinary team.
  • Familiarity with OpenCV programming and video processing pipelines is a plus.
  • Having a knowledge and experience on working with Camera drivers and camera pipelines using hardware accelerators such as ISPs is also a plus.

Why Join Mentium?

At Mentium, you'll be at the forefront of vision AI technology, working alongside a talented team dedicated to innovation. We offer a collaborative environment where your contributions directly influence the development of cutting-edge solutions. Join us to make a meaningful impact in the field of vision AI.

Benefits:

  • Competitive compensation packages
  • Opportunity to work on diverse, cutting-edge AI projects across a range of industries.
  • 401(k)
  • Flexible PTO
  • Full PPO medical, dental, and vision insurance coverage

Tags & focus areas

Used for matching and alerts on DevFound
Pytorch Machine Learning Ai Engineer Remote Keras Transformers Python R Pandas
Common Questions

Frequently asked questions

Quick answers about how DevFound's AI matching, resumes, and referrals work.

DevFound's AI Copilot ingests your profile, goals, and live job data to deliver curated matches in seconds. Every match includes a resume variant, suggested referrals, and interview prep so you can act immediately. The more feedback you provide, the sharper the Copilot becomes.

AI-led job searches shrink the hours spent sifting through boards and formatting resumes. DevFound pairs automation with your personal outreach, so you reserve energy for interviews and negotiation. Traditional networking still matters, but AI gives you a lift before you even send a message.

Modern AI roles expect comfort with production-grade code, data fluency, and practical ML tooling. The strongest candidates pair deep technical chops with storytelling—translating model impact to product, GTM, and exec partners. Continuous learning keeps you ahead as stacks evolve.

DevFound rewards active seekers. Keep your profile fresh, respond to match quality prompts, and enable alerts so you never miss a role. The AI prioritizes companies and teams that align with your feedback, accelerating both introductions and interview invites.

High-density tech hubs continue to host the deepest AI talent pools, yet distributed teams are catching up fast. Use DevFound filters to hone in on onsite, hybrid, or fully remote roles and watch openings expand across time zones.

DevFound aggregates thousands of remote AI openings and flags the nuances—core hours, async culture, and visa needs—up front. The Copilot also recommends how to position your distributed work experience so hiring managers know you can thrive on a remote team.