Responsibilities
- Develop and maintain the infrastructure to support machine learning workflows for drug discovery at scale.
- Implement and optimize algorithms for data processing, model training, and model deployment.
- Collaborate with and support the computational scientists at Vilya to help build a world-class computational macrocycle drug discovery pipeline.
- Participate in the development and enhancement of our software platform to ensure that our codebase is reliable, efficient, secure, and scalable.
Basic qualifications
- 1+ year of experience developing and maintaining ML pipelines
- Understanding of general software development concepts, such as Git, containers, testing, and cloud computing
- Experience with modern deep learning frameworks, such as PyTorch or JAX
- Experience with Python scientific computing tools, such as NumPy, SciPy, and pandas
- Experience working with big data pipelines
- Experience working within Linux environments
- Ability to communicate effectively with teammates from diverse backgrounds
- B.S. in computer science (or equivalent industry experience)
Preferred qualifications
- A solid background in machine learning, including linear algebra, statistics, and calculus
- Experience with modern generative modeling frameworks, such as flow matching and diffusion
- Familiarity with techniques for modeling point clouds or graphs (GNNs, equivariant transformers, etc.)
- Familiarity with basic cheminformatics libraries and tools (RDKit, Open Babel, PyMOL, etc.)
- An interest in drug discovery and working with biochemical data
Benefits
- Opportunity to work in a disruptive startup with a talented, experienced, and dedicated team
- 401(k) plan with employer matching
- Stock options
- Excellent medical, dental, and vision coverage
- Open, flexible vacation policy
- Support to attend professional conferences meaningful to your career growth
- Monthly cell phone and internet stipend
- Monthly wellness and commuter stipend
Tags & focus areas
Full-time, Machine Learning, AI