Job Summary

The AI/ML Engineer is a core member of the AI/AWS Team, responsible for designing and deploying machine learning models and data pipelines on AWS. This role focuses on the technical setup, configuration, and management of large-scale data ingestion and transformation processes on AWS, leveraging automation to accelerate the data science and ML operationalization (MLOps) journey.

Key Responsibilities

● Data Transformation and ML Pipeline Management

○ Configure and manage AWS services (e.g., AWS Glue, SageMaker Data Wrangler) to set up data connectors and execute large-scale data transformation jobs.

○ Select and execute AI/ML capabilities such as feature engineering, data quality checks, model training, performance analysis, and model deployment pipelines (MLOps).

○ Review and assess model training job outputs, including feature importance reports, data drift metrics, and model performance baselines, to inform deployment decisions.

● Platform and Infrastructure

○ Set up and secure AWS accounts and S3 buckets, and configure the IAM permissions needed to enable secure data transfer and access for ML workflows.

○ Provision and manage target cloud infrastructure for ML model serving and data processing using Infrastructure as Code (IaC) templates (AWS CloudFormation, AWS Cloud Development Kit (CDK), or Terraform).

○ Manage CI/CD (or MLOps) pipelines to facilitate the continuous integration and deployment of models and microservices.

● Model and Data Handling

○ Organize and manage large datasets and required code artifacts—including training data, feature stores, Python scripts, and Jupyter notebooks—into secure data repositories (e.g., S3).

○ Develop and review production-grade model code and associated scripts (e.g., for inference) to ensure performance and maintainability, optionally enabling monitoring tools for model quality and drift detection.

● Model Testing and Validation

○ Generate test artifacts, including model validation metrics and test automation scripts, to support functional and performance testing of deployed ML models.

Required Skills and Experience

● Experience configuring and managing AWS services, specifically Amazon S3 and IAM permissions, and ML services like Amazon SageMaker, within an enterprise environment.

● Technical understanding of machine learning principles, model lifecycle management, and MLOps practices.

● Proficiency with Infrastructure as Code (IaC) tooling, such as AWS CloudFormation, AWS CDK, or Terraform.

● Knowledge of cloud-native development and deployment practices, including microservices, CI/CD, and AWS compute services (ECS, EKS, Lambda, Fargate).

● Familiarity with data transformation and processing methodologies (e.g., Spark, AWS Glue, EMR) and the phases of the ML lifecycle (Data Prep, Training, Tuning, Deployment, Monitoring).

Pay: $115,000.00 - $160,000.00 per year

Work Location: Remote

Job Type: Full-time
