MLOps Engineer

Metova
Full-time
On-site

A leading company in Mexico specializing in accounting software is looking for a highly skilled MLOps Engineer to join the team.

REQUIREMENTS:

  • 4+ years of experience in DevOps, Data Engineering, or MLOps.
  • Fluent technical English.
  • Practical experience with MLOps tools such as MLflow, Kubeflow, Metaflow, SageMaker, or Vertex AI.
  • Experience managing cloud infrastructure on AWS, GCP, or Azure.
  • Experience configuring and maintaining distributed training environments with GPUs.
  • Experience with tools such as Ray, Flyte, Apache Airflow, or Prefect (Nice to Have).

KNOWLEDGE AND SKILLS:

  • Advanced knowledge of Docker, Kubernetes, Helm, and CI/CD (GitHub Actions, GitLab CI, Jenkins).
  • Knowledge of data storage and versioning (DVC, LakeFS, DeltaLake). 
  • Familiarity with event-driven architectures, microservices, and REST/gRPC APIs.
  • Knowledge of Alibaba Cloud and its ecosystem for AI/ML (PAI, ACK, OSS, etc.) (Nice to Have).

RESPONSABILITIES:

  • Design and implement pipelines for training, validating, and deploying machine learning models (CI/CD for models). 
  • Manage and optimize infrastructure for distributed training, parallel processing, and efficient storage. 
  • Collaborate with Data Scientists to operationalize experimental notebooks and turn them into productive services. 
  • Integrate models with APIs and backend architectures, ensuring performance and security.
  • Define standards for reproducibility, validation, and automated testing throughout the ML lifecycle.