< Back to vacancies

MLOps Engineer

Worldwide
4+

Experience

Remote

Job Type

B2+

English Level

Who We’re Looking For

Jaxel is seeking a MLOps Engineer.

What You’ll Do

• Cloud Infrastructure: Design, provision, and manage the AWS infrastructure for hosting self-hosted SLMs (e.g., Mixtral, GLM 4.5) using Infrastructure as Code (IaC) tools like Terraform or CloudFormation.
• MLOps Pipeline: Implement and maintain a comprehensive MLOps pipeline for model fine-tuning, training, and deployment, ensuring a repeatable and efficient process.
• Deployment & Scaling: Automate the deployment of the AI models and the agentic application, configuring services for scalability (e.g., Auto Scaling Groups), high availability, and fault tolerance.
• Monitoring & Logging: Set up monitoring, logging, and alerting systems using AWS services (e.g., CloudWatch) to track system performance, model behavior, and resource utilization including observability capabilities built-in in agentic frameworks such as LangGraph.
• Security: Ensure the cloud environment is secure and airgapped by implementing and managing security best practices, including IAM roles, security groups, and network configurations.
• Collaboration: Work hand-in-hand with the AI Engineer to streamline the development-to-production workflow and troubleshoot infrastructure-related issues.

What You’ll Need

• Experience: Proven experience as a DevOps or Cloud Engineer, with a strong focus on AWS.
• Cloud: Extensive, hands-on experience with a wide range of AWS services, including EC2, S3, IAM, VPC, and CloudWatch.
• IaC: Expert-level proficiency with Infrastructure as Code tools such as Terraform, AWS CloudFormation, or Ansible.
• CI/CD: Experience building and managing CI/CD pipelines (e.g., Jenkins, GitLab CI, AWS CodePipeline).
• Containerization: Strong knowledge of container technologies like Docker and Kubernetes.
• Scripting: Proficiency in scripting languages, especially Bash and Python, for automation tasks.
• MLOps: Familiarity with MLOps concepts and the specific challenges of deploying and managing machine learning models.
• Troubleshooting: Exceptional ability to diagnose and resolve complex infrastructure and application issues.

What We Offer
  • Competitive salary.
  • Comfortable work in your local time zone.
  • Flexible work schedule.
  • Professional growth and development.
  • Multicultural working environment.