Site Reliability Engineer (Remote - Latam)

Jobgether
Full-time
On-site

This position is posted by Jobgether on behalf of Launchpad Technologies. We are currently looking for a Site Reliability Engineer in Latam.

Join a global team at the intersection of development and operations, where you'll help design, automate, and maintain high-availability systems for leading international clients. This role offers the chance to work remotely while solving complex infrastructure challenges, improving system resilience, and ensuring consistent performance across cloud environments. You'll contribute to mission-critical platforms, leveraging automation, monitoring, and modern DevOps tools in a collaborative, people-first culture that supports continuous learning and professional growth.

Accountabilities:

  • Collaborate with cross-functional teams to ensure system uptime, performance, and reliability.
  • Design and implement robust monitoring and alerting systems to proactively manage infrastructure health.
  • Automate infrastructure provisioning and deployments using Infrastructure as Code (IaC).
  • Manage and support containerized environments (Docker/Kubernetes) in cloud-native setups.
  • Perform root cause analysis, respond to incidents, and minimize service disruptions.
  • Conduct capacity planning and performance optimization for scalable systems.
  • Support CI/CD pipeline maintenance and infrastructure automation workflows.

Requirements

  • Proven experience in Site Reliability, DevOps, or Infrastructure Engineering roles.
  • Strong scripting and automation skills in Python, Bash, or Go.
  • Proficiency with Docker and Kubernetes for container orchestration.
  • Hands-on experience with cloud platforms (AWS, Azure, or Google Cloud).
  • Familiarity with monitoring/logging tools like Prometheus, Grafana, or ELK stack.
  • Solid understanding of performance tuning, fault tolerance, and system observability.
  • Strong communication skills in English and a collaborative problem-solving mindset.

Bonus points for:

  • Knowledge of networking and security best practices.
  • Experience managing incident response and conducting post-mortem reviews.
  • Relevant certifications (e.g., AWS Certified DevOps Engineer, Kubernetes Administrator).
  • Background in designing highly available and fault-tolerant systems.

Benefits

  • 100% remote position with flexible working arrangements.
  • Competitive compensation in USD.
  • Personal hardware setup support for remote work.
  • Paid time off (vacation, study leave, personal time, etc.).
  • Opportunities to collaborate with global teams across North America, Europe, and Asia.
  • Ongoing training and development allowances.
  • Supportive, people-first culture that values your well-being.

Jobgether is a Talent Matching Platform that partners with companies worldwide to efficiently connect top talent with the right opportunities through AI-driven job matching.
When you apply, your profile goes through our AI-powered screening process designed to identify top talent efficiently and fairly.
🔍 Our AI evaluates your CV and LinkedIn profile thoroughly, analyzing your skills, experience, and achievements.
📊 It compares your profile to the job’s core requirements and past success factors to determine your match score.
🎯 Based on this analysis, we automatically shortlist the 3 candidates with the highest match to the role.
🧠 When necessary, our human team may perform an additional manual review to ensure no strong profile is missed.
The process is transparent, skills-based, and free of bias — focusing solely on your fit for the role.
Once the shortlist is completed, we share it directly with the company that owns the job opening. The final decision and next steps (such as interviews or additional assessments) are then made by their internal hiring team.
Thank you for your interest!

#LI-CL1