Abex ExcellencePublic pageOpen

Site Reliability Engineer

KochiEngineering2

Job description

Formatted from the job record and limited to candidate-facing content.

Job Title: Site Reliability Engineer

Location: [Kochi, Kerala / Hybrid] Experience Required: Minimum 2 years Employment Type: Full-time

We are looking for a skilled and proactive Site Reliability Engineer (SRE) with at least 2 years of hands-on experience to join our growing technology team. In this role, you will be responsible for ensuring the stability, scalability, and reliability of our production systems. You’ll collaborate closely with development, operations, and security teams to design robust infrastructure and improve system performance through automation and monitoring.

Key Responsibilities

  • Maintain and improve the availability, reliability, and performance of production services and infrastructure.
  • Build and maintain automated deployment pipelines to ensure smooth and reliable software releases.
  • Implement monitoring, alerting, and logging solutions to detect issues proactively.
  • Troubleshoot production issues, conduct root cause analysis, and implement long-term fixes.
  • Collaborate with development teams to improve system architecture and application resilience.
  • Participate in on-call rotations to handle incidents and minimize downtime.
  • Continuously identify opportunities to optimize infrastructure costs and performance.

Required Qualifications

  • Bachelor’s degree in Computer Science, Engineering, or a related field (or equivalent practical experience).
  • Minimum 2 years of experience in a Site Reliability Engineer, DevOps Engineer, or similar role.
  • Proficiency with cloud platforms (e.g., AWS, GCP, Azure).
  • Experience with containerization and orchestration tools (e.g., Docker, Kubernetes).
  • Strong knowledge of CI/CD pipelines and infrastructure-as-code tools (e.g., Terraform, Ansible).
  • Familiarity with monitoring and logging tools such as Prometheus, Grafana, ELK Stack, or similar.
  • Proficient in at least one scripting language (e.g., Python, Bash, Go).
  • Solid understanding of networking, Linux systems, and security best practices.

Preferred Qualifications

  • Experience working in a 24/7 production environment.
  • Exposure to incident management and post-mortem practices.
  • Knowledge of performance tuning and cost optimization in cloud environments.
  • Familiarity with microservices architectures.

Why Join Us

  • Opportunity to work on cutting-edge infrastructure at scale.
  • Collaborative, fast-paced, and growth-oriented work culture.
  • Competitive compensation and benefits.
  • Chance to shape the reliability strategy of a growing company.

Role summary

Location

Kochi

Team

Engineering

Experience

2

Site Reliability Engineer — Abex Excellence