Home > Find Jobs

Job Search

A tropical beach
SportyBet company logo


LATAM, Central America

Posted on: 11 October 2023





Employee Type


Salary Range


Senior Site Reliability Engineer

Sporty's sites are some of the most popular on the internet, consistently staying in Alexa's list of top websites for the countries they operate in

In addition to our DevOps Team we are building a Site Reliability Team whose purpose is to focus on site reliability and security. It will also involved deployment, configuration, and monitoring, as well as the availability, latency, change management, emergency response, and capacity management of services in production.




  • Work with a team of DevOps/SRE and DBA professionals
  • Improve existing infrastructure and processes currently deployed in as well as streamlining processes deploy to new countries in the future
  • Holistically improve all aspects of our current infrastructure including: reducing costs; streamlining environment provisioning; lowering response times and incorporating the latest techniques and technologies
  • Monitor and maintain the existing cloud infrastructure via autoscaling, automated alerts, and OpsWork and Grafana dashboards
  • Take ownership and responsibility for our cloud operation activities
  • Liaise with external security agencies for annual audits as well as perform our own internal security sweeps
  • Aid in reconfiguring existing architecture to allow for rapid deployments to new countries
  • Mentoring less experienced team members




  • 4+ years SRE/DevOps experience
  • Experience independently leading the planning and deployment of a project
  • Experienced with cloud platforms, especially AWS, including solid knowledge of how to utilize cloud resources to fulfill the demand from other teams and production
  • Familiar with one program language or script language (Python, Java....)
  • Experience managing multiple kubernetes clusters in production (virtualization, orchestration, scalability, security, and high availability), skillset such as Helm, Rancher, ArgoCD
  • Solid networking protocol and cyber security knowledge, especially the TCP / IP stack and HTTP protocol 
  • A strong understanding of cache, including CDN, HTTP cache (CloudFlare, AWS CloudFront)
  • Experienced with CloudNative Monitoring solution in Large distributed system using observation model(Trace, Metric, Logging), skillset such as Prometheus, Jaeger, Loki, ELK, Grafana
  • Excellent troubleshooting skills, including Linux OS issue diagnosis and OS parameter optimization




  • Experience working with other cloud platform is a plus. (GCP, Azure, AliCloud)
  • Familiar with at least one of infrastructure as Code (Terraform, Cloudformation)
  • Design and implement CI/CD workflow is a plus (Jenkins, Github Action)
  • Experience with system automation tools (Ansible, Salt, Chef)
  • Understanding of modern Micro Services and Service Mesh concepts is a plus(Containers, Istio)





  • Quarterly and flash bonuses
  • Flexible working hours
  • Education allowance
  • Referral bonuses
  • 28 days paid annual leave
  • 2 x annual company retreats (Lisbon + Dubai in 2022 / Phuket in Q2 2023 + 1 more TBC!)
  • Highly talented, dependable co-workers in a global, multicultural organisation
  • Payment via world class online wallet system DEEL
  • Top of the line equipment supplied by market leader Hofy
  • We score 100% on The Joel Test
  • Our teams are small enough for you to be impactful
  • Our business is globally established and successful, offering stability and security to our Team Members


Interview Process


  • TestGorilla Aptitude Test 
  • Remote video screening with our Talent Acquisition Team + live ID check
  • Remote 90 min video interview loop with 3 x Team Members (30 mins each)
  • Pre offer call with Talent Acquisition Team
  • ID check via Zinc
  • 24-72 hour feedback loops throughout process


Apply to job