Site Reliability / GitOps Engineer

External company

himalayas, remote, full-time, Worldwide
SRE, GitOps, Linux, IaC, CI/CD, Python, Prometheus, Grafana, Kubernetes

Job description

Canonical is hiring a Site Reliability and GitOps Engineer for its Information Systems (IS) team. The IS team supports and maintains all of Canonical's IT production services, which are used by over 60 million Ubuntu users. This role is an opportunity for an automation-first technologist with a passion for Linux to build a career with Canonical. As an SRE and GitOps engineer, you will be in a unique position to drive operations automation to the next level, both in Canonical's own private clouds and in public clouds, using the best of open source infrastructure as code (IaC) software, CI/CD pipelines, and Canonical's leading products for software operation automation.

Responsibilities include:
- Applying IaC experience to develop infrastructure as code practices within the IS team by constantly increasing automation and improving IaC processes
- Automating software operations for reusability and consistency across private and public clouds
- Developing new features and improving the resilience and scalability of the existing cloud and container portfolio at Canonical
- Maintaining operational responsibility for all of Canonical's core services, networks, and infrastructure
- Setting up and maintaining observability tools such as Prometheus, Grafana, and Elasticsearch
- Designing, implementing, and maintaining monitoring and alerting for various systems and services

Requirements include:
- Deep experience defining operations in code, using version control, peer review, and CI/CD to roll out changes to applications and infrastructure
- A strong modern engineering background, including peer review, unit testing, SCM, CI/CD, and Agile
- Python software development experience
- Practical knowledge of Linux networking, routing, and firewalls
- Hands-on experience administering enterprise Linux servers
- A bachelor's degree or higher

Posted 24/04/2026