High 5 Games Logo

High 5 Games

DevOps Engineer

Reposted 3 Days Ago
Remote
Hiring Remotely in Canada
Mid level
Remote
Hiring Remotely in Canada
Mid level
The DevOps Engineer will design, build, and optimize cloud infrastructure for machine learning operations, manage CI/CD pipelines, and ensure reliability across systems.
The summary above was generated by AI

We’re looking for a DevOps Engineer to help design, build, and optimize the cloud infrastructure powering our machine learning operations. You’ll play a key role in scaling AI models from research to production — ensuring smooth deployments, real-time monitoring, and rock-solid reliability across our Google Cloud Platform (GCP) environment.

You’ll work hand-in-hand with data scientists, ML engineers, and other DevOps experts to automate workflows, enhance performance, and keep our AI systems running seamlessly for millions of players worldwide.

What You’ll Do

  • Manage, configure, and automate cloud infrastructure using tools such as Terraform and Ansible.
  • Implement CI/CD pipelines for ML models and data workflows, focusing on automation, versioning, rollback, and monitoring with tools like Vertex AI, Jenkins, and DataDog.
  • Build and maintain scalable data and feature pipelines for both real-time and batch processing using BigQuery, BigTable, Dataflow, Composer, Pub/Sub, and Cloud Run.
  • Set up infrastructure for model monitoring and observability — detecting drift, bias, and performance issues using Vertex AI Model Monitoring and custom dashboards.
  • Optimize inference performance, improving latency and cost-efficiency of AI workloads.
  • Ensure overall system reliability, scalability, and performance across the ML/Data platform.
  • Define and implement infrastructure best practices for deployment, monitoring, logging, and security.
  • Troubleshoot complex issues affecting ML/Data pipelines and production systems.
  • Ensure compliance with data governance, security, and regulatory standards, especially for real-money gaming environments.

What We’re Looking For

  • 3+ years of experience as a DevOps Engineer, ideally with a focus on ML and Data infrastructure.
  • Strong hands-on experience with Google Cloud Platform (GCP) — especially BigQuery, Dataflow, Vertex AI, Cloud Run, and Pub/Sub.
  • Proficiency with Terraform (and bonus points for Ansible).
  • Solid grasp of containerization (Docker, Kubernetes) and orchestration platforms like GKE.
  • Experience building and maintaining CI/CD pipelines, preferably with Jenkins.
  • Strong understanding of monitoring and logging best practices for cloud and data systems.
  • Scripting experience with Python, Groovy, or Shell.
  • Familiarity with AI orchestration frameworks (LangGraph or LangChain) is a plus.
  • Bonus points if you’ve worked in gaming, real-time fraud detection, or AI-driven personalization systems.

Similar Jobs

9 Days Ago
Easy Apply
Remote
CAN
Easy Apply
Mid level
Mid level
Artificial Intelligence • Edtech • Machine Learning • Software
Support design, deployment, and maintenance of multiregion/multicloud infrastructure on GCP. Build and operate Kubernetes (GKE) clusters, GitLab CI pipelines, GitOps with ArgoCD, Terraform automation, monitoring, and on-call incident response. Integrate agentic AI workflows, observability tooling, and document runbooks to improve reliability and developer productivity.
Top Skills: AnsibleArgocdBashCloudFormationCompute EngineElkGCPGcp Cloud LoggingGcp Cloud MonitoringGitlab CiGitopsGkeGoGoogle Kubernetes EngineGrafanaKubernetesLinux/UnixLlmsPrometheusPythonTerraform
3 Days Ago
Remote
Canada
Mid level
Mid level
Retail • Sales • Software
Build and maintain production-scale infrastructure and automation: CI/CD, release automation, testing environments, monitoring and incident response. Improve scalability, tooling, and developer workflows; collaborate with engineers and QA and mentor peers to raise engineering infrastructure quality.
Top Skills: AWSBazelCC++DjangoGCPGitlabJavaJavaScriptKubernetesLinuxNext.JsPostgresPythonReactReduxShell ScriptingTypescript
4 Days Ago
In-Office or Remote
Mid level
Mid level
Information Technology • Analytics • Consulting • Pharmaceutical
Build and maintain CI/CD pipelines with GitHub Actions; implement and manage IaC via Ansible/Terraform; maintain Docker container workflows; operate AWX/Ansible Tower/Semaphore; improve observability and respond to incidents; produce documentation; follow secure DevOps practices and participate in on-call rotations while growing AWS/CloudOps skills.
Top Skills: AnsibleAnsible TowerAWSAwxAzure MirageBashDockerGitGithub ActionsGoLinuxLokiPythonSemaphoreTerraform

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account