Roblox Logo

Roblox

Senior SRE, Compute Orchestration

Posted 5 Days Ago
Be an Early Applicant
San Mateo, CA
Senior level
San Mateo, CA
Senior level
As a Senior Site Reliability Engineer on the Infra Compute Orchestration team, you will create and maintain the infrastructure for Roblox's private cloud. Responsibilities include developing fault-tolerance systems, automating processes, creating performance monitoring tools, and analyzing system designs for readiness.
The summary above was generated by AI

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators. 

At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there. 

A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.

What You’ll Do:

As a Site Reliability Engineer (SRE) on the Infra Compute Orchestration (ICO) team, you will create, support, and evolve the infrastructure at Roblox as we build out Roblox's private cloud. ICO's mission is to own and manage our underlying orchestration systems along with elements of service discovery, secrets management and related software layers. 

You Will:

  • Create systems & libraries that promote fault-tolerance and resilience– like retries, circuit breakers, and adaptive concurrency limits.
  • Build, automate and standardize process automation to create a "golden path" of tooling and platform support that powers the fundamental Roblox ecosystem.
  • Create tooling that provides production guardrails, for example evaluating release candidate capacity with load testing tooling before deploying to production.
  • Create performance monitoring services and observability towards understanding capacity issues and platform degradations.
  • Create tooling that monitors production services and their changes, like generalized canarying services with alerting.
  • Analyze systems and system designs for production readiness

You Have:

  • Experience: you have a BS degree (or equivalent professional experience) in Computer Science or related engineering field with proven track record including at least 6 years as an SRE or Software Engineer.
  • Passion for systems: You have experience and good habits around building software and tools and getting them adopted. Your system's focus advises a view of code needing to be deeply reliable.

You Are:

  • A Partner: You know that the best tools integrate broadly with the tooling ecosystem. You approach partners and processes with curiosity and seek to understand a problem deeply before you start coding.
  • A Coder: you have experience writing common programming languages (e.g., Go, Java, C#, Rust).
  • Passionate about problem-solving, finding creative work solutions, and addressing unexpected challenges as part of a team.
  • Problem Solver: you ask the right questions to tackle issues within your expertise and you use data to test your theories.
  • Planner: You have experience in large project lifecycles. You have experience working in sprints, breaking down complex tasks into achievements, and reporting status to keep project scheduling accurate.

For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits.

Annual Salary Range

$233,840$283,780 USD

Roles that are based in our San Mateo, CA Headquarters are in-office Tuesday, Wednesday, and Thursday, with optional in-office on Monday and Friday (unless otherwise noted).

You’ll Love: 

  • Industry-leading compensation package
  • Excellent medical, dental, and vision coverage
  • A rewarding 401k program
  • Flexible vacation policy (varies by exemption status)
  • Roflex - Flexible and supportive work policy 
  • Roblox Admin badge for your avatar
  • At Roblox HQ: 
    • Free catered lunches five times a week and several fully stocked kitchens with unlimited snacks
    • Onsite fitness center and fitness program credit
    • Annual CalTrain Go Pass

Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations for all candidates during the interview process.

Top Skills

C#
Go
Java
Rust

Similar Jobs at Roblox

Be an Early Applicant
18 Hours Ago
San Mateo, CA, USA
Hybrid
2,500 Employees
Senior level
2,500 Employees
Senior level
Computer Vision • Gaming • Software • Virtual Reality • Web3 • Metaverse
As a Senior Software Engineer in the Edge Team at Roblox, you will support and scale worldwide Edge Data Centers, build features for the Kubernetes control plane, and help re-architect services to be Kubernetes-native. Your work will involve creating an Edge Computing Platform to meet various workloads and ensuring reliable production systems.
Be an Early Applicant
18 Hours Ago
San Mateo, CA, USA
Hybrid
2,500 Employees
Expert/Leader
2,500 Employees
Expert/Leader
Computer Vision • Gaming • Software • Virtual Reality • Web3 • Metaverse
As a Principal Software Engineer on the Edge Team at Roblox, you will develop and innovate features on a Kubernetes-based control plane to support edge data centers and enhance automation. Responsibilities include scaling production servers, re-architecting game play servers, and creating an Edge Computing Platform for various workloads.
Be an Early Applicant
18 Hours Ago
San Mateo, CA, USA
Hybrid
2,500 Employees
Senior level
2,500 Employees
Senior level
Computer Vision • Gaming • Software • Virtual Reality • Web3 • Metaverse
As a Principal Software Engineer at Roblox, you will set the technical vision for application networking, execute plans, deliver high-quality code, influence networking and infrastructure directions, and mentor engineers. You will tackle unique technical challenges and contribute to creating safer shared experiences for users.

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account