Button

Senior DevOps Engineer - Infrastructure

Reposted 8 Days Ago
Remote or Hybrid
Hiring Remotely in United States
Senior level
As a Senior DevOps Engineer, you'll build and maintain Button's platform, ensuring scalability and stability, while collaborating with engineering teams on best practices and tool development.
The summary above was generated by AI

Button’s mission is to build a better way to do business in mobile. Today, we work with some of the largest and most interesting businesses in the world to connect consumers with what they want at the tap of a button. We build with the consumer experience in mind, have a reputation for paving the future of mobile, and have a good time doing it.

As a Senior DevOps Engineer on the Infrastructure team, you will be responsible for building, maintaining, and evolving our platform. The mission is to make Button’s platform scalable, stable, and operable in a straightforward way. A well-designed platform increases the velocity of every engineer on the team.


You will have direct influence over the platform that runs Button’s services, applications, and data processing systems. You will partner closely with other engineers on the Core and Infrastructure teams to ensure coherent designs for our new and existing products. You will provide and maintain a self-service platform that meets Product Engineering’s system and data needs, so that they can stand up, maintain, and otherwise service the foundation for Button’s products. In this role, you will have the opportunity to greatly influence the vision and participate in planning the roadmap.


AS A DEVOPS ENGINEER, YOU WILL:

  • Expand our system instrumentation and tooling with monitoring, alerting, logging, and tracing for our critical business tasks; you will be responsible for identifying and following through on key system metrics.
  • Build, improve, maintain, and otherwise support business-critical systems.
  • Support new feature development as the go-to partner for Product Engineering's infrastructure and data needs, providing tools and guidance on best practices and on solving problems within our unique constraints.
  • Manage and monitor most aspects of our production serving environment. We're an AWS shop, and we make heavy use of ECS, RDS, and EC2 in production, all managed through Terraform.


WE ARE LOOKING FOR TEAMMATES WHO HAVE:

  • A passion for designing systems and services that are reliable, scalable, and thoughtfully right-sized, with an emphasis on strong documentation and a drive to explore new technologies that elevate Button’s products.
  • Experience building internal tools and scripting solutions. This requires comfort operating across the entire span of a data processing pipeline and the software development cycle (including supporting CI/CD and scaling the platform).
  • Fluency in and around Linux systems. You should be comfortable in a variety of troubleshooting situations that involve the health of the underlying system.
  • A security- and safety-oriented mindset. You have good instincts for how to balance empowering developers versus maintaining a principle of least necessary privilege.
  • An ability to move fast, make decisions, and take a pragmatic approach to any problem. We know there are lots of ways to accomplish the same goal; you’ll find and champion the best approaches for our team.
  • A track record of “leveling up” the team around them, driving impact not just through their own contributions but also by elevating others.
  • Experience and comfort in a production environment.


EXPERIENCE LEVEL:

  • 5+ years of experience supporting and building infrastructure with direct hands-on experience with a variety of tools and frameworks. Today, Button uses EC2, RDS, Aurora, and ECS, all managed through Terraform.
  • Experience with event-driven and queue-driven architectures, serverless, DynamoDB, and Step Functions.
  • Experience with Docker
  • Experience with build technologies such as Make / Pantsbuild / Bazel / Buck2.
  • Experience administering and scaling monitoring and observability solutions (e.g., Grafana, Prometheus, Datadog, New Relic, or similar).
  • Proficiency with CI/CD solutions
  • Proficiency with AWS
  • Experience with Python, Go, or Node.js is a plus
  • Experience with GCP is a plus
  • Experience with AWS CDK is a plus


More About Button

Button has been recognized by BuiltIn, Fortune, Inc., and Crain’s as one of the best places to work in the United States. We believe in continuous growth and learning, and we value curiosity, integrity, and impact in everything we do.

Button provides employees with a RemotePlus workplace, which blends “work from anywhere” with in-person collaboration. Button has a “hub” workspace in New York City as well as team members distributed across the United States and beyond. The salary range for this role is expected to be between $133,000 and $172,000 (the offered salary is based on a number of factors, including skills and experience relative to the job description listed above).

In addition, Button also provides a 401(k) plan with a 3% automatic employer contribution, comprehensive health coverage (with most employee premiums fully covered), One Medical memberships, and generous stipends for lifestyle and connectivity. Employees enjoy unlimited paid time off, two dedicated company-wide weeks off, and a culture that prioritizes balance and well-being.

At Button, we live our values—Learn & Grow, Experiment, Adapt, and Deliver—and we are proud to be a team of people committed to building a better internet, fueled by commerce.

Equal Opportunity Statement

Button is committed to creating an inclusive and equitable workplace. We do not discriminate against qualified applicants or employees based on race, color, religion, gender identity, sexual orientation, pregnancy, national origin, age, marital status, disability, medical condition, veteran status, or any other characteristic protected by law.


Top Skills

AWS
AWS CDK
Datadog
Docker
EC2
ECS
GCP
Go
Grafana
New Relic
Node.js
Prometheus
Python
RDS
Terraform
