BlackLine

Senior Manager, Site Reliability Engineering

Posted 8 Hours Ago

Be an Early Applicant

Hybrid

Pleasanton, CA

Senior level

Hybrid

Pleasanton, CA

Senior level

The Senior Manager, Site Reliability Engineering will lead a team overseeing the FedRamp operations and reliability of BlackLine's Multi-Tenant Accounts Receivable SaaS products hosted in Microsoft Azure. Responsibilities include capacity planning, performance monitoring, incident response, and managing day-to-day operations.

The summary above was generated by AI

Get to Know Us:
It's fun to work in a company where people truly believe in what they're doing!
At BlackLine, we're committed to bringing passion and customer focus to the business of enterprise applications.
Since being founded in 2001, BlackLine has become a leading provider of cloud software that automates and controls the entire financial close process. Our vision is to modernize the finance and accounting function to enable greater operational effectiveness and agility, and we are committed to delivering innovative solutions and services to empower accounting and finance leaders around the world to achieve Modern Finance.
Being a best-in-class SaaS Company, we understand that bringing in new ideas and innovative technology is mission critical. At BlackLine we are always working with new, cutting edge technology that encourages our teams to learn something new and expand their creativity and technical skillset that will accelerate their careers.
Work, Play and Grow at BlackLine!
The successful applicant will be performing work in FedRAMP environments, and therefore, must be a US Citizen.
Make Your Mark:
The FedRamp team plays a critical role in advancing our organization's mission to deliver secure, compliant, and highly reliable cloud solutions. This team is at the forefront of ensuring our systems meet the stringent requirements of the FedRamp program, supporting both our internal needs and those of our customers.
As part of this team, members will have the unique opportunity to design, deploy, build, and support our FedRamp -compliant cloud environment. They'll be instrumental in shaping the future of our infrastructure by applying cutting-edge patterns such as Site Reliability Engineering (SRE), DevOps, and DevSecOps to streamline deployments and enhance overall efficiency.
By embracing these modern approaches, the FedRamp team will not only deliver robust and secure systems but also drive innovation and set a new standard for operational excellence across our cloud platform.
You'll Get To:

Lead and nurture a high-performing team of engineers passionate about large-scale distributed systems for FedRamp initiatives.

Skilled in managing and supporting distributed teams effectively, ensuring seamless collaboration and high performance across remote environments

Foster a culture of collaboration, innovation, and accountability within the team.

D emonstrate ability to cultivate and prioritize a customer-first mindset.

Drive the adoption of SRE, DevOps, and DevSecOps patterns to enhance system reliability, scalability, and security.

Collaborate with cross-functional teams to implement automated solutions for observability , incident response, and deployments.

Partner with our Talent Acquisition team as we recruit, interview, and hire the best engineering talent to join BlackLine's growing SRE FedRAMP team!

Manage engineers to achieve more than they thought possible. You enjoy managing and driving teams to success, and as a Leader, you are fulfilled through the success of others.

Partner with compliance, security, and engineering teams to align objectives and ensure seamless execution of FedRamp projects

Manage a team working on reliability projects, including:

Business Continuity Planning, disaster recovery, backup/restore, RTO, RPO

Application uptime and performance

Capacity management & planning

SLIs, SLOs, error budgets, and monitoring dashboards

Responsible for deployment and operations of large-scale distributed data stores and streaming services

Establishing design patterns for monitoring signals and benchmarking

Establishing and documenting production run books and guidelines for developers

Tooling , runbooks & automation to handle production environments

Incident management and improving MTTD/MTTR for services

Cloud cost optimization

What You'll Bring:

Bachelors/ master's in computer science , Engineering, or related technical field, or equivalent practical experience.

Minimum of 8 to 10+ years of experience in handling large-scale cloud-native microservices platforms.

7+ years of hands-on solid management experience managing teams deploying, handling, and monitoring large-scale public cloud, specifically GCP, AWS or Azure.

Strong understanding of SRE principles, CI/CD patterns, and Infrastructure-as-code (e.g., Terraform, Ansible).

Proven experience in leading a team focused on API development and implementing everything-as-code automation practices.

Skilled in building APIs with clear and efficient API contracts, ensuring seamless integration and consistent communication between systems.

Strong understanding of modern API frameworks and standards, enabling robust, scalable, and maintainable solutions.

Strong hands-on experience in Observability tools in order to build distributed tracing, logging, and metrics for large-scale deployments.

Experience with deployment, operations, and performance management of one or more large-scale databases .

Excellent problem-solving, triaging, and debugging skills in large-scale cloud distributed systems

Prior C#, .NET, Python , Go or Java development experience, preferably in an agile SaaS environment.
Working knowledge of cloud platforms ( Google Cloud, strongly preferred).

We're Even More Excited If You Have:

Familiarity working with and/or managing in compliance environments such as FedRamp , NIST-800-53, GovCloud, or SOC 2 Type 2.

GCP Solutions Architect certification preferred.

Experience with Infrastructure-as-Code using Terraform, CloudFormation, Google Deployment Manager.

Experience with CI/CD frameworks and Pipeline-as-Code such as GitHub, GitHub Actions , Istio, Argo, and Artifactory, etc.

Proven skills to effectively work across teams and functions to influence the design, operations, and deployment of highly available software.

Thrive at BlackLine Because You Are Joining:

A technology-based company with a sense of adventure and a vision for the future. Every door at BlackLine is open. Just bring your brains, your problem-solving skills, and be part of a winning team at the world's most trusted name in Finance Automation!
A culture that is kind, open, and accepting. It's a place where people can embrace what makes them unique, and the mix of cultural backgrounds and varying interests cultivates diverse thought and perspectives.
A culture where BlackLiner's continued growth and learning is empowered. BlackLine offers a wide variety of professional development seminars and inclusive affinity groups to celebrate and support our diversity.

BlackLine is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity or expression, race, ethnicity, age, religious creed, national origin, physical or mental disability, ancestry, color, marital status, sexual orientation, military or veteran status, status as a victim of domestic violence, sexual assault or stalking, medical condition, genetic information, or any other protected class or category recognized by applicable equal employment opportunity or other similar laws.
BlackLine recognizes that the ways we work and the workplace itself has shifted. We innovate in a workplace that optimizes a combination of virtual and in-person interactions to maximize collaboration and nurture our culture. Candidates who live within a reasonable commute to one of our offices will work in the office at least 2 days a week.
Salary Range:
USD $186,000.00 - USD $248,000.00
Pay Transparency Statement:
Placement within this range depends upon several factors, including the applicant's prior relevant job experience, skill set, and geographic location. In addition to base pay, BlackLine also offers short-term and long-term incentive programs, based on eligibility, along with a robust offering of benefit and wellness plans.
Accommodations:
BlackLine is committed to creating an inclusive and accessible experience for all candidates. If you require a reasonable accommodation that would better enable your success during the application or interview process, please complete this form.

Top Skills

Azure

Similar Jobs at BlackLine

BlackLine

Senior Site Reliability Engineer

Be an Early Applicant

8 Hours Ago

Pleasanton, CA, USA

Hybrid

1,810 Employees

Senior level

Apply

1,810 Employees

Senior level

Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI

The Senior Site Reliability Engineer ensures optimal performance and availability of BlackLine's cloud services by managing capacity planning, technical project execution, and software engineering tasks. Responsibilities include collaborating with teams, addressing customer escalations, identifying and resolving performance issues, and developing automation tools to enhance service reliability and efficiency.

BlackLine

Staff I Reliability Engineer - FedRAMP

Be an Early Applicant

8 Hours Ago

Pleasanton, CA, USA

Hybrid

1,810 Employees

Senior level

Apply

1,810 Employees

Senior level

Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI

The Staff Site Reliability Engineer at BlackLine will assess and report on the performance and availability of production applications, create testing frameworks, and develop capacity plans. Responsibilities include optimizing the service experience, automating event identification, establishing KPIs, and mentoring team members. The role requires collaboration across functions and a commitment to continuous learning in a dynamic environment.

BlackLine

Lead Network Engineer

Be an Early Applicant

2 Days Ago

Pleasanton, CA, USA

Hybrid

1,810 Employees

Expert/Leader

Apply

1,810 Employees

Expert/Leader

Cloud • Fintech • Information Technology • Machine Learning • Software • App development • Generative AI

The Lead Network Engineer will oversee network engineering and operational support in a global environment, ensuring compliance with system architecture and security policies, collaborating on infrastructure projects, and maintaining network documentation. Responsibilities include deploying cloud networking solutions, monitoring network performance, and providing escalation support.

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.