Very Good Security

Sr. Infrastructure Engineer

Reposted 10 Days Ago

Remote

Hiring Remotely in Canada

Senior level

Remote

Hiring Remotely in Canada

Senior level

The role involves architecting and maintaining scalable infrastructure, leading incident management, automating monitoring, and collaborating with cross-functional teams to improve operational processes.

The summary above was generated by AI

VGS is the world's leader in payment tokenization. Large banks, aspiring fintechs, and growing merchants embed our universal token vault into their technology stack to manage the complexities of payment data tokenization across processors and networks, open banking, card issuance, omnichannel loyalty, PCI compliance, payment orchestration, and more. We empower our clients and partners by tokenizing sensitive payment data, limiting compliance scope, and consolidating payments to unlock revenue and business opportunities.

VGS provides processor-agnostic tokenization solutions via secure universal token vaults, iframes, mobile SDKs, tokenization proxies, APIs, and data orchestration tooling to support payment acceptance, card issuance, PII and bank account tokenization, and other payments value-added services. Some of the use cases we enable include multi-processor Network Tokenization, Account Updater, payment orchestration, secure settlement file processing, 3DS, and Risk provider connectivity.

We are looking for a well-versed, passionate Engineer who wants to play a key role in site reliability engineering and cloud operations of our global cloud infrastructure.

We’re seeking individuals with creative problem-solving, enthusiasm for new technologies, and a desire to contribute to our product. You will likely be successful in this role if you identify with the following traits: attention to detail, problem solver, customer-oriented, versatile, resilient, and confident. If all of this sounds interesting to you, we’d love to hear from you.

What you will be doing at VGS…

Architect and maintain scalable, reliable infrastructure: Design and optimize infrastructure for high availability, fault tolerance, and performance across distributed systems.
Lead incident management and root cause analysis: Own incident response processes, ensure swift resolution of issues, and drive post-incident improvements to prevent recurrences.
Service monitoring and automation: Build and maintain automated monitoring, alerting, and healing systems that improve system health, reduce manual intervention, and minimize downtime.
Performance tuning and capacity planning: Identify bottlenecks and optimization opportunities, and implement scaling strategies to handle traffic spikes and growing workloads efficiently.
Collaborate with cross-functional teams: Work closely with software engineers, product teams, and DevOps to enhance system reliability and delivery pipelines.
Improve operational processes: Champion continuous improvement initiatives in deployment, scaling, and performance testing, while advocating for the adoption of SRE best practices across the organization.
Mentorship and leadership: Provide technical mentorship to junior engineers, contribute to strategic decisions around infrastructure, and ensure best practices are implemented at scale.
Be proactive and innovative: we rely on your feedback to build a world-class product.
Be a part of a team that believes in the core values of transparency, collaboration, grit, and humility; in going above and beyond what is required to do the right thing for our customers and the company; and in having fun while doing all this!

What we are looking for from you (Requirements)...

Proven experience in Infrastructure/SRE roles, with a track record of managing production systems in complex, large-scale environments.
Strong proficiency in AWS, including infrastructure-as-code (Terraform, CloudFormation, etc.).
Solid understanding of cloud-native architecture, Linux Systems, microservices, Infrastructure-as-code (Terraform, CloudFormation, CDK), CI/CD (CircleCI, GitHub Actions, Argo), GitOps, Authentication and Authorization, APIs and API Gateway, Docker, Kubernetes (EKS), Kafka (MSK), Java, Spring Framework, Python, and AWS services.
Strong plus if you are a database wiz.
Expertise in monitoring and observability tools like Prometheus, Grafana, Open Telemetry, New Relic, or similar tools to measure system health and performance.
Programming and scripting experience in languages such as Python, Go, Bash, or other relevant languages used in automating infrastructure.
Solid understanding of networking, security, and load balancing in cloud-native environments.
Strong communication and collaboration skills, with the ability to lead cross-functional initiatives and mentor junior team members.
Experience with incident management and disaster recovery best practices.
Strong written and verbal communication skills.

What you get from us...

• Flexible work hours and flexible PTO

• Competitive health benefits

• VGS stock options

• 401k plan, with employer matching 4% and immediate vesting (available only for US employees)

• Life & disability insurance

• Pre-tax flexible spending accounts, dependent and healthcare FSA (available only for US employees)

• Global parental leave program

• Employee Assistance Program

• Home Internet reimbursement

• New hire home office set-up allowance

• Professional learning reimbursement

At VGS, we have a remote-first philosophy because we believe flexibility leads to great work and a healthy work-life balance. That said, if you live within 30 miles of one of our office locations, you’ll be on a hybrid schedule with some in-person time—because we know there’s real value in coming together.

We’re not about being in the office every day—but we are about connection, collaboration, and the energy that comes from a great brainstorm, a team lunch, or celebrating a big win in person.

We consider applicants without regard to race, color, national origin, sex, age, religion, sexual orientation, gender identity, veteran status, marital status, physical or mental disability, or other protected classes under all local, state, and federal laws and ordinances (AA/EOE/W/M/Vet/Disabled).

Qualified applicants with arrest and conviction records will be considered for the position in accordance with the San Francisco Fair Chance Ordinance.

Visa Sponsorship. The Company does not provide visa sponsorship for this role. Candidates must be legally authorized to work in the United States at the time of hire and throughout their employment. Individuals with temporary visas such as E, F-1 (including those with OPT or CPT), H-1, H-2, L-1, B, J, or TN, or who need sponsorship for work authorization now or in the future, are not eligible.

Please note we are currently only hiring in the following states...

California, Colorado, Connecticut, Florida, Illinois, New York, North Carolina, Oregon, Texas, Virginia, and Washington

Top Skills

AWS

Ci/Cd

CloudFormation

Docker

Gitops

Grafana

Java

Kafka

Kubernetes

New Relic

Open Telemetry

Prometheus

Python

Spring Framework

Terraform

Similar Jobs

Webflow

Staff Engineer

14 Days Ago

Easy Apply

Remote

Easy Apply

Senior level

Artificial Intelligence • Enterprise Web • Software • Design • Generative AI

As a Senior Staff Engineer at Webflow, you'll architect scalable AI products, partner with leadership for technical strategy, and mentor engineers to elevate architectural standards.

Top Skills: AWSGCPGoKubernetesNode.jsPulumiTerraformTypescript

Affirm

Senior Software Engineer

20 Days Ago

Easy Apply

Remote

Canada

Easy Apply

Senior level

Big Data • Fintech • Mobile • Payments • Financial Services

Lead and deliver streaming infrastructure initiatives: design and scale real-time data pipelines, collaborate with product and stakeholders, ensure operational availability and observability, drive code/design quality, mentor engineers, and own quarterly team goals.

Top Skills: Python,Kotlin,Aws,Mysql,Kubernetes,Confluent Platform,Schema Registry,Tableflow,Spark,Samza,Flink,Beam,Kafka

ClickHouse

Senior Software Engineer

12 Days Ago

Easy Apply

Remote

Canada

Easy Apply

Senior level

Database • Analytics

The Senior Software Engineer will architect and build scalable cloud infrastructure, collaborate with teams, and improve system security and performance.

Top Skills: AWSAzureC/C++CloudFormationEnvoyGCPGoIstioJavaKubernetesTerraform

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.