Nagarro

Associate Principal Engineer, Performance and Site Reliability

Posted Yesterday

Remote

Seeking a talented and motivated Principal Performance and Reliability Engineer to join our dynamic team and contribute to the success of our cloud-based projects where performance, scalability, reliability, and resiliency are critical.

The summary above was generated by AI

Company Description

We are a Digital Product Engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale — across all devices and digital mediums, and our people exist everywhere in the world (15000+ experts across 26 countries, to be exact). Our work culture is dynamic and non-hierarchical. We are looking for great new colleagues. That is where you come in!

Job Description

Must-have skills: Performance Tuning - General Experience, Performance Testing - General Experience, Cloud architecture (Strong), Observability

Job Description: The ISV practice focuses on building new offerings for the diverse market segments in the horizontal tech space, creating new competencies in niche technology areas, and supporting the sales team in making convincing pitches of Nagarro's capabilities in the ISV market. As a rapidly growing company, we are seeking geeks who want to squeeze out the last bit of performance and maintain every second of uptime for an application. We are looking for a talented and motivated Principal Performance and Reliability Engineer to join our dynamic team and contribute to the success of our cloud-based projects, where performance, scalability, reliability, and resiliency are critical to supporting billions of online transactions every day.

Profile requirements:

Strong knowledge of Java/.NET and SQL/NoSQL, event driven architectures
Hands-on experience with Locust, JMeter, Gremlin and other performance and reliability tools
Performance engineering experience with a distinguished track record on technically demanding projects
Experience with performance testing, analysis, troubleshooting, and optimizing enterprise software
BS, MS, or PhD in Computer Science or equivalent work experience
Outstanding attention to detail and communication skills, ability to analyze / interpret technical data, ability to identify / resolve system bottlenecks, ability to identify potential limitations in software architectures, ability to write clear and concise technical documents, excellent troubleshooting and reasoning skills
Understanding of throughput, latency, memory and CPU utilization, concepts of multi-threading, cache, memory management, Java Virtual Machine, JIT
Working knowledge of HTTP load generation / measurement software (e.g. LoadRunner, JMeter, TestMaker, Silk Performer, etc.)
Working knowledge of APM and Observability tools like Splunk, New Relic, Prometheus and Grafana
Working knowledge of AJAX, HTTP, REST, JSON
Working knowledge of database tuning and administration (e.g. Oracle, DB2, Oracle AWR / Statspack)
Working knowledge of operating system design / tuning / tools (e.g. Linux, Solaris, OProfile, VTune)

Roles and responsibilities:

The Performance and Reliability engineering architect must demonstrate the ability to understand novel system architectures, pinpoint problem areas, and develop architectures and succinct test plans across multiple hardware and software tiers.
Design modern, scalable, cloud-native architectures capable of processing billions of online transactions every day
Orchestrate SLO/SLI design workshops and set performance goals with product owners and engineering leaders
Work closely with management, product owners, developers and quality engineers to understand product requirements and design appropriate performance and reliability test plans to verify feature performance and scalability requirements
Provide accurate scoping, clear strategies, and milestones for performance and reliability tests
Communicate status and risks to test completion dates to cross-functional teams, and offer mitigation strategies to ensure on-time delivery of all committed projects
Participate in discussions and planning with development teams
Design and implement scalability, stability, stress, reliability, availability, and longevity tests using industry-standard tools and custom-developed software written in Java or .NET
Review automation implementations and encourage sound design principles
Review test plans, results analysis, and capacity planning guides for in-flight releases
Work closely with the Production Success Performance Team to provide product and automation expertise for use in patches, research projects, sizing, and capacity planning
Coordinate technical presentations and training for capacity planning guides
Report weekly trends highlighting product performance and team status against committed milestones and projects

Top Skills

.NET

Gremlin

Java

JMeter

Locust

NoSQL

SQL

100 University Ave, Toronto, Ontario, Canada, M5J

Similar Jobs

Workiva

Staff Software Engineer (Reliability)

Yesterday

Remote

USA

Senior level

Artificial Intelligence • Cloud • Fintech • Professional Services • Software • Analytics • Financial Services

As a Staff Software Engineer on the Site Reliability team, you will design and develop solutions to enhance system reliability and performance. You'll lead technical initiatives, collaborate with architects, and communicate effectively with stakeholders. The role involves using coding skills in Python, Golang, or Java, as well as leveraging tools like AWS, Kubernetes, and various databases to ensure operational excellence.

Top Skills: Go, Java, Python

Capital One

Sr Lead Software Engineer, Site Reliability - Shopping (Remote-Eligible)

7 Days Ago

Remote

Hybrid

Richmond, VA, USA

Senior level

Fintech • Machine Learning • Payments • Software • Financial Services

As a Sr Lead Software Engineer in Site Reliability, you will lead diverse technology projects, optimize and automate critical services, and collaborate with product managers to deliver cloud-based solutions. You will also ensure performance, regulatory compliance, and reliability improvements, while mentoring junior engineers and staying updated on tech trends.

Top Skills: Go, Java, Node.js, Python

Affirm

Staff Software Engineer, Backend (Reliability Engineering)

11 Days Ago

Easy Apply

Remote

United States

Senior level

Big Data • Fintech • Mobile • Payments • Financial Services

As a Staff Software Engineer in Site Reliability Engineering at Affirm, you will lead the development of backend systems, guiding projects and ensuring operational excellence. You will set technical strategy, enhance system reliability, and foster team growth through mentorship and quality standards.

Top Skills: Kotlin, Python

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.
