Nagarro Logo

Nagarro

Associate Principal Engineer, Performance and Site Reliability

Posted Yesterday
Remote
Remote
Seeking a talented and motivated Principal Performance and Reliability Engineer to join our dynamic team and contribute to the success of our cloud-based projects where performance, scalability, reliability, and resiliency are critical.
The summary above was generated by AI

Company Description

We are a Digital Product Engineering company that is scaling in a big way! We build products, services, and experiences that inspire, excite, and delight. We work at scale — across all devices and digital mediums, and our people exist everywhere in the world (15000+ experts across 26 countries, to be exact). Our work culture is dynamic and non-hierarchical. We are looking for great new colleagues. That is where you come in!

Job Description

Must have Skills : Performance Tuning - General Experience, Performance Testing - General Experience, Cloud architecture (Strong), Observability,

Job Description : The ISV practice is focused on building new offerings targeted towards the diverse market segments in the horizontal tech space, creating new competencies in niche technology areas and supporting sales team to make convincing pitches of Nagarro’s capabilities in the ISV market. As a rapidly growing company, we are seeking geeks who want to squeeze the last bit of performance and maintain every second of uptime for an application. We are looking for a talented and motivated Principal Performance and Reliability Engineer to join our dynamic team and contribute to the success of our cloud-based projects where performance, scalability, reliability and resiliency are critical to support billions of online transactions everyday.

Profile requirements:

  • Strong knowledge of Java/.NET and SQL/NoSQL, event driven architectures
  • Hands-on experience with Locust, JMeter, Gremlin and other performance and reliability tools
  • Performance engineering experiences with a distinguished track record on technically demanding projects
  • Experience with performance testing, analysis, troubleshooting, and optimizing enterprise software - BS, MS, or PhD in Computer Science or equivalent work experience
  • Outstanding attention to detail and communication skills, ability to analyze / interpret technical data, ability to identify / resolve system bottlenecks, ability to identify potential limitations in software architectures, ability to write clear and concise technical documents, excellent troubleshooting and reasoning skills
  • Understanding of throughput, latency, memory and CPU utilization, concepts of multi-threading, cache, memory management, Java Virtual Machine, JIT
  • Working knowledge of HTTP load generation / measurement software (e. g. LoadRunner, JMeter, TestMaker, Silk Performer, etc)
  • Working knowledge of APM and Observability tools like Splunk, NewRelic, Prometheus and Grafana - Working knowledge of AJAX, HTTP, REST, JSON
  • Working knowledge of database tuning and administration (e. g. Oracle, DB2, Oracle AWR / STASPACK)
  • Working knowledge of operating system design / tuning / tools (e. g. Linux, Solaris, OProfile, VTune Roles and responsibilities: The Performance and Reliability engineering architect must demonstrate the ability to understand novel system architectures, pinpoint problem areas, and develop architectures and succinct test plans across multiple hardware and software tiers.
  • Design modern scalable cloud-native architecture capable of transacting billions of online transactions every day
  • Orchestrate SLO/SLI design workshops and set performance goals with product owners and engineering leaders
  • Work closely with management, product owners, developers and quality engineers to understand product requirements and design appropriate performance and reliability test plans to verify features performance and scalability requirements
  • Provide accurate scoping, clear strategies, and milestones for performance and reliability tests
  • Communicate status, risks to test complete dates, and offer mitigation strategies to ensure on-time delivery of all committed projects to cross functional teams
  • Participate in discussions and planning with development teams
  • Design and implement scalability, stability, stress, reliability, availability, and longevity tests using industry standard tools and custom developed software written in Java or . NET
  • Review automation implementations and encourage sound design principles
  • Review test plans, results analysis, capacity planning guides for in-flight releases
  • Work closely with the Production Success Performance Team to provide product and automation expertise for use in patches, research projects, sizing, and capacity planning
  • Coordinate technical presentations and training for capacity planning guides
  • Report weekly trends highlighting product performance and team status against committed milestones and projects

Top Skills

.Net
Gremlin
Java
Jmeter
Locust
NoSQL
SQL

Nagarro Toronto, Ontario, CAN Office

100 University Ave,, Toronto, Ontario , Canada, M5J

Similar Jobs

Yesterday
Remote
USA
Senior level
Senior level
Artificial Intelligence • Cloud • Fintech • Professional Services • Software • Analytics • Financial Services
As a Staff Software Engineer on the Site Reliability team, you will design and develop solutions to enhance system reliability and performance. You'll lead technical initiatives, collaborate with architects, and communicate effectively with stakeholders. The role involves using coding skills in Python, Golang, or Java, as well as leveraging tools like AWS, Kubernetes, and various databases to ensure operational excellence.
Top Skills: GoJavaPython
7 Days Ago
Remote
Hybrid
Richmond, VA, USA
Senior level
Senior level
Fintech • Machine Learning • Payments • Software • Financial Services
As a Sr Lead Software Engineer in Site Reliability, you will lead diverse technology projects, optimize and automate critical services, and collaborate with product managers to deliver cloud-based solutions. You will also ensure performance, regulatory compliance, and reliability improvements, while mentoring junior engineers and staying updated on tech trends.
Top Skills: GoJavaNode.jsPython
11 Days Ago
Easy Apply
Remote
United States
Easy Apply
Senior level
Senior level
Big Data • Fintech • Mobile • Payments • Financial Services
As a Staff Software Engineer in Site Reliability Engineering at Affirm, you will lead the development of backend systems, guiding projects and ensuring operational excellence. You will set technical strategy, enhance system reliability, and foster team growth through mentorship and quality standards.
Top Skills: KotlinPython

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account