Red Hat

Senior Performance and Scale Engineer, OpenShift AI Platform

Posted 11 Hours Ago
In-Office
Toronto, ON
Senior level
Lead performance and scalability strategy for RHOAI components, build automated benchmarking and analysis tools, identify and resolve performance bottlenecks, collaborate with product and engineering, triage customer performance issues, mentor engineers, and represent Red Hat externally.
The summary above was generated by AI

About the Job

Red Hat’s Performance and Scale Engineering team is looking for a highly motivated Senior Software Engineer to join the PSAP (Performance and Scale for AI Platforms) team. In this high-impact role, you will serve as a technical leader, driving the performance, scalability, and efficiency of Red Hat OpenShift AI (RHOAI).

RHOAI is a cornerstone of Red Hat’s AI portfolio, providing a robust platform for managing the full lifecycle of predictive and generative AI (GenAI) models at scale across the hybrid cloud. As a senior member of this team, you will ensure that RHOAI remains the industry’s premier choice for enterprise-grade AI by tuning components ranging from GenAI API servers to distributed training frameworks.

This is a dynamic role for a Senior Software Engineer with a growth mindset who adapts to rapid change, has a strong commitment to open source values, and is willing to learn and apply new technologies. You will be joining a vibrant open source culture and helping promote performance and innovation in this Red Hat engineering team. The broader mission of the Performance and Scale team is to establish performance and scale leadership of the Red Hat product and cloud services portfolio. The scope includes component-level, system, and solution analysis and targeted enhancements. The team collaborates with engineering, product management, product marketing, and customer support, as well as Red Hat’s hardware and software ecosystem partners.

What you’ll do

  • Define and lead the performance and scalability strategy for RHOAI components, including but not limited to GenAI API servers, vector databases, MCP Gateways, and Model Registry.

  • Design and maintain tools and automated frameworks to streamline performance data collection and analysis.

  • Identify bottlenecks in component performance and collaborate with core RHOAI engineering teams to drive performance improvements.

  • Triage and resolve complex performance-related customer cases in collaboration with customer-facing teams.

  • Collaborate with Product Management and Core Engineering to influence the product roadmap based on performance data and industry trends.

  • Provide technical guidance and mentorship to junior engineers. Champion a culture of performance-centric development within the broader PSAP team.

  • Represent Red Hat in industry consortia and at global conferences. Author high-impact technical blogs and white papers to establish Red Hat’s thought leadership.

What you’ll bring

  • Bachelor’s degree in Computer Science or related field 

  • 5+ years of software engineering experience

  • Experience in systems-level performance analysis, profiling, and tuning (CPU, Memory, I/O, and Network).

  • Experience with Kubernetes or OpenShift (containers, pods, and orchestration).

  • Strong Python proficiency in designing complex, maintainable automation software and data analysis pipelines.

  • Experience working in a Linux environment with an understanding of system resources (CPU, Memory, I/O).

  • Understanding of AI concepts (classical ML, GenAI, and agentic AI) and knowledge of the AI lifecycle and MLOps workflows

  • Experience preparing and managing high quality datasets for accurate benchmarking of data-intensive systems

  • Proven ability to translate complex performance metrics into clear, actionable insights that bridge the gap between technical engineering and strategic business objectives for stakeholders at all levels.

The following is considered a plus

  • Master’s or PhD in Computer Science or a related quantitative field.

  • 3+ years of relevant industry experience in performance engineering or distributed/operating systems.

  • Advanced experience using AI-assisted coding and productivity tools to optimize team workflows and accelerate complex debugging.

  • Experience with SQL and NoSQL databases and their performance tuning

  • Significant contributions to open-source projects, particularly in the Kubernetes, MLOps, or AI domains.

  • Experience in applying statistical methods to massive datasets including trend forecasting and anomaly detection

About Red Hat

Red Hat is the world’s leading provider of enterprise open source software solutions, using a community-powered approach to deliver high-performing Linux, cloud, container, and Kubernetes technologies. Spread across 40+ countries, our associates work flexibly across work environments, from in-office, to office-flex, to fully remote, depending on the requirements of their role. Red Hatters are encouraged to bring their best ideas, no matter their title or tenure. We're a leader in open source because of our open and inclusive environment. We hire creative, passionate people ready to contribute their ideas, help solve complex problems, and make an impact.

Inclusion at Red Hat
Red Hat’s culture is built on the open source principles of transparency, collaboration, and inclusion, where the best ideas can come from anywhere and anyone. When this is realized, it empowers people from different backgrounds, perspectives, and experiences to come together to share ideas, challenge the status quo, and drive innovation. Our aspiration is that everyone experiences this culture with equal opportunity and access, and that all voices are not only heard but also celebrated. We hope you will join our celebration, and we welcome and encourage applicants from all the beautiful dimensions that compose our global village.

Equal Opportunity Policy (EEO)
Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, veteran status, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.


Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.

Red Hat supports individuals with disabilities and provides reasonable accommodations to job applicants. If you need assistance completing our online job application, email [email protected]. General inquiries, such as those regarding the status of a job application, will not receive a reply.

Top Skills

Distributed Training Frameworks
Kubernetes
Linux
MLOps
NoSQL
OpenShift
Python
SQL
Vector Databases
