Confluent Logo

Confluent

Staff Software Engineer

Posted Yesterday
Be an Early Applicant
Remote
2 Locations
Expert/Leader
Remote
2 Locations
Expert/Leader
Design and build backend services (Go/Java/Python) to run AI/model inference on streaming data. Own end-to-end features across model lifecycle, inference routing, control plane and serving layers. Ensure production reliability, test coverage, documentation, and participate in on-call. Lead cross-team technical decisions and mentor engineers.
The summary above was generated by AI

We’re not just building better tech. We’re rewriting how data moves and what the world can do with it. With Confluent, data doesn’t sit still. Our platform puts information in motion, streaming in near real-time so companies can react faster, build smarter, and deliver experiences as dynamic as the world around them.

It takes a certain kind of person to join this team. Those who ask hard questions, give honest feedback, and show up for each other. No egos, no solo acts. Just smart, curious humans pushing toward something bigger, together.

One Confluent. One Team. One Data Streaming Platform.

About the Role:

You'll help build Confluent Cloud's AI capabilities — the layer that lets customers bring machine learning and AI agents directly to their real-time data. Instead of moving data out to a separate system to run inference or build an agent, our customers do it in place, on streaming data, as part of the same platform they already use to move and process events at scale.

As an engineer, you'll own delivery of significant pieces of this product — not just writing code, but deciding how a capability should work across the services that make it up. The interesting problems here rarely live in one place: shipping something like inference-on-streaming-data or an AI agent that reacts to live events touches several systems at once — the user-facing API, the services that manage model and agent lifecycle, the control plane that schedules and runs the work, and the serving layer that actually executes inference. You'll be expected to reason across those boundaries, make sound design calls, and get engineers inside and outside the team aligned on the approach.

What You Will Do:
  • Design and build the backend services (primarily Go, Java, and Python) that run AI and model inference on real-time data.

  • Own features end to end — drafting the design, aligning stakeholders inside and outside the team, and driving the decision to a conclusion.

  • Make the technical calls on systems that span teams: model lifecycle, inference routing, and agent execution.

  • Own the quality of what you ship — code, test coverage, documentation, operability, and rollout safety. This is production infrastructure serving live inference, so reliability isn't an afterthought.

  • Make the engineers around you better through code review, design feedback, and being someone the team trusts with ambiguous, cross-cutting work.

  • Participate in on-call for the services your team owns, and help keep the team's processes and rituals healthy.

What You Will Bring:
  • 10+ years of significant experience designing, building, and operating distributed systems or cloud-native backend infrastructure in production.

  • Strong working knowledge of Kubernetes and distributed-systems patterns (control loops, API servers, high-scale control planes), plus the fundamentals — containerization, networking, resource isolation.

  • Proficiency in at least one of Go, Java, or Python, and the willingness to work across all three.

  • A track record of leading cross-team technical work: turning ambiguous requirements into designs others can rally behind.

  • Excellent written and verbal communication — you can write a design doc that aligns people who don't report to you.

What Gives You an Edge:
  • Exposure to model serving, LLM/agent infrastructure, or streaming data systems.

You don't need a background in ML research or model training — this role is about building and operating the platform that serves AI reliably at scale, not inventing the models.

Ready to build what's next? Let’s get in motion.

Come As You Are

Belonging isn’t a perk here. It’s the baseline. We work across time zones and backgrounds, knowing the best ideas come from different perspectives. And we make space for everyone to lead, grow, and challenge what’s possible.

We’re proud to be an equal opportunity workplace. Employment decisions are based on job-related criteria, without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other classification protected by law.

Privacy Statement

Confluent is an IBM subsidiary which has been acquired by IBM and will be integrated into the IBM organization. By proceeding with this application, you understand that Confluent will share your personal information with other IBM affiliates involved in your recruitment process, wherever these are located. More Information on how IBM protects your personal information, including the safeguards in case of cross-border data transfer, are available here.

Confluent Toronto, Ontario, CAN Office

100 University Ave, Toronto, Ontario, Canada, M5J1v6

Similar Jobs

6 Days Ago
Easy Apply
Remote or Hybrid
CA
Easy Apply
Senior level
Senior level
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
Lead architecture and implementation of routing, dispatch, and real-time tracking systems. Design scalable backend services and APIs, integrate with TMS/ERP systems, build dispatcher web interfaces, mentor engineers, and collaborate with product, design, and mobile teams to deliver fleet optimization features at scale.
Top Skills: AndroidAs/400ErpGeospatialGoGraphQLiOSReactReact NativeReal-Time SystemsRoute OptimizationSAPTmsTypescript
10 Days Ago
Easy Apply
Remote
Canada
Easy Apply
Senior level
Senior level
Artificial Intelligence • Blockchain • Fintech • Financial Services • Cryptocurrency • NFT • Web3
Lead architecture and implementation of Coinbase's Risk Platform: build high-throughput, low-latency real-time fraud detection, decisioning, and mitigation systems. Define multi-quarter technical strategy, partner with Data Science/ML/Product/Compliance, implement AI-native agent-driven workflows, and mentor engineers to improve reliability, performance, and scale.
Top Skills: Agent FrameworksEvent-Driven ArchitecturesGenerative AiGraphQLMicroservicesReal-Time DecisioningRest
21 Days Ago
Easy Apply
Remote
Canada
Easy Apply
Senior level
Senior level
Big Data • Fintech • Mobile • Payments • Financial Services
Drive CI efforts, ensuring reliable and scalable development pipelines, while mentoring other engineers and improving developer productivity through tooling and automation.
Top Skills: ArtifactoryAWSBazelBuildkiteGitJavaKotlinKubernetesPython

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account