Design and implement SkyPilot's commercial multicloud platform: architect control and data plane separation, tenant/user management, scaling, monitoring, and alerting. Build production-grade cloud-native platform services and APIs using Go, Kubernetes, gRPC, PostgreSQL, and Terraform, prioritizing reliability, security, and great user experience.
SkyPilot is building the future of multicloud AI infra. We are the Berkeley founding team commercializing SkyPilot (9.5K+ GitHub stars, 200+contributors), to enable AI to run on different cloud infrastructures in a portable, cost-optimizing, and highly available way.
SkyPilot is deployed at 100s of companies, including Fortune 500s and top AI-natives (Shopify, Redis, Abridge, Hippocratic, Applied Compute, etc.). In 2025, adoption grew >600%, now launching more GPUs per month than the biggest neocloud’s fleet. Currently in stealth, SkyPilot is founded in 2024 by UC Berkeley PhDs and professors (incl. Databricks cofounders). We’re building a top-tier engineering team, with current talent from Databricks, Google, Crusoe, ByteDance, and PingCap.
What You’ll Do
You’ll play an instrumental role in designing and implementing SkyPilot’s commercial cloud platform, which will power a reimagined multicloud AI experience:
- Architect SkyPilot’s commercial cloud platform from the ground up: Control plane and data plane separation, tenant/user management, control plane scaling, monitoring, alerting.
- Building core, production-grade platform services: Designing and implementing APIs and services in a cloud-native stack (e.g., Go, Kubernetes, microservices), balancing reliability, security, and simplicity.
Ideal Candidates
You are a seasoned engineer with experience building SaaS/cloud platforms from zero to one.
- 6+ years of experience in building SaaS platforms at startups: You have 6+ years of experience building SaaS platforms at startups, from inception to launch to scaling. You are intimately familiar with the best-in-class tools/vendors needed for a SaaS platform.
- SaaS platform expertise: You have hands-on experience building user and organization management, authentication and RBAC, API gateway, usage metering and billing integration, CI/CD pipelines, and other core platform services — using technologies like gRPC, Go, Kubernetes, PostgreSQL, Terraform.
- Great product taste: You believe great products must deliver both a solid platform foundation and a great user experience.
What We Offer
- Competitive equity, compensation, and health benefits.
- Chance to work with some of the best minds in cloud, distributed, and AI systems, with significant autonomy and ownership.
- Front-row seat at the latest open-source infra startup from Berkeley.
Similar Jobs
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead customer retention and adoption for ServiceNow customers by identifying churn risk, partnering with Sales on adoption/retention plans, advising on governance and SLA issues, and improving customer satisfaction through consulting, project oversight, and executive engagement.
Top Skills:
AIAi-Powered ToolsServicenow
HR Tech • Information Technology • Professional Services • Sales • Software
Design, develop, and maintain scalable backend systems for the Payroll product using a microservices architecture. Own the full development lifecycle from technical design to deployment and monitoring, collaborate with product and front-end teams, build and optimize APIs, and work in a continuous delivery environment with automated QA and testing practices.
Top Skills:
APIsAutomated QaAWSContinuous DeliveryJavaKotlinMicroservicesMockingMonitoringMySQLPostgresScalaTddUnit Testing
Artificial Intelligence • Professional Services • Business Intelligence • Consulting • Cybersecurity • Generative AI
Lead selection, implementation, and administration of marketing and sales technologies to drive growth and customer engagement. Manage and coach a team, execute digital marketing and creative campaigns, optimize marketing automation and Salesforce analytics, ensure data quality and validation, and partner with stakeholders to improve processes and deliverables from planning through completion.
Top Skills:
Adobe Data CollectionAdobe Experience Manager (Aem)Adobe Martech PlatformsAnalytics InstrumentationCdpCRMDom ManipulationHTMLJavaScriptMarketing AutomationSalesforce Crm AnalyticsSalesforce Marketing CloudTypescriptWeb Sdk
What you need to know about the Toronto Tech Scene
Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.



