Roche Logo

Roche

Principal Data Scientist

Posted 14 Days Ago
Be an Early Applicant
In-Office
Mississauga, ON
Senior level
In-Office
Mississauga, ON
Senior level
Lead development of algorithms and production pipelines for Roche's SBX sequencing technology. Provide technical direction to data scientists and bioinformatics engineers, build ML/DL models for signal-to-sequence tasks, implement and optimize sequence analysis algorithms, and architect scalable, reproducible workflows on HPC and cloud using Nextflow, SLURM, and MLOps practices.
The summary above was generated by AI

At Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections,  where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure everyone has access to healthcare today and for generations to come. Join Roche, where every voice matters.

The Position

A healthier future. It’s what drives us to innovate. To continuously advance science and ensure everyone has access to the healthcare they need today and for generations to come.

Creating a world where we all have more time with the people we love.

That’s what makes us Roche.

We are seeking a visionary and authoritative Principal Data Scientist to serve as a technical lead for Roche’s proprietary sequencing technology, SBX.

In this pivotal role, you will sit at the intersection of discovery and engineering. You will drive exploratory research to decode complex nanopore signal data, develop novel algorithms for DNA sequence analysis, and architect industrial-grade production pipelines. You will provide technical leadership to a cross-functional squad of Data Scientists and Bioinformatics Software Engineers, ensuring that cutting-edge AI/ML models are successfully translated into robust, scalable software solutions on HPC infrastructure.

As a Principal on the team, you will define the analytical strategy for SBX data. You will move beyond simple analysis to build the infrastructure and algorithmic core that allows our sequencing technology to scale.

The Opportunity

  • Provide technical direction and mentorship to hybrid teams of Data Scientists and Bioinformatics Software Engineers.

  • Establish best practices for code quality, collaborative development, and model lifecycle management across diverse teams.

  • Lead the development of algorithms for DNA sequence analysis, including basecalling and post-primary analyses.

  • Innovate on bioinformatics methods like string matching, graph assembly, and Hidden Markov Models to address SBX data challenges.

  • Design and deploy advanced deep learning models, such as Transformers, CNNs, and RNNs/LSTMs, for analyzing electrical signal data and predicting sequencing outcomes.

  • Advocate for MLOps practices to ensure model reproducibility, version control, and monitoring in production environments.

  • Architect scalable workflows using tools like Airflow and Nextflow for research exploration and production deployment.

  • Manage and optimize HPC workloads using SLURM, while writing Bash and Python scripts to integrate complex systems efficiently.

Who You Are

  • MS/Ph.D. in Bioinformatics, Computer Science, Computational Biology, Physics, or a related discipline.

  • 5+ years of post-PhD industrial experience, in similar fields

  • Deep theoretical and practical knowledge of algorithms used in DNA sequence analysis (e.g., dynamic programming, BWT, de Bruijn graphs, HMMs) and experience implementing them from scratch or optimizing existing implementations.

  • Expert-level proficiency in applying Machine Learning and Deep Learning frameworks (PyTorch, TensorFlow, Keras) to biological data. Experience with supervised/unsupervised learning and sequence modeling is essential.

  • Advanced proficiency in Linux/Unix environments, including complex Bash scripting and workload management on HPC clusters using SLURM.

  • Mastery of workflow management systems, specifically Nextflow (DSL2), and experience deploying pipelines in cloud or cluster environments.

  • Expert-level proficiency in Python and a strong command of software engineering principles (OOP, Unit Testing, CI/CD, Git).

Preferred:

  • Deep experience analyzing raw current traces/signal data from nanopore sequencing platforms .

  • proficiency in C++ and CUDA for accelerating critical algorithm components or custom kernels.

  • Extensive experience with Docker/Singularity/Apptainer for reproducible science.

Relocation benefits are not available for this posting.

The expected salary range for this position based on the primary location of Mississauga is 136,936.00 and 179,728.50 of hiring range. Actual pay will be determined based on experience, qualifications, and other job-related factors as determined by the company.

We use artificial intelligence to screen, assess or select applicants for this role.

This posting is for an existing vacancy at Hoffmann-La Roche Ltd.

Who we are

A healthier future drives us to innovate. Together, more than 100’000 employees across the globe are dedicated to advance science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact.


Let’s build a healthier future, together.

Roche is an Equal Opportunity Employer.

Top Skills

Python,C++,Cuda,Bash,Pytorch,Tensorflow,Keras,Airflow,Nextflow (Dsl2),Slurm,Docker,Singularity,Apptainer,Git,Linux/Unix,Hpc,Mlops

Similar Jobs

5 Days Ago
In-Office
Toronto, ON, CAN
Senior level
Senior level
Financial Services
Lead development of advanced ML/AI solutions: design large-scale data pipelines, build and deploy ML/deep learning and LLM models, collaborate with product and analytics teams, deliver data-driven recommendations, and scale MLOps processes.
Top Skills: AWSAzureGenaiLlmLstmMlopsPower BIPyTorchTensorFlowXgboost
2 Hours Ago
Easy Apply
Remote or Hybrid
Toronto, ON, CAN
Easy Apply
Junior
Junior
Artificial Intelligence • Cloud • Computer Vision • Hardware • Internet of Things • Software
The Customer Success Manager will support clients in improving their operations using the IoT platform by developing customized success plans and fostering long-term relationships.
Top Skills: Internet Of Things (Iot)SaaS
2 Hours Ago
Easy Apply
In-Office or Remote
8 Locations
Easy Apply
Entry level
Entry level
Greentech • Hardware • Internet of Things • Machine Learning • Software • Business Intelligence • Agriculture
Halter seeks expressions of interest for various roles across teams like Engineering, Product, Hardware, Sales, and Support. Applicants should be passionate about impactful work and problem-solving. A cover letter is required to express interest and qualifications.

What you need to know about the Toronto Tech Scene

Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account