BenchSci

Software Engineer - Data Engineer

Posted 6 Days Ago
Hybrid
Toronto, ON
Mid level
As a Software Data Engineer, you will develop data pipelines, collaborate with teams, and build maintainable code to support scientific research. You'll utilize LLMs and cloud technologies to enhance data processes.
The summary above was generated by AI
We are looking for a Software Data Engineer to join our growing Data Team! Reporting to
the Engineering Manager, you will evolve our data models in several styles of datastores and
operationalize production-grade data pipelines. As part of this role, you'll collaborate with a
world-class team, experience growth and mentorship, and apply data engineering solutions
to shape the future of scientific discovery.

Pay range: $110,000 - $135,000

We know compensation is an important part of choosing your next role. The range shown reflects our target hiring range, informed by market data, internal equity, and the role’s current scope. Often the mid-range is where we tend to fall, but individual offers may vary based on experience, skills, and the role scope.

You Will:

  • Collaborate with Machine Learning engineers, Full-stack engineers, and Science to solve complex document mining challenges, helping us capture and model additional scientific experiments
  • Scale data pipelines to allow our data to go from research to platform quickly and reliably
  • Work with sources that contain both semi-structured and unstructured data
  • Use your experience to help define and apply best practices for a broad platform of technologies in a cloud-based environment
  • Architect and maintain robust data pipelines that ingest diverse sources and utilize LLMs for high-fidelity entity extraction into structured formats
  • Implement evaluation frameworks to monitor the accuracy, drift, and hallucination rates of extraction models within the production pipeline
  • Lead or consult on the authoring of engineering design proposals following the unified Platform Stream roadmap at BenchSci
  • Leverage a deep understanding of the business context and the team’s goals to unlock independent technical decisions in the face of open-ended requirements
  • Proactively identify new opportunities (from both internal and external sources) and advocate for and implement improvements to the current state of projects
  • Respond to operational issues with urgency and drive that urgency within your own team, owning resolution within your sphere of responsibility
  • Challenge the status quo and propose newer technologies or ways of working

You Have:

  • A degree in Computer Science/Engineering or a related field within science
  • 3+ years of experience working as a software developer in the industry
  • Proficient with Python
  • Proficient with SQL
  • Experience using LLMs for structured data extraction
  • Experience with event-driven architecture with Pub/Sub
  • A track record in building high-quality, maintainable code
  • Experience with cloud computing (for example: GCP, Azure, AWS)

Nice To Have:

  • ML/Data science exposure
  • Worked with Auth0, Terraform
  • Have experience with data warehouse solutions like BigQuery, and databases including AlloyDB and Spanner
  • Have experience with agentic driven development and AI-based tools like Cursor or Claude Code
  • Have experience with building Conversational AI solutions

Benefits and Perks:

  • A great compensation package that includes BenchSci equity options
  • A robust vacation policy plus an additional vacation day every year
  • Company closures for 14 more days throughout the year
  • Flex time for sick days, personal days, and religious holidays
  • Comprehensive health and dental benefits
  • Annual learning & development budget
  • A one-time home office set-up budget to use upon joining BenchSci
  • An annual lifestyle spending account allowance
  • Generous parental leave benefits with a top-up plan or paid time off options
  • The ability to save for your retirement coupled with a company match!

About BenchSci:
BenchSci's mission is to exponentially increase the speed and quality of life-saving research and development. We empower scientists to run more successful experiments with the world's most advanced, biomedical artificial intelligence software platform. 

Backed by Generation Investment Management, TCV, Inovia, F-Prime, Golden Ventures, and Google's AI fund, Gradient Ventures, we provide an indispensable tool for scientists that accelerates research at top pharmaceutical companies and leading academic centers.

Our Culture:
Our culture fosters transparency, collaboration, and continuous learning. 

We value each other's differences and always look for opportunities to embed equity into the fabric of our work. We foster diversity, autonomy, and personal growth, and provide resources to support motivated self-leaders in continuous improvement. 

You will work with high-impact, highly skilled, and intelligent experts motivated to drive impact and fulfill a meaningful mission. We empower you to unleash your full potential, do your best work, and thrive. Here you will be challenged to stretch yourself to achieve the seemingly impossible. 

Diversity, Equity and Inclusion: We're committed to creating an inclusive environment where people from all backgrounds can thrive. We believe that improving diversity, equity and inclusion is our collective responsibility, and this belief guides our DEI journey. Learn more about our DEI initiatives.

Accessibility Accommodations: Should you require any accommodation, we will work with you to meet your needs. Please reach out to [email protected].

Top Skills

AlloyDB
Auth0
AWS
Azure
BigQuery
GCP
LLMs
Pub/Sub
Python
Spanner
SQL
Terraform

BenchSci Toronto, Ontario, CAN Office

25 York St, Suite 1100, Toronto, Ontario, Canada, M5J 2V5
