The Data Engineer will collaborate with multidisciplinary teams to handle data engineering tasks, including the creation of data pipelines, data transformations, and integration of various data sources. The role requires expertise in cloud platforms, ETL processes, and proficiency in developing and maintaining scalable APIs.
We are seeking a talented and experienced Data Engineer to join our team at Provectus. Our practices span Data, Machine Learning, DevOps, Application Development, and QA, and you will work in a multidisciplinary team of data engineers, machine learning engineers, and application developers. You will tackle a wide range of technical challenges and have the opportunity to contribute to Provectus’ open-source projects, build internal solutions, and engage in R&D activities, all of which support professional growth.
Requirements:
- Experience in data engineering;
- Experience working with Cloud Solutions (preferably AWS, also GCP or Azure);
- Experience with Cloud Data Platforms (e.g., Snowflake, Databricks);
- Proficiency with Infrastructure as Code (IaC) technologies like Terraform or AWS CloudFormation;
- Experience handling real-time and batch data flows and data warehousing with tools such as Airflow, Dagster, Kafka, Apache Druid, Spark, and dbt;
- Proficiency in programming languages relevant to data engineering such as Python and SQL;
- Experience in building scalable APIs;
- Experience in building Generative AI Applications (e.g., chatbots, RAG systems);
- Familiarity with Data Governance aspects like Quality, Discovery, Lineage, Security, Business Glossary, Modeling, Master Data, and Cost Optimization;
- Advanced or fluent English skills;
- Strong problem-solving skills and the ability to work collaboratively in a fast-paced environment.
Nice to Have:
- Relevant AWS, GCP, Azure, Databricks certifications;
- Knowledge of BI Tools (Power BI, QuickSight, Looker, Tableau, etc.);
- Experience in building Data Solutions in a Data Mesh architecture;
- Familiarity with classical Machine Learning tasks and tools (e.g., OCR, AWS SageMaker, MLflow).
Responsibilities:
- Collaborate closely with clients to deeply understand their existing IT environments, applications, business requirements, and digital transformation goals;
- Collect and manage large volumes of varied data sets;
- Work directly with Data Scientists and ML Engineers to create robust and resilient data pipelines that feed Data Products;
- Define data models that integrate disparate data across the organization;
- Design, implement, and maintain ETL/ELT data pipelines;
- Perform data transformations using tools such as Spark, Trino, and AWS Athena to handle large volumes of data efficiently;
- Develop, continuously test, and deploy Data API Products with Python and frameworks like Flask or FastAPI.
Top Skills
Python
SQL