Optimize, compress, and distill large language and vision models for on-device inference. Build pipelines for distillation and hardware-specific compilation, and benchmark performance across NPU/GPU architectures.
We are looking for an AI Specialist Engineer to enhance the performance of large language and vision models for on-device inference. Your expertise will be crucial in developing and deploying cutting-edge AI solutions, ensuring optimal efficiency across diverse hardware architectures.
Responsibilities:
- Compress and optimize large language and vision models for on-device inference.
- Develop pipelines for model distillation and hardware-specific compilation.
- Benchmark performance across various NPU/GPU architectures.
Qualifications:
- Expertise in model distillation, pruning, and 4-bit/8-bit quantization techniques.
- Hands-on experience with TensorRT, ONNX Runtime, and edge deployment.
- Strong C++ and Python skills.
Similar Jobs
Agency • Artificial Intelligence • Blockchain • Web3
Run adversarial tests on language and multimodal models, build guardrails and real-time filters for autonomous tool use, and support RLHF alignment and constitutional AI development to ensure safe AI deployment.
Top Skills:
Adversarial MlGuardrailsJailbreak TaxonomiesLlmsMultimodal AgentsPrompt EngineeringReal-Time FilteringRed-Teaming FrameworksRlhf
eCommerce • Fintech • Hardware • Payments • Software • Financial Services
Lead end-to-end enterprise sales for Square9s upmarket business: craft deal strategy, manage complex technical integrations and multi-stakeholder negotiations, partner with Solutions Engineering, align internal teams, represent the company to executives, and close high-value contracts while influencing product and go-to-market strategy.
Top Skills:
Ai ToolsAPIsPaymentsSaaSSquare
Artificial Intelligence • Cloud • HR Tech • Information Technology • Productivity • Software • Automation
Lead customer retention and adoption for ServiceNow customers by identifying churn risk, partnering with Sales on adoption/retention plans, advising on governance and SLA issues, and improving customer satisfaction through consulting, project oversight, and executive engagement.
Top Skills:
AIAi-Powered ToolsServicenow
What you need to know about the Toronto Tech Scene
Although home to some of the biggest names in tech, including Google, Microsoft and Amazon, Toronto has established itself as one of the largest startup ecosystems in the world. And with over 2,000 startups — more than 30 percent of the country's total startups — Toronto continues to attract new businesses. Be it helping entrepreneurs manage their finances, simplifying business operations by automating payroll or assisting pharmaceutical companies in launching new drugs, the city's tech scene is just getting started.

.png)
