All jobsZoox

Senior AI Inference Engineer - Model Optimization & Deployment

Foster City, CA, US Full-time Posted Apr 11, 2026
The Perception team is pioneering the development of a multi-modality foundation model to drive the next generation of autonomous system intelligence.

As a Model Optimization & Deployment Engineer, you will focus on bringing highly efficient, production-ready large-scale models to our on-vehicle stack. We are looking for experts with hands-on experience in compressing, accelerating, and deploying complex models (LLMs, VLMs, or FMs) for power- and thermal-constrained vehicle SOCs. You will optimize the ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.

via jobs.lever.co

Related jobs

© 2026 NoGigiddy · Commission-based platform

We like the way you work it·nogigiddy·Gotta bag it up·nogigiddy·
We like the way you work it·nogigiddy·Gotta bag it up·nogigiddy·