About the role
Join Snowflake's ML Platform team to build and scale our LLM post-training platform.
- Join Snowflake's ML Platform team to help customers run demanding ML/AI workloads.
- You'll work on Cortex Training, our LLM post-training platform.
Key Responsibilities
- Design and build across the full stack, from public training APIs to the GPU data plane.
- Scale distributed systems for multi-tenant scheduling and capacity-aware routing.
- Drive end-to-end performance under heavy concurrent load.
- Productionize research building blocks for enterprise-scale usage.
Requirements
- 5+ years building and shipping production ML systems.
- Strong distributed systems and infrastructure foundation.
- Familiarity with GPU and LLM infrastructure.
- Demonstrated ability to harden complex systems for reliability, throughput, and cost efficiency.
Tech stack
Python, Microservices
Match insights
Tech: Python, Microservices
Level: Senior