Work with our team on designing and building a GPU platform for 10,000+ users. You'll tackle problems across cold-start times, performance, scalability, robustness, and security - at a scale where naive approaches break and every design decision matters.
Your focus
- Platform development: Build and improve core platform services in Go and Python: the systems that allocate, schedule, and manage GPU resources for our customers.
- Performance and reliability: Profile, benchmark, and optimise critical paths. Investigate and fix issues in production.
Your KPIs
- Code shipped and components owned
- Time-to-resolution on assigned issues
- Ramp speed and expanding scope of ownership over time
