Alexander Waitz
I’m a CS MS/BS student at Stanford applying to PhD programs for Fall 2026. My research focuses on hardware-software co-design for machine learning algorithms. Currently, I’m working on efficient architectures at Hazy Research.
Things I’ve done, in decreasing order of relevance:
- Designed memory-efficient LLMs with Scaling Intelligence Labs
- Worked on kernels at d-Matrix
- Built profiling tools for Legion
- Studied complexity theory and ML at Oxford
- Biked from San Francisco to Washington, D.C. with Stanford Spokes
- Designed and taught “The Art of Walking”
- DJ’ed at KZSU
Projects
- [New work coming soon! Under review…]
- Test-Time Training for Efficient RL Sequence Modeling
  - TTT for efficient RL.
- Fast Inference with Dynamic Tree Speculative Decoding
  - Entropy-driven approach for faster speculative decoding.
- Megatron
  - Agentic coding tool.
- TestNinja
  - Context-aware test generation.
  - 1st place & “Most Interesting Technical Achievement” at South Park Commons x Meta AI hackathon.
- Gaussian Closet
  - Diffusion in-fill for clothing try-on.
- Ray Tracer
  - Rust ray tracer compiled to WASM.
Contact