Writing
Notes and research writeups.
- How not to do research (Jan 2026)
Lessons learned from building multiplayer world models, including a video tokenizer with spatial attention and a dynamics model with action spaces.
- My Experience using Tinker (Nov 2025)
My experience using Tinker, why you should use it, and how it will make RL research faster.
- LLMs can invent their own compression (Nov 2025)
When compression is posed as a constrained optimization problem, LLMs can use RL to invent their own compression schemes to increase their context window.
- Natural Deception with RL (Nov 2025)
Language models, when trained on hidden-information games, naturally learn deceptive techniques to win the game by any means.