Writing

Notes and research writeups.

Tiny Interaction Models(Jun 2026)
Post-training a small open model to process typing as a live stream, so it can look things up while you write, hold what it finds, and stay quiet the rest of the time.
How not to do research(Jan 2026)
Lessons learned from building multiplayer world models. Built a video tokenizer with spatial attention and a dynamics model with action spaces.
My Experience using Tinker(Nov 2025)
My experience using Tinker, why you should use it, and how it will make RL research faster.
LLMs can invent their own compression(Nov 2025)
As a constrained optimization problem, LLMs can use RL to invent their own compression schemes to increase its context window.
Natural Deception with RL(Nov 2025)
Language models, when trained on hidden-information games, naturally learn deceptive techniques to win the game by any means.