Rajan Agarwal

Writing

Notes and research writeups.

  1. How not to do research (Jan 2026)

    Lessons learned from building multiplayer world models. Built a video tokenizer with spatial attention and a dynamics model with action spaces.

  2. My Experience using Tinker (Nov 2025)

    My experience using Tinker, why you should use it, and how it will make RL research faster.

  3. LLMs can invent their own compression (Nov 2025)

    When compression is posed as a constrained optimization problem, LLMs can use RL to invent their own compression schemes to increase their context window.

  4. Natural Deception with RL (Nov 2025)

    Language models, when trained on hidden-information games, naturally learn deceptive techniques to win the game by any means.