Rajan Agarwal

Work

Writing, research, and projects.

  1. FrontierSWE

    Apr 2026, Project

    Ultra long-horizon coding agent benchmark for engineering and research.

  2. How not to do research

    Jan 2026, Writing

    Lessons learned from building multiplayer world models. Built a video tokenizer with spatial attention and a dynamics model with action spaces.

  3. Nova Act: SOTA browser-use model

    Dec 2025, Research

    In my internship with Amazon AGI, I worked on RL for a browser-use model. I led model performance on two benchmarks & worked on algorithms/performance.

  4. LLMs can invent their own compression

    Nov 2025, Research

    As a constrained optimization problem, LLMs can use RL to invent their own compression schemes to increase its context window.

  5. Natural Deception with RL

    Nov 2025, Research

    Language models, when trained on hidden-information games, naturally learn deceptive techniques to win the game by any means.

  6. My Experience using Tinker

    Nov 2025, Writing

    My experience using Tinker, why you should use it, and how it will make RL research faster.

  7. LLINK: Cross Lingual Alignment via Encoder Injection

    Oct 2025, Research

    Research under Cohere Labs for a compute-efficient post training to represent different languages as modalities for multilingual language models.

  8. Cursor Observe

    Sep 2025, Project

    Let agents actively observe the terminal and proactively fix training runs, security alerts and potential bugs.

  9. PokeOS

    Sep 2025, Project

    Local tunnel MCP for text agents to interact with your OS. Poke can text others, fill out forms on your browser, play music and fix code.

  10. nanochatVL

    Sep 2025, Project

    Giving vision to Karpathy's nanochat for <$10 of compute, by implementing LLaVA via SIGLIP encoder injection and fine-tuning on vision Q&A.

  11. Shadow

    Aug 2025, Project

    Open-source background coding agent. Feature-filled agent that works in a MicroVM with full codebase understanding. 1.5k+ Stars.

  12. Kino AI: Hollywood Video Editing Agent

    Apr 2025, Project

    Multimodal agent and long-context video understanding to help hollywood editors. Worked on the infra, video retrieval & the agent.

  13. Local VLM on a Samsung Galaxy

    Feb 2025, Project

    Tricked a Galaxy S24 to run Moondream 3B VLM locally, with quantization + local linux setup on phone. Built at TreeHacks 2025

  14. GPU optimized voxel grids

    Jan 2025, Project

    Designed and implemented GPU-optimized voxel grids for humanoid design team in Waterloo. Co-led ML team.

  15. Arceus: Distributed Training on Macbooks

    Nov 2024, Project

    A decentralized cross-device model training system with model and tensor parallelism to reduce compute needed to train large models.

  16. Crowdsourcing Policy

    Oct 2024, Project

    Generating policy recommendations with AI agents from citizen complaints.

  17. Interoperable Coding Subagents

    Sep 2024, Research

    One of the first implementations of coding subagents to work together to solve hard, diverse coding problems.

  18. Generative UI

    Aug 2024, Project

    Worked with Tempo Labs to build generative UI agents in multiple programming languages concurrently.

  19. Recommendations System

    Jun 2024, Research

    Worked with Aviato to build a recommendations system for recruiting top AI talent amongst 300k+ engineers.

  20. Bracket Bots

    Jun 2024, Project

    Helping with the software behind Bracket Bots, a self-balancing robot for under $200 that can roam around your house.

  21. NYC Train Safety

    May 2024, Project

    Worked on safety simulation software for New York trains with custom network protocols.

  22. Multimodal Memory Architecture

    Sep 2023, Project

    Long-term memory with multimodal knowledge graphs to search 7 days of video and audio within 5 seconds. Winners @ Hack the North 2023.

  23. Shapeshift

    Jul 2023, Project

    Deep learning analysis of seismic frequencies and local policy to design affordable earthquake-resistant buildings. Worked under RippleX Fellowship, RBCx.

  24. Offline Mesh Network

    Nov 2022, Project

    An offline mesh network written in Swift via MultiPeer Connectivity to allow for cross-device transfer of files entirely offline, creating a chain of encrypted nodes.