Work
Writing, research, and projects.


How not to do research
Jan 2026, WritingLessons learned from building multiplayer world models. Built a video tokenizer with spatial attention and a dynamics model with action spaces.

Nova Act: SOTA browser-use model
Dec 2025, ResearchIn my internship with Amazon AGI, I worked on RL for a browser-use model. I led model performance on two benchmarks & worked on algorithms/performance.

LLMs can invent their own compression
Nov 2025, ResearchAs a constrained optimization problem, LLMs can use RL to invent their own compression schemes to increase its context window.

Natural Deception with RL
Nov 2025, ResearchLanguage models, when trained on hidden-information games, naturally learn deceptive techniques to win the game by any means.

My Experience using Tinker
Nov 2025, WritingMy experience using Tinker, why you should use it, and how it will make RL research faster.

LLINK: Cross Lingual Alignment via Encoder Injection
Oct 2025, ResearchResearch under Cohere Labs for a compute-efficient post training to represent different languages as modalities for multilingual language models.

Cursor Observe
Sep 2025, ProjectLet agents actively observe the terminal and proactively fix training runs, security alerts and potential bugs.

PokeOS
Sep 2025, ProjectLocal tunnel MCP for text agents to interact with your OS. Poke can text others, fill out forms on your browser, play music and fix code.

nanochatVL
Sep 2025, ProjectGiving vision to Karpathy's nanochat for <$10 of compute, by implementing LLaVA via SIGLIP encoder injection and fine-tuning on vision Q&A.

Shadow
Aug 2025, ProjectOpen-source background coding agent. Feature-filled agent that works in a MicroVM with full codebase understanding. 1.5k+ Stars.

Kino AI: Hollywood Video Editing Agent
Apr 2025, ProjectMultimodal agent and long-context video understanding to help hollywood editors. Worked on the infra, video retrieval & the agent.

Local VLM on a Samsung Galaxy
Feb 2025, ProjectTricked a Galaxy S24 to run Moondream 3B VLM locally, with quantization + local linux setup on phone. Built at TreeHacks 2025

GPU optimized voxel grids
Jan 2025, ProjectDesigned and implemented GPU-optimized voxel grids for humanoid design team in Waterloo. Co-led ML team.

Arceus: Distributed Training on Macbooks
Nov 2024, ProjectA decentralized cross-device model training system with model and tensor parallelism to reduce compute needed to train large models.

Crowdsourcing Policy
Oct 2024, ProjectGenerating policy recommendations with AI agents from citizen complaints.

Interoperable Coding Subagents
Sep 2024, ResearchOne of the first implementations of coding subagents to work together to solve hard, diverse coding problems.

Generative UI
Aug 2024, ProjectWorked with Tempo Labs to build generative UI agents in multiple programming languages concurrently.

Recommendations System
Jun 2024, ResearchWorked with Aviato to build a recommendations system for recruiting top AI talent amongst 300k+ engineers.

Bracket Bots
Jun 2024, ProjectHelping with the software behind Bracket Bots, a self-balancing robot for under $200 that can roam around your house.

NYC Train Safety
May 2024, ProjectWorked on safety simulation software for New York trains with custom network protocols.

Multimodal Memory Architecture
Sep 2023, ProjectLong-term memory with multimodal knowledge graphs to search 7 days of video and audio within 5 seconds. Winners @ Hack the North 2023.

Shapeshift
Jul 2023, ProjectDeep learning analysis of seismic frequencies and local policy to design affordable earthquake-resistant buildings. Worked under RippleX Fellowship, RBCx.

Offline Mesh Network
Nov 2022, ProjectAn offline mesh network written in Swift via MultiPeer Connectivity to allow for cross-device transfer of files entirely offline, creating a chain of encrypted nodes.