Work

Writing, research, and projects.

Tiny Interaction Models
Jun 2026, Writing
Post-training a small open model to process typing as a live stream, so it can look things up while you write, hold what it finds, and stay quiet the rest of the time.
FrontierSWE
Apr 2026, Project
Ultra long-horizon coding agent benchmark for engineering and research.
How not to do research
Jan 2026, Writing
Lessons learned from building multiplayer world models. Built a video tokenizer with spatial attention and a dynamics model with action spaces.
Nova Act: SOTA browser-use model
Dec 2025, Research
In my internship with Amazon AGI, I worked on RL for a browser-use model. I led model performance on two benchmarks & worked on algorithms/performance.
LLMs can invent their own compression
Nov 2025, Research
As a constrained optimization problem, LLMs can use RL to invent their own compression schemes to increase its context window.
Natural Deception with RL
Nov 2025, Research
Language models, when trained on hidden-information games, naturally learn deceptive techniques to win the game by any means.
My Experience using Tinker
Nov 2025, Writing
My experience using Tinker, why you should use it, and how it will make RL research faster.
LLINK: Cross Lingual Alignment via Encoder Injection
Oct 2025, Research
Research under Cohere Labs for a compute-efficient post training to represent different languages as modalities for multilingual language models.
Cursor Observe
Sep 2025, Project
Let agents actively observe the terminal and proactively fix training runs, security alerts and potential bugs.
PokeOS
Sep 2025, Project
Local tunnel MCP for text agents to interact with your OS. Poke can text others, fill out forms on your browser, play music and fix code.
nanochatVL
Sep 2025, Project
Giving vision to Karpathy's nanochat for <$10 of compute, by implementing LLaVA via SIGLIP encoder injection and fine-tuning on vision Q&A.
Shadow
Aug 2025, Project
Open-source background coding agent. Feature-filled agent that works in a MicroVM with full codebase understanding. 1.5k+ Stars.
Kino AI: Hollywood Video Editing Agent
Apr 2025, Project
Multimodal agent and long-context video understanding to help hollywood editors. Worked on the infra, video retrieval & the agent.
Local VLM on a Samsung Galaxy
Feb 2025, Project
Tricked a Galaxy S24 to run Moondream 3B VLM locally, with quantization + local linux setup on phone. Built at TreeHacks 2025
GPU optimized voxel grids
Jan 2025, Project
Designed and implemented GPU-optimized voxel grids for humanoid design team in Waterloo. Co-led ML team.
Arceus: Distributed Training on Macbooks
Nov 2024, Project
A decentralized cross-device model training system with model and tensor parallelism to reduce compute needed to train large models.
Crowdsourcing Policy
Oct 2024, Project
Generating policy recommendations with AI agents from citizen complaints.
Interoperable Coding Subagents
Sep 2024, Research
One of the first implementations of coding subagents to work together to solve hard, diverse coding problems.
Generative UI
Aug 2024, Project
Worked with Tempo Labs to build generative UI agents in multiple programming languages concurrently.
Recommendations System
Jun 2024, Research
Worked with Aviato to build a recommendations system for recruiting top AI talent amongst 300k+ engineers.
Bracket Bots
Jun 2024, Project
Helping with the software behind Bracket Bots, a self-balancing robot for under $200 that can roam around your house.
NYC Train Safety
May 2024, Project
Worked on safety simulation software for New York trains with custom network protocols.
Multimodal Memory Architecture
Sep 2023, Project
Long-term memory with multimodal knowledge graphs to search 7 days of video and audio within 5 seconds. Winners @ Hack the North 2023.
Shapeshift
Jul 2023, Project
Deep learning analysis of seismic frequencies and local policy to design affordable earthquake-resistant buildings. Worked under RippleX Fellowship, RBCx.
Offline Mesh Network
Nov 2022, Project
An offline mesh network written in Swift via MultiPeer Connectivity to allow for cross-device transfer of files entirely offline, creating a chain of encrypted nodes.

Tiny Interaction Models

FrontierSWE

How not to do research

Nova Act: SOTA browser-use model

LLMs can invent their own compression

Natural Deception with RL

My Experience using Tinker

LLINK: Cross Lingual Alignment via Encoder Injection

Cursor Observe

PokeOS

nanochatVL

Shadow

Kino AI: Hollywood Video Editing Agent

Local VLM on a Samsung Galaxy

GPU optimized voxel grids

Arceus: Distributed Training on Macbooks

Crowdsourcing Policy

Interoperable Coding Subagents

Generative UI

Recommendations System

Bracket Bots

NYC Train Safety

Multimodal Memory Architecture

Shapeshift

Offline Mesh Network