This fall, I will work on RL for web agents as a Research Engineer at Amazon AGI Lab. Previously, I built multimodal video editing agents for hollywood at Kino AI and low-level train safety systems at Hitachi Rail.
I am a deeply technical person. I'm constantly building, learning and breaking things. I'm obsessed with learning how things work and designing novel solutions to problems I can't get out of my head. Right now, I'm most curious about multimodal models and coding agents.
Research projects, experiments, and creative applications.
Open-source background coding agent with 1.2k stars on GitHub. Feature-filled agent that works in a MicroVM with full codebase understanding.
Multimodal agent and long-context video understanding to help hollywood editors. Worked on the the most powerful video retrieval and editing agent.
A decentralized cross-device model training system with model and tensor parallelism to reduce compute needed to train large models.
Leading behaviour and interaction software for a humanoid robot design team in Waterloo. Built GPU-optimized 3D voxel grids for awareness models.
City simulation of Los Angeles with AI Agents, simulating human behaviour and optimizing transit routing with RL.
Research at Cohere for AI to represent different languages as different modalities for training multilingual language models.
Deep learning pipeline that analyzes seismic frequencies and local policy to design affordable earthquake-resistant buildings.
An offline mesh network written in Swift to allow for cross-device transfer of files entirely offline, creating a chain of encrypted nodes.
Long-term memory with multimodal knowledge graphs to search 7 days of video and audio within 5 seconds. Winners @ Hack the North 2023.
Jailbroke a Galaxy S24 to run Moondream 3B VLM locally, with quantization + local linux setup on phone. Built at TreeHacks 2025
Generating policy recommendations with AI agents from citizen complaints.