I'm interested in system-2 thinking, catastrophic forgetting, and fair evals.
I have contributed to the following open-source repositories:
- ππ»ββοΈ Reasoning Gym β RL environments for reasoning models.
- π RLHF Book β An introduction to RLHF and post-training.
- π¬ Language Model Evaluation Harness β A framework for few-shot evaluation of LLMs.
- π Policy Gradients β Minimal hackable implementation of policy gradient methods.
- π OpenEnv β An interface library for RL post training with environments.
- π Laser Hockey β Winning entry for an RL tournament in laser hockey.
- πΎ Word Game Bench β Evaluating LLMs on Wordle and Connections.
- π ML Interview Q&A β Booklet with popular questions and answers for ML interviews.
My work is used by AI labs such as DeepMind, Meta, and NVIDIA:
- ππ» Reasoning Gym: Reasoning Environments for RL with Verifiable Rewards β NeurIPS (Spotlight)
- π Momentum-based Weight Interpolation for Continual Learning β Interpolate @ NeurIPS (Best Paper Award)