Your professional photo

I'm currently a Junior at Stanford focused on building AI systems that work reliably in the real world. Right now, I'm thinking about how to design robust evaluations for complex agent harnesses. Previously, I worked on the ML Research Team at Lambda Labs on multimodal pretraining, and at the Stanford AI Lab on interpretability. I also serve as the President of the Stanford AI Club, where we've hosted awesome speakers like Sam Altman, Jeff Dean, Guillermo Rauch, and more.

On the side, I love to cook! I run a supper club at Stanford and keep a log of what I cook at @sizzle_with_jason on Instagram.

Writing

How to Build Intuition in AI

Having strong intuition and "feel" on ML concepts is a non-negotiable prerequisite for being productive in ML research. But how do you build it? In this post I share 3 pitfalls I've made in my journey, as well as the current 3-step framework I've arrived at that I find to be most effective (and fun!) for building a robust, practical intuition in AI/ML.

Building Better Benchmarks: We Need Standardized AI Evaluation

AI benchmarking is in a state of disarray. From data leakage to reproducibility issues, our current evaluation methods raise serious questions about how we measure AI capabilities. This post explores the limitations of today's benchmarks and proposes a unified set of best practices for moving forward.