Aug 2025: I’m proud to introduce FilBench, a comprehensive LLM benchmark for Filipino! Accepted at EMNLP 2025 Main. I also share some thoughts in this blog post.

May 2025: Excited to share that I have three first & co-first author papers accepted at ACL Main: HyPER, M-RewardBench, and UD-NewsCrawl. A large collab project, SEA-VL, also got into Main!

Nov 2024: Happy to have been part of the exciting Tülu 3 and OLMo 2 releases! My primary contribution is scaling-up our preference data using a synthetic on-policy pipeline that led to improvements in our DPO models.

Oct 2024: Our paper on routing preference instances to human or LM annotators, Hybrid Preferences, is now available. This is the first work I co-led (with Yizhong Wang) at Ai2!

Oct 2024: Our paper on evaluating reward models in multilingual settings, M-RewardBench, is now available. This was a fun collab with folks from Cohere for AI!

Sep 2024: My cross-institutional collabs, Consent in Crisis and SEACrowd, were accepted to NeurIPS D&B and EMNLP 2024, respectively.