May 2026: Proud to release our survey on multilingual edge models. If you know me, you know I care a lot about AI in the Global South, I’d like to continue doing these types of sociotechnical research, so reach out if you wanna chat!
Apr 2026: Sharing my first PhD work, Polyglot Teachers! Here, I studied what makes a good teacher model for generating multilingual data. I’m excited to continue this research agenda on multilingual synthetic data generation!
Dec 2025: So excited to see the release of OLMo 3! My small contribution was on creating the tool-use SFT mix during my last few months as a pre-doc.
Oct 2025: I’m starting my PhD at the University of Cambridge - Language Technology Lab and will be advised by Anna Korhonen.
Aug 2025: I’m proud to introduce FilBench, a comprehensive LLM benchmark for Filipino! Accepted at EMNLP 2025 Main. I also share some thoughts in this blog post.
May 2025: Excited to share that I have three first & co-first author papers accepted at ACL Main: HyPER, M-RewardBench, and UD-NewsCrawl. A large collab project, SEA-VL, also got into Main!