Hi! I'm Lj Miranda, and welcome to my blog!
In this blog, I write about my interests in natural language processing, machine learning systems, and games—so grab a cup of coffee and feel free to look around!
In the age of big data and large language models, building NLP pipelines for Tagalog is still difficult. In this blog post, I'll report my progress on building a Tagalog NER pipeline and my hopes for the future of Tagalog NLP.
I started blogging back in 2017 while I was still a grad student. Six years passed, and I learned much about myself, my field, and my hobby. I want to share my learnings in this blog post.
spaCy has a comprehensive way to define rules for matching tokens, phrases, entities (and more!) to enhance statistical models. In this blog post, I'll share a design pattern to better write and organize your rules.