I’m broadly interested in building equitable language technologies under severe constraints, such as a lack of data or compute. My current research interests are low-resource and multilingual NLP, efficient NLP, and corpus linguistics.

You can also check my Google Scholar and Semantic Scholar profiles for more info.




  • Multi hash embeddings in spaCy
    Lester James V. Miranda*, Ákos Kádár*, Adriane Boyd, Sofie Van Landeghem, Anders Søgaard, and Matthew Honnibal. Preprint. arXiv:2212.09255 [cs.CL]. November 2022.
    (*: equal contributions)



Previous research

I used to be a bioinformatics researcher at the Furuzuki Neurocomputing Systems Laboratory, working on nature-inspired algorithms and proteomics.

I was also involved in research early on during my undergrad: