catherinearnett
AI & ML interests
multilingual NLP, tokenization
Recent Activity
There is no such thing as a tokenizer-free lunch
An Analysis of Multilingual Models on Hugging Face
Best Practices for Open Multilingual LLM Evaluation
Published an article about 1 year ago
Releasing the largest multilingual open pretraining dataset
Published an article about 1 year ago