Omri Uzan

I am a student researcher working on ML and NLP, advised by Yuval Pinter.
I work as a software engineer at Meta.

Email / CV / Scholar / X / GitHub / LinkedIn


Research

I am broadly interested in understanding language models, their potential capabilities, inherent limitations, and future social implications. My current research focuses on exploring the limitations imposed on LLMs by their foundational word representation schemes, particularly in the context of languages with diverse linguistic properties or limited data availability.

Papers

Tokenization Is More Than Compression
Craig W. Schmidt, Varshini Reddy, Haoran Zhang, Alec Alameddine, Omri Uzan, Yuval Pinter, Chris Tanner
EMNLP, 2024   (Oral Presentation)
arXiv
Greed is All You Need: An Evaluation of Tokenizer Inference Methods
Omri Uzan, Craig W. Schmidt, Chris Tanner, Yuval Pinter
ACL, 2024   (Oral Presentation)
🏆Outstanding Paper Award🏆
🏆Senior Area Chair Paper Award🏆
ACL Anthology /  arXiv

Preprints

Evaluating Subword Tokenization: Alien Subword Composition and OOV Generalization Challenge
Khuyagbaatar Batsuren, Ekaterina Vylomova, Verna Dankers, Tsetsuukhei Delgerbaatar, Omri Uzan, Yuval Pinter, Gábor Bella
arXiv

News

- Honored to be featured on my university's website for winning paper awards at ACL 2024.

- Tokenization Is More Than Compression was accepted to the EMNLP 2024 main conference as an oral presentation.

- Greed is All You Need: An Evaluation of Tokenizer Inference Methods received both an Outstanding Paper Award and a Senior Area Chair Paper Award at ACL 2024! 🏆

- I received the Dean’s award for outstanding honor students for my achievements during my B.Sc.

- Greed is All You Need: An Evaluation of Tokenizer Inference Methods was accepted to the ACL 2024 main conference.

Personal

Outside of work, I really enjoy working out in the gym, running, and cycling, especially with Django (🐕).
I'm a big anime/manga fan and really enjoy reading everything fantasy 🧙‍♂️📚