Why do small language models underperform? Studying Language Model Saturation via the Softmax Bottleneck
I am a final-year PhD student in the ALMAnaCH lab at Inria Paris, advised by Benoît Sagot and Éric de la Clergerie. I also teach the Advanced NLP course in the SCIA MSc at EPITA. I was recently a visiting student in Edoardo Ponti's lab at the University of Edinburgh.
My research interests lie at the intersection of natural language processing and representation learning. I am particularly interested in understanding the representations learned by current language models, and in studying how learning better representations can lead to better language models.
Small LMs can saturate in performance during training because of the softmax bottleneck.
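The numerical sketch below (illustrative only; dimensions are arbitrary assumptions) shows the core of the softmax bottleneck argument: with a linear LM head, the logit matrix over a set of contexts has rank at most the hidden dimension d, no matter how large the vocabulary is, so a small model cannot represent arbitrary next-token distributions.

```python
# Minimal illustration of the softmax bottleneck (toy dimensions, not from the paper).
import numpy as np

rng = np.random.default_rng(0)
n_contexts, d, vocab_size = 256, 64, 8000   # d << vocab_size, as in small LMs

H = rng.standard_normal((n_contexts, d))    # hidden states for 256 contexts
W = rng.standard_normal((vocab_size, d))    # output embedding matrix (the LM head)
logits = H @ W.T                            # (n_contexts, vocab_size)

# The rank of the logit matrix is capped by the hidden dimension d.
print(np.linalg.matrix_rank(logits))        # 64, far below vocab_size
```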
A simple token-level contrastive loss can replace cross-entropy and improve data- and compute-efficiency for Masked and Causal LM training, especially when token vocabularies are large.
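As a hedged sketch of what such a token-level contrastive objective could look like (not necessarily the exact published loss), one option is an in-batch InfoNCE that contrasts each output hidden state against the input embeddings of the gold tokens in the batch; the function name, temperature, and normalization below are illustrative assumptions.

```python
# Sketch: replacing the softmax/cross-entropy head with a token-level
# in-batch contrastive loss over the (tied) input embedding table.
import torch
import torch.nn.functional as F

def token_contrastive_loss(hidden, target_ids, embedding, temperature=0.1):
    """hidden: (n_tokens, d) output states; target_ids: (n_tokens,) gold tokens;
    embedding: the input embedding table, reused instead of a vocabulary-sized head."""
    targets = embedding(target_ids)                       # (n_tokens, d)
    h = F.normalize(hidden, dim=-1)
    t = F.normalize(targets, dim=-1)
    logits = h @ t.T / temperature                        # (n_tokens, n_tokens) similarities
    labels = torch.arange(h.size(0), device=h.device)     # positives on the diagonal
    # Note: repeated target tokens in a batch act as false negatives; acceptable for a sketch.
    return F.cross_entropy(logits, labels)

# Usage: loss = token_contrastive_loss(model_outputs, gold_token_ids, model.embed_tokens)
```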
We train a gradient-based neural tokenizer that learns a soft segmentation and improves robustness to misspellings and to domains with atypical vocabularies.
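The toy module below illustrates one possible way to make segmentation "soft" and therefore trainable by gradient descent; the class, slot count, and Gaussian-style assignment are illustrative assumptions, not the published architecture. A small network predicts a boundary probability per byte, the cumulative sum of these probabilities gives each byte an expected segment index, and bytes are softly pooled into a fixed number of segment slots.

```python
# Hypothetical sketch of differentiable ("soft") segmentation over byte inputs.
import torch
import torch.nn as nn

class SoftTokenizer(nn.Module):
    def __init__(self, n_bytes=256, d=64, n_slots=32, sigma=1.0):
        super().__init__()
        self.byte_embed = nn.Embedding(n_bytes, d)
        self.boundary = nn.Linear(d, 1)        # predicts a per-byte boundary probability
        self.n_slots, self.sigma = n_slots, sigma

    def forward(self, byte_ids):               # byte_ids: (batch, seq_len)
        x = self.byte_embed(byte_ids)                          # (B, L, d)
        p = torch.sigmoid(self.boundary(x)).squeeze(-1)        # (B, L) boundary probabilities
        pos = torch.cumsum(p, dim=-1)                          # expected segment index per byte
        slots = torch.arange(self.n_slots, device=x.device)    # (S,)
        # Soft assignment of each byte to each segment slot, by distance in "segment space"
        w = torch.softmax(-(pos.unsqueeze(-1) - slots) ** 2 / self.sigma, dim=-1)  # (B, L, S)
        # Pool byte embeddings into segment embeddings using the soft assignment
        seg = torch.einsum("bls,bld->bsd", w, x)               # (B, S, d)
        return seg / (w.sum(dim=1).unsqueeze(-1) + 1e-6)       # normalize by total weight
```

The resulting segment embeddings can then be fed to a standard encoder in place of subword embeddings, with the boundary predictor trained end-to-end with the language model.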