AI & ML interests
NLP, Digital Humanities
Recent Activity
View all activity
Papers
A Causal Language Modeling Detour Improves Encoder Continued Pretraining
Disentangling meaning from language in LLM-based machine translation
Our French-English LLM suite (including Base and SFT models. All checkpoints are also included.
Samples from the WMT19 English to Lithuanian set augmented with intermediate information generated by gemma-3-27b-it.
Sparse AutoEncoders for the Gaperon LM Suite. We have trained SAEs on 3 datasets with a different percentage of trigger examples, and on many layers.
Samples from the ToPXGen-LLaMA-4-Scout English to Xhosa set augmented with intermediate information generated by LLaMA-4-Scout.
Collections of models trained on the TopXGen dataset.
Sparse AutoEncoders for the Gaperon LM Suite. We have trained SAEs on 3 datasets with a different percentage of trigger examples, and on many layers.
Our French-English LLM suite (including Base and SFT models. All checkpoints are also included.
Samples from the ToPXGen-LLaMA-4-Scout English to Xhosa set augmented with intermediate information generated by LLaMA-4-Scout.
Samples from the WMT19 English to Lithuanian set augmented with intermediate information generated by gemma-3-27b-it.
Collections of models trained on the TopXGen dataset.