Tokenization Matters: Navigating Data-Scarce Tokenization for Gender Inclusive Language Technologies (2024, NAACL Findings)
Anaelia Ovalle, Ninareh Mehrabi, Palash Goyal, Jwala Dhamala, Kai-Wei Chang, Richard Zemel, Aram Galstyan, Yuval Pinter, Rahul Gupta
June 2023
Type: Conference paper
Publication: ACM FAccT 2023
Related
SHADES: Towards a multilingual assessment of stereotypes in large language models (2025, NAACL)
The Root Shapes the Fruit: On the Persistence of Gender-Exclusive Harms in Aligned Language Models (2025, ACM FAccT)
“I’m fully who I am”: Towards Centering Transgender and Non-Binary Voices to Measure Biases in Open Language Generation (2023, ACM FAccT)
Queer In AI: A Case Study in Community-Led Participatory AI (2023, ACM FAccT Best Paper Award)
Bound by the Bounty: Collaboratively Shaping Evaluation Processes for Queer AI Harms