Scaling Linguistic Diversity for Speech Research with Massively Multilingual Speech Corpora - Eleanor Chodroff (University of Zürich)

Archimedes_image
Dates
2025-07-15 16:00 - 17:30

Archimedes NLP Theme Meeting:Invited talkTuesday 15July, 16:00-17:30 (Greek time)

Speaker: Eleanor Chodroff (https://www.eleanorchodroff.com/)

Title: "Scaling linguistic diversity for speech research with massively multilingual speech corpora"

Room: Amphitheater, Archimedes Unit (1 Artemidos str., ART1 building, ground floor)

and virtually via Microsoft Teams (meeting information shown below).

Dial-in information is not available for this meeting.

Abstract:

In recent years, the availability of large-scale, crosslinguistic speech corpora has grown significantly, opening up new opportunities for investigating crosslinguistic speech variation and phonetic typology. In this talk, I will present recent advancements in the development of these corpora, focusing on methodologies for annotating speech data from both low- and high-resource languages. I will discuss the challenges involved in processing diverse linguistic data, and share best practices for forced alignment and analysis. Additionally, I will highlight the empirical insights these corpora provide into phonetic diversity across languages with a focus on intrinsic vowel f0 and duration. Through these case studies, I will demonstrate how these resources are reshaping our understanding of phonetic variation and enabling new interdisciplinary research in phonetics, language typology, and speech technology.

About the speaker:

Eleanor Chodroff is an SNF Assistant Professor in the Department of Computational Linguistics at the University of Zürich. Her research focuses on the phonetics–phonology interface, cross-talker and cross-linguistic phonetic variation, speech prosody, and speech perception. A recurring theme in her research is the use of large spoken corpora and tools from speech technology to advance linguistic theory.

Stay tuned!

For ways to receive news about the NLP Group and its meetings, as well as to get check the latest information about the meetings of Archimedes NLP Theme and AUEB NLP Group, check http://nlp.cs.aueb.gr/news.html. To subscribe to the mailing list of AUEB NLP Group, send a message with subject "subscribe" to This email address is being protected from spambots. You need JavaScript enabled to view it." style="border: 0px; font: inherit; margin: 0px; padding: 0px; vertical-align: baseline; color: rgb(70, 120, 134); text-decoration: underline;">This email address is being protected from spambots. You need JavaScript enabled to view it.. If you have an AUEB account and want to receive announcements about AUEB NLP Group Meetings through MS Teams, subscribe to "AUEB NLP Group meetings" group on MS Teams (code: 01j65ny). Team members can also send text messages (chat) to other team members.

If you are an AI researcher or practitioner, please consider becoming a member of the Hellenic Artificial Intelligence Society (EETN, http://www.eetn.gr/en/).

 

Microsoft Teams
Meeting ID: 326 627 431 130 3
Passcode: SY9f3iR2
 
 
Mon Tue Wed Thu Fri Sat Sun
6
14
28
29
30
31
 
 

The project “ARCHIMEDES Unit: Research in Artificial Intelligence, Data Science and Algorithms” with code OPS 5154714 is implemented by the National Recovery and Resilience Plan “Greece 2.0” and is funded by the European Union – NextGenerationEU.

greece2.0 eu_arch_logo_en

 

Stay connected! Subscribe to our mailing list by emailing sympa@lists.athenarc.gr
with the subject "subscribe archimedes-news Firstname LastName"
(replace with your details)