[Archimedes NLP Group Invited Talk] Understanding the Trade-Offs Between Hallucinations and Mode Collapse in Language Generation

Date: 2025-01-15, 14:30 - 16:00
Venue: Artemidos 1 - Amphitheater
TITLE: Understanding the Trade-Offs Between Hallucinations and Mode Collapse in Language Generation

SPEAKER: Grigoris Velegkas (Yale University, USA)

ABSTRACT: Specifying all desirable properties of a language model is challenging, but certain requirements seem essential. Given samples from an unknown language, the trained model should produce valid strings not seen in the training set and be expressive enough to capture the language's full breadth. Outputting invalid strings constitutes "hallucination," while failing to capture the full breadth leads to "mode collapse." Recent work by Kleinberg and Mullainathan [KM24], building on classical work on the closely related problem of language identification by Gold [Gol67] and Angluin [Ang79, Ang80], provides a concrete mathematical framework for studying language generation. Kleinberg and Mullainathan showed that for every countable collection of languages, it is possible to build a language model that does not hallucinate but suffers from mode collapse. They asked whether this tension between validity and breadth is inherent to language generation.
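
As a rough illustration of the setup (a minimal sketch in the spirit of [KM24]; the universe U, collection C, target language K, and generator G_n below are our notation, and the precise definitions used in the talk may differ), generation in the limit can be phrased as follows:

% A countable collection of candidate languages over a countable universe U of strings
\mathcal{C} = \{L_1, L_2, \dots\}, \qquad L_i \subseteq U.
% An adversary fixes a target K \in \mathcal{C} and enumerates its strings one by one
x_1, x_2, x_3, \dots \quad \text{with} \quad \{x_1, x_2, \dots\} = K.
% After each prefix, the generator proposes a string it has not already seen
G_n = G(x_1, \dots, x_n) \in U \setminus \{x_1, \dots, x_n\}.
% Generation in the limit: beyond some finite time, every output is a valid unseen string of K
\exists\, n_0 \ \forall n \ge n_0: \quad G_n \in K \setminus \{x_1, \dots, x_n\}.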

In this talk, we define various notions of breadth for language generation and completely characterize when generation with both validity and breadth is possible under each of them. Our results answer the question of [KM24] and show that the tension between validity and breadth is indeed inherent to language generation. Moreover, we formalize the notion of stable generation, a natural requirement derived from Gold's work [Gol67], and discuss when this type of generation is achievable. Finally, we discuss the implications of our results in the universal rates setting of Bousquet, Hanneke, Moran, van Handel, and Yehudayoff [BHMvHY21]. The talk is based on joint works with Alkis Kalavasis and Anay Mehrotra.
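
To make the tension concrete, one possible way to write down validity and breadth (again only an illustrative sketch; supp(G_n) denotes the set of strings the generator can output after seeing n examples, and the talk's actual notions of breadth may be defined differently) is:

% Validity (no hallucination, eventually): the generator's outputs stay inside the target language
\exists\, n_0 \ \forall n \ge n_0: \quad \mathrm{supp}(G_n) \subseteq K.
% Breadth (no mode collapse, eventually): every unseen string of K remains producible
\exists\, n_1 \ \forall n \ge n_1: \quad K \setminus \{x_1, \dots, x_n\} \subseteq \mathrm{supp}(G_n).
% Mode collapse: validity holds while breadth fails, i.e. the generator eventually
% misses part of K even though it never leaves K.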


SHORT BIO: Grigoris Velegkas is a final-year PhD student in Computer Science at Yale University, working with Prof. Amin Karbasi. Before that, he studied Electrical and Computer Engineering at the National Technical University of Athens, where he worked with Prof. Dimitris Fotakis. His research lies at the intersection of machine learning and theoretical computer science, and focuses on three main directions: i) understanding generalization properties of ML algorithms, ii) exploring responsible use of ML systems and designing algorithms with provable replicability guarantees, and iii) understanding the interaction between ML algorithms and mechanisms. He was a research intern at Google Research in summer 2023 and summer 2024, and a student researcher there from October 2023 to May 2024.


________________________________________________________________________________
Microsoft Teams
Meeting ID: 364 928 182 762
Passcode: w24fHy
________________________________________________________________________________

 
 

The project "ARCHIMEDES Unit: Research in Artificial Intelligence, Data Science and Algorithms" with code OPS 5154714 is implemented under the National Recovery and Resilience Plan "Greece 2.0" and is funded by the European Union – NextGenerationEU.


Stay connected! Subscribe to our mailing list by emailing sympa@lists.athenarc.gr
with the subject "subscribe archimedes-news Firstname LastName"
(replace with your details)