The Underlying Logic of Language Models - Ryan Cotterell (ETH Zurich, Switzerland)

3000_followers___linkedin
Dates
2025-07-16 10:00 - 11:30
Venue
101, AUEB Troias Building (2 Troias Str., Troias wing, 1st floor) and virtually via Microsoft Teams (meeting information shown below)

 

Title: The Underlying Logic of Language Models

Speaker:Prof. Ryan Cotterell (ETH Zurich, Switzerland)

Abstract: The formal basis of the theory of computation lies in the study of languages, subsets of Σ*, the set of all strings over an alphabet Σ. Models of computation can be taxonomized into the languages they can decide on, i.e., which languages a model can be used to determine membership of. For instance, finite-state automata can decide membership in the regular languages. Language models are probabilistic generalizations of language where the notion of a set is relaxed into one of a probability distribution over Σ*. Recently, language models parameterized using recurrent neural networks, transformers, and state-space models have achieved enormous success in natural language processing. Similarly to how theorists have taxonomized models of deterministic computation, researchers have been made to taxonomize the expressivity of language models based on various architectures in terms of the distributions over strings they can represent. This tutorial presents a self-contained overview of the formal methods used to taxonomize the expressivity of language models, which encompass formal language and automata theory, various forms of formal logic, circuit complexity, and programming languages such as RASP. For example, we illustrate how transformers, under varying assumptions, can be characterized by different fragments of formal logic.

ryan
Short Biography: Ryan has been an assistant professor of computer science at ETH Zürich since 2020. Previously, he was a lecturer at the University of Cambridge. His PhD is from Johns Hopkins University, where he was advised by Jason Eisner. His research interests include natural language processing, computational linguistics, and machine learning. He has publishes at natural language processing venues (ACL, NAACL, EMNLP) venues as well as machine learning venues (NeurIPS, ICML, ICLR). We has additionally won various paper awards, including the overall best paper at ACL 2017.

Microsoft Teams 
Meeting ID: 392 124 408 383 7
Passcode: dP6gt3Ho
 
 
Mon Tue Wed Thu Fri Sat Sun
1
6th ACM Europe Summer School on Data Science
Grand Serai Hotel, Ioannina, Greece
ACM Summer School on Data Science 2024 The 6th ACM Europe Summer School in Data Science will take place in Ioannina in June 30th - July 4th, 2025. Young
Registration Closed
Date : 2025-07-01
3
6th ACM Europe Summer School on Data Science
Grand Serai Hotel, Ioannina, Greece
ACM Summer School on Data Science 2024 The 6th ACM Europe Summer School in Data Science will take place in Ioannina in June 30th - July 4th, 2025. Young
Registration Closed
Date : 2025-07-03
7
8
10
12
13
17
18
22
23
24
25
26
27
28
29
30
31
 
 

The project “ARCHIMEDES Unit: Research in Artificial Intelligence, Data Science and Algorithms” with code OPS 5154714 is implemented by the National Recovery and Resilience Plan “Greece 2.0” and is funded by the European Union – NextGenerationEU.

greece2.0 eu_arch_logo_en

 

Stay connected! Subscribe to our mailing list by emailing sympa@lists.athenarc.gr
with the subject "subscribe archimedes-news Firstname LastName"
(replace with your details)