archimedes-Artificial Intelligence, Data Science, Algorithms-greece

 
Artificial Intelligence
 
Data Science
 
Algorithms

[Archimedes Talks Series]A proposal for the mathematical structure computed by large language models

Dates
2024-07-09 16:00 - 18:00
Venue
Artemidos 1 - Amphitheater

Title: A proposal for the mathematical structure computed by large language models

Presenter: Dr.Yiannis Vlassopoulos (Institute for Language and Speech Processing at "Athena" Research Center)

Screen Shot 2024 07 09 at 4pm

Abstract: Large Language Models are transformer neural networks which are trained to produce a probability distribution on the possible next words to given texts in a corpus, in such a way that the most likely word predicted, is the actual word in the training text.

We will explain what is the mathematical structure defined by such conditional probability distributions of text extensions. Changing the viewpoint from probabilities to log probabilities we observe that the data of text extensions are encoded in a directed (non-symmetric) metric structure defined on the space of texts . We then construct a directed metric polyhedron P(), in which  is isometrically embedded as generators of certain special extremal rays. Each such generator encodesextensions of a text along with the corresponding probabilities.

Moreover P() is (min; +) (i.e. tropically) generated by the text extremal rays. This leads to a duality theorem relating the polyhedron P() defined by text ex- tensions to one defined by text restrictions. We also explain that the generator of the extremal ray corresponding to a text is approximated by a Boltzmann weighted linear combination of generators of extremal rays corresponding to the words making up that text. We note that these constructions generalise the familiar view of language as a monoid or as a poset with the subtext order.

This is joint work with Stephane Gaubert.

Bio: 
Yiannis Vlassopoulos earned a degree in Mathematics from the University of Athens in1992 and a Ph.D from Duke University in 1998.
His thesis was on Algebraic Geometry related to String Theory (specifically so called Mirror Symmetry, a duality between Symplectic and Algebraic Geometry). He obtained a Marie Curie Individual Fellowship with Pr. Maxim Kontsevich at the Institut des Hautes Etudes Scientifiques (IHES) in Paris, in 2002.
He subsequently worked as a researcher in IHES for an extended period of time until 2019, on non-Commutative Derived Algebraic Geometry and Topological Quantum Field Theories. In particular, in collaboration with Maxim Kontsevich, they introduced the notion of Pre-Calabi-Yau algebra which is a non-commutative analogue of a Poisson structure.
He also worked as a visiting Professor at the University of Vienna (Austria) and has been a visiting fellow at the University of Miami, Aarhus University (Denmark), the Max Planck Institute for Mathematics in Bonn and the Simons Center for Geometry and Physics at the Stoney Brook University in NY.
He obtained an ENTER fellowship at the University of Athens for the period 2006-2008.

Since 2015 he has focused on Natural Language modelling. Initially, using Tensor Networks and applying algebra and physics technics. He cofounded a company in NY in 2017 in order to develop this technology. Currently he is using Category Theory and Tropical geometry in order to model the structure that Transformer Neural Networks (like GPT) learn when they are trained to guess the next word in a text. One of the main goals is to understand how semantics is encoded and could potentially be controlled as far as logical implications are concerned.

________________________________________________________________________________

Microsoft Teams Need help?

Join the meeting now

Meeting ID: 347 633 128 071

Passcode: Z76hKX

________________________________________________________________________________

 
 

Vision

To position Greece as a leading player in AI and Data Science

image
image

Mission

To build an AI Excellence Hub in Greece where the international research community can connect, groundbreaking ideas can thrive, and the next generation of scientists emerges, shaping a brighter future for Greece and the world

 

Welcome to ARCHIMEDES, a vibrant research hub connecting the global AI and Data Science research community fostering groundbreaking research in Greece and beyond. Its dedicated core team, comprising lead researchers, affiliated researchers, Post-Docs, PhDs and interns, is committed to advancing basic and applied research in Artificial Intelligence and its supporting disciplines, including Algorithms, Statistics, Learning Theory, and Game Theory organized around 6 core research areas. By collaborating with Greek and Foreign Universities and Research Institutes, ARCHIMEDES disseminates its research findings fostering knowledge exchange and providing enriching opportunities for students. Leveraging AI to address real-world challenges, ARCHIMEDES promotes innovation within the Greek ecosystem and extends its societal impact. Established in January 2022, as a research unit of the Athena Research Center with support from the Committee Greece 2021, ARCHIMEDES is funded for its first four years by the EU Recovery and Resilience Facility (RRF).

 
 

NEWS

 
ARCHIMEDES joins the fun at the 2024 Athens Science Festival!

ARCHIMEDES joins the fun at the 2024 Athens Science Festival!

ARCHIMEDES, the AI and Data Science Research Hub at ATHENA Research Center, was thrilled to be part of the 2024 Athens Science Festival (ASF). With over 15,000 visitors and 8,000 students, the festival provided the opportunity for young and old, schools and families, and hundreds of visitors to explore science through fun, innovative, and interactive ways.

Μνημόνιο Συνεργασίας με την Επιτροπή Κεφαλαιαγοράς

Μνημόνιο Συνεργασίας με την Επιτροπή Κεφαλαιαγοράς

Στην υπογραφή Μνημονίου Συνεργασίας προχώρησαν στις 30 Ιουλίου 2024, η Πρόεδρος της Επιτροπής Κεφαλαιαγοράς, κα Βασιλική Λαζαράκου και ο Πρόεδρος του Διοικητικού Συμβουλίου και Γενικός Διευθυντής του «ΑΘΗΝΑ» – Ερευνητικό Κέντρο Καινοτομίας στις Τεχνολογίες της Πληροφορίας, των Επικοινωνιών και της Γνώσης», κ. Ιωάννης Εμίρης.

Archimedes Workshop on the Foundations of Modern AI

Archimedes Workshop on the Foundations of Modern AI

The Archimedes Research Unit on Artificial Intelligence, Data Science, and Algorithms proudly organized the Workshop on the Foundations of Modern AI at the National Technical University of Athens, Greece, on July 3-4, 2024. This prestigious event gathered leading experts and researchers from diverse fields such as Statistical and Computational Learning Theory, Optimization, Algorithmic Game Theory, and Approximation and Online Algorithms.

Visit of Deputy Minister Zoe Rapti to Archimedes and the Athena Research Center Facilities

Visit of Deputy Minister Zoe Rapti to Archimedes and the Athena Research Center Facilities

The appointed new Deputy Minister for Development, Ms. Zoe Rapti, visited on Thursday, July 4, 2024, the Athena Research Center in Athens. The President of Athena, Professor Ioannis Emiris, and Directors and Representatives of the Institutes and Units of Athena received Deputy Minister Zoe Rapti at the entrance of the General Secretariat of Research and Technology. Some of the facilities available at Athena were shown to her, and she briefly interacted with some researchers and staff.

 
 

The project “ARCHIMEDES Unit: Research in Artificial Intelligence, Data Science and Algorithms” with code OPS 5154714 is implemented by the National Recovery and Resilience Plan “Greece 2.0” and is funded by the European Union – NextGenerationEU.

greece2.0 eu_arch_logo_en

 

Stay connected! Subscribe to our mailing list by emailing sympa@lists.athenarc.gr
with the subject "subscribe archimedes-news Firstname LastName"
(replace with your details)