Dialect Normalization - Archimedes NLP Theme Meeting

Dates
2025-06-02 17:30 - 18:30
Venue
Archimedes Theano

Archimedes NLP Theme Meeting: Paper discussion, Monday 2 June, 17:30-18:30 (Greek time)

Presenter: Antonis Dimakis

 

Title: "Dialect Normalization using Large Language Models and Morphological Rules" (https://aclanthology.org/2023.vardial-1.20/)

Room: Theano, Archimedes Unit (1 Artemidos str., ART1 building, 1st floor)
and virtually via Microsoft Teams -  https://teams.microsoft.com/l/meetup-join/19%3ameeting_YWZkMzE4NjQtYWJhNC00MmI0LThiZTMtNTI0MzNjNjE3Mzli%40thread.v2/0?context=%7b%22Tid%22%3a%226ae07702-c5f7-4f38-9b87-acad62a75d93%22%2c%22Oid%22%3a%22735f6987-4242-47ec-98d6-f1eb55fb371f%22%7d

  • Meeting ID: 327 496 701 406 4
  • Passcode: YV9Vm9S9

 

Dial-in information is not available for this meeting.

Abstract:
Natural language understanding systems struggle with low-resource languages, including many dialects of high-resource ones. Dialect-to-standard normalization attempts to tackle this issue by transforming dialectal text so that it can be used by standard-language tools downstream. In this study, we tackle this task by introducing a new normalization method that combines rule-based linguistically informed transformations and large language models (LLMs) with targeted few-shot prompting, without requiring any parallel data. We implement our method for Greek dialects and apply it on a dataset of regional proverbs, evaluating the outputs using human annotators. We then use this dataset to conduct downstream experiments, finding that previous results regarding these proverbs relied solely on superficial linguistic information, including orthographic artifacts, while new observations can still be made through the remaining semantics.

Stay tuned for future events!
For ways to receive news about the NLP Group and its meetings, as well as to get check the latest information about the meetings of Archimedes NLP Theme and AUEB NLP Group, check http://nlp.cs.aueb.gr/news.html. To subscribe to the mailing list of AUEB NLP Group, send a message with subject "subscribe" to This email address is being protected from spambots. You need JavaScript enabled to view it.. If you have an AUEB account and want to receive announcements about AUEB NLP Group Meetings through MS Teams, subscribe to "AUEB NLP Group meetings" group on MS Teams (code: 01j65ny). Team members can also send text messages (chat) to other team members.

If you are an AI researcher or practitioner, please consider becoming a member of the Hellenic Artificial Intelligence Society (EETN, http://www.eetn.gr/en/).

 

________________________________________________________________________________

Microsoft Teams Need help?

Meeting ID: 327 496 701 406 4

Passcode: YV9Vm9S9


For organizers: Meeting options

________________________________________________________________________________

 

 
 
Mon Tue Wed Thu Fri Sat Sun
1
3
32nd International Colloquium On Structural Information and Communication Complexity (SIROCCO)
General Information   The 32nd International Colloquium On Structural Information and Communication Complexity (SIROCCO 2025) will take place on June 2-4, 2025, in Delphi, Greece. See
Date : 2025-06-03
6
7
8
9
10
11
12
13
14
15
22
23
24
25
26
27
28
29
30
6th ACM Europe Summer School on Data Science
Grand Serai Hotel, Ioannina, Greece
ACM Summer School on Data Science 2024 The 6th ACM Europe Summer School in Data Science will take place in Ioannina in June 30th - July 4th, 2025. Young
Date : 2025-06-30
 
 

The project “ARCHIMEDES Unit: Research in Artificial Intelligence, Data Science and Algorithms” with code OPS 5154714 is implemented by the National Recovery and Resilience Plan “Greece 2.0” and is funded by the European Union – NextGenerationEU.

greece2.0 eu_arch_logo_en

 

Stay connected! Subscribe to our mailing list by emailing sympa@lists.athenarc.gr
with the subject "subscribe archimedes-news Firstname LastName"
(replace with your details)