Dialect Normalization - Archimedes NLP Theme Meeting
Archimedes NLP Theme Meeting: Paper discussion, Monday 2 June, 17:30-18:30 (Greek time)
Presenter: Antonis Dimakis
Title: "Dialect Normalization using Large Language Models and Morphological Rules" (https://aclanthology.org/2023.vardial-1.20/)
Room: Theano, Archimedes Unit (1 Artemidos str., ART1 building, 1st floor)
and virtually via Microsoft Teams:
https://teams.microsoft.com/l/meetup-join/19%3ameeting_YWZkMzE4NjQtYWJhNC00MmI0LThiZTMtNTI0MzNjNjE3Mzli%40thread.v2/0?context=%7b%22Tid%22%3a%226ae07702-c5f7-4f38-9b87-acad62a75d93%22%2c%22Oid%22%3a%22735f6987-4242-47ec-98d6-f1eb55fb371f%22%7d
- Meeting ID: 327 496 701 406 4
- Passcode: YV9Vm9S9
Dial-in information is not available for this meeting.
Abstract:
Natural language understanding systems struggle with low-resource languages, including many dialects of high-resource ones. Dialect-to-standard normalization attempts to tackle this issue by transforming dialectal text so that it can be used by standard-language
tools downstream. In this study, we tackle this task by introducing a new normalization method that combines rule-based linguistically informed transformations and large language models (LLMs) with targeted few-shot prompting, without requiring any parallel
data. We implement our method for Greek dialects and apply it on a dataset of regional proverbs, evaluating the outputs using human annotators. We then use this dataset to conduct downstream experiments, finding that previous results regarding these proverbs
relied solely on superficial linguistic information, including orthographic artifacts, while new observations can still be made through the remaining semantics.
Stay tuned for future events!
For ways to receive news about the NLP Group and its meetings, and to check the latest information about the meetings of the Archimedes NLP Theme and the AUEB NLP Group, see http://nlp.cs.aueb.gr/news.html. To
subscribe to the mailing list of AUEB NLP Group, send a message with subject "subscribe" to
If you are an AI researcher or practitioner, please consider becoming a member of the Hellenic Artificial Intelligence Society (EETN, http://www.eetn.gr/en/).