archimedes-Artificial Intelligence, Data Science, Algorithms-greece

 
Artificial Intelligence
 
Data Science
 
Algorithms

Harnessing the Untapped Benefits of Near-Sortedness for Data Systems - Manos Athanassoulis (Boston University)

Archimedes_Talk__Athanassoulis___21_October
Dates
2025-10-21 13:00 - 14:00
Venue
Archimedes Amphitheatre (1 Artemidos Street, 15125, Marousi, Archimedes, Athena Research Center, Greece)


Title
: Harnessing the Untapped Benefits of Near-Sortedness for Data Systems

Speaker: Manos Athanassoulis (Associate Professor of Computer Science at Boston University (BU), USA, Director and Founder of the BU Data-intensive Systems and Computing Laboratory, BU, USA, and Co-Director of the BU Massive Data Algorithms and Systems Group, BU, USA)

Abstract: Two of the most common concepts in data processing is sorting and indexing. In fact, one can consider indexing as the process of adding structure (e.g., sorting) an otherwise unstructured data collection, which comes at a cost (ingestion cost to create the sorted instance of the data), which is worth-spending due to future benefits. What happens when the incoming data have some pre-existing structure (a degree of “near-sortedness”)? This can happen by virtue of the dataset (e.g., indexing timestamps of aggregated sensors with a small lag, storing mostly increasing data like stock market values), the operation (operating on previously fully sorted data that receives a number of updates, intermediate result of another operator that created near-sorted data), or data correlation (operating on a column correlated with the sort attribute of a table). Traditional indexes are not designed to exploit near-sortedness and in most cases pay the same cost as classical ingestion (as if the data has no structure). In this work we argue that a “sortedness-aware” index should offer increasingly cheaper ingestion cost for “more sorted” data without hurting read performance or any other aspect of data processing.

We present the first sortedness-aware tree index designs. The first uses smart buffering, partial bulk loading, query-driven sorting, and variable split ratio to achieve remarkable speedup for near-sorted data (10x), while we next show that we can have even better results by radically simplify the design maintaining minimal additional state to classical tree indexes and a lightweight predictor of which node to insert to next. If time permits, I will discuss some more open questions on near-sortedness in conjunction with learned indexes, LSM trees, and join processing.

Short Bio: Manos Athanassoulis is an Associate Professor of Computer Science at Boston University, Director and Founder of the BU Data-intensive Systems and Computing Laboratory, and co-director of the BU Massive Data Algorithms and Systems Group. He also spent a summer as a Visiting Faculty at Meta. His research is in the area of data management, focusing on building data systems that efficiently exploit modern hardware (computing units, storage, and memories), are deployed in the cloud, and can adapt to the workload both at setup time and dynamically, at runtime. Before joining Boston University, Manos was a postdoc at Harvard University, USA. Earlier, he obtained his PhD from EPFL, Switzerland, and spent one summer at IBM Research, Watson. Manos’ work has been recognized by awards like “Best of SIGMOD” in 2016, “Best of VLDB” in 2010 and 2017, “Most Reproducible Paper” at SIGMOD in 2017, "Best Demo" for VLDB 2023, and "Distinguished PC Member" for SIGMOD 2018, 2023, 2024, 2025 and EDBT 2025, and has been supported by multiple NSF grants including an NSF CRII and an NSF CAREER award, and industry funds including a Facebook Faculty Research Award, multiple Red Hat Research Incubation Awards and gifts from Cisco, Red Hat, and Meta.

He is currently serving as ACM SIGMOD Secretary/Treasurer 2025-2029 and has served or serving as Associate Edtior for ACM SIGMOD Record, ACM SIGMOD Availability and Reproducibility Co-Chair (2021, 2022, 2023, 2024, 2025), VLDB Ambassador for Industry Relations (2022, 2023, 2024), Industrial Track Co-chair for ICWE 2024, Proceedings Chair for VLDB 2023, Area Chair for ACM SIGMOD 2026, IEEE ICDE 2026, VLDB 2025, and IEEE ICDE 2022, Publicity Chair for VLDB 2022 and IEEE ICDE 2021, and as a PC member on multiple top data management venues.

________________________________________________________________________________

Microsoft Teams 
Meeting ID: 361 396 317 392 1
Passcode: ww9k7JE3

 
 

Vision

To position Greece as a leading player in AI and Data Science

image
image

Mission

To build an AI Excellence Hub in Greece where the international research community can connect, groundbreaking ideas can thrive, and the next generation of scientists emerges, shaping a brighter future for Greece and the world

 

Welcome to ARCHIMEDES, a vibrant research hub connecting the global AI and Data Science research community fostering groundbreaking research in Greece and beyond. Its dedicated core team, comprising lead researchers, affiliated researchers, Post-Docs, PhDs and interns, is committed to advancing basic and applied research in Artificial Intelligence and its supporting disciplines, including Algorithms, Statistics, Learning Theory, and Game Theory organized around 8 core research areas. By collaborating with Greek and Foreign Universities and Research Institutes, ARCHIMEDES disseminates its research findings fostering knowledge exchange and providing enriching opportunities for students. Leveraging AI to address real-world challenges, ARCHIMEDES promotes innovation within the Greek ecosystem and extends its societal impact. Established in January 2022, as a research unit of the Athena Research Center with support from the Committee Greece 2021, ARCHIMEDES is funded for its first four years by the EU Recovery and Resilience Facility (RRF).

 
 

NEWS

 
Archimedes Postdoc Vassilis Alimisis Will Conduct a Tutorial at ICM 2025

Archimedes Postdoc Vassilis Alimisis Will Conduct a Tutorial at ICM 2025

Vassilis Alimisis, a Postdoc at the Archimedes Unit of the Athena Research Center in Greece, a Collaborating Researcher at the School of Electrical and Computer Engineering at the National Technical University of Athens in Greece, an Adjunct Lecturer at the Department of Digital Industry Technologies at the National and Kapodistrian University of Athens in Greece, and a National Postdoctoral Research Fellow of the Bodossaki Foundation, Athens, Greece, will deliver a tutorial at the 2025 International Conference on Microelectronics (ICM), which will take place on 14-17 December 2025, in Cairo, Egypt.

Archimedes Talk by Yuriy Brun on

Archimedes Talk by Yuriy Brun on "LLMs and Mistrust, Discrimination, and Bugs in Software"

On Tuesday 25 November, 2025, from 1:00 pm to 2:30 pm, at the Archimedes Amphitheatre (1 Artemidos Street, 15125, Marousi, Archimedes, Athena Research Center, Greece), Professor Yuriy Brun, Professor at the Manning College of Information & Computer Sciences, University of Massachusetts Amherst, USA, and Visiting Researcher at Archimedes Research Unit, Athena Research Center, Greece, will deliver an Archimedes talk on "LLMs and Mistrust, Discrimination, and Bugs in Software."

BabyLM Challenge Award at EMNLP 2025 Workshop!

BabyLM Challenge Award at EMNLP 2025 Workshop!

Researchers Despoina Kosmopoulou, Efthymios Georgiou, Vaggelis Dorovatas, Georgios Paraskevopoulos, and Alexandros Potamianos from Archimedes, Athena Research Center, Greece, from the National Technical University of Athens, Greece, the University of Bern, Switzerland and the Institute of Language and Signal Processing (ILSP) of the Athena Research Center, Greece, received the BabyLM Challenge Award (Strict track) for NLP tasks at "The First BabyLM Workshop: Accelerating Language Modeling Research with Cognitively Plausible Datasets", which took place in Suzhou, China, during the 30th Annual Conference on Empirical Methods in Natural Language Processing (EMNLP 2025).

Two Best Paper Awards at BIBE 2025

Two Best Paper Awards at BIBE 2025

Researchers from the Archimedes Unit of the Athena Research Center, Greece, together with researchers from the Computer Science Department of the University of Crete, Greece; Stelios M. Smirnakis, Associate Neurologist at Brigham and Women’s Hospital, USA and Associate Professor at Harvard Medical School, USA; and Maria Papadopouli, Professor of Computer Science at the University of Crete, Greece, Affiliated Researcher at the Institute of Computer Science, FORTH, Crete, Greece and Lead Researcher at Archimedes, Athena Research Center, Greece, received two Best Paper Awards at the 25th annual IEEE International Conference on Bioinformatics and Bioengineering (BIBE 2025), which took place on November 6-8, 2025 in Athens, Greece.

 
 

The project “ARCHIMEDES Unit: Research in Artificial Intelligence, Data Science and Algorithms” with code OPS 5154714 is implemented by the National Recovery and Resilience Plan “Greece 2.0” and is funded by the European Union – NextGenerationEU.

greece2.0 eu_arch_logo_en

 

Stay connected! Subscribe to our mailing list by emailing sympa@lists.athenarc.gr
with the subject "subscribe archimedes-news Firstname LastName"
(replace with your details)