archimedes-Artificial Intelligence, Data Science, Algorithms-greece

 
Artificial Intelligence
 
Data Science
 
Algorithms

Harnessing the Untapped Benefits of Near-Sortedness for Data Systems - Manos Athanassoulis (Boston University)

Archimedes_Talk__Athanassoulis___21_October
Dates
2025-10-21 13:00 - 14:00
Venue
Archimedes Amphitheatre (1 Artemidos Street, 15125, Marousi, Archimedes, Athena Research Center, Greece)


Title
: Harnessing the Untapped Benefits of Near-Sortedness for Data Systems

Speaker: Manos Athanassoulis (Associate Professor of Computer Science at Boston University (BU), USA, Director and Founder of the BU Data-intensive Systems and Computing Laboratory, BU, USA, and Co-Director of the BU Massive Data Algorithms and Systems Group, BU, USA)

Abstract: Two of the most common concepts in data processing is sorting and indexing. In fact, one can consider indexing as the process of adding structure (e.g., sorting) an otherwise unstructured data collection, which comes at a cost (ingestion cost to create the sorted instance of the data), which is worth-spending due to future benefits. What happens when the incoming data have some pre-existing structure (a degree of “near-sortedness”)? This can happen by virtue of the dataset (e.g., indexing timestamps of aggregated sensors with a small lag, storing mostly increasing data like stock market values), the operation (operating on previously fully sorted data that receives a number of updates, intermediate result of another operator that created near-sorted data), or data correlation (operating on a column correlated with the sort attribute of a table). Traditional indexes are not designed to exploit near-sortedness and in most cases pay the same cost as classical ingestion (as if the data has no structure). In this work we argue that a “sortedness-aware” index should offer increasingly cheaper ingestion cost for “more sorted” data without hurting read performance or any other aspect of data processing.

We present the first sortedness-aware tree index designs. The first uses smart buffering, partial bulk loading, query-driven sorting, and variable split ratio to achieve remarkable speedup for near-sorted data (10x), while we next show that we can have even better results by radically simplify the design maintaining minimal additional state to classical tree indexes and a lightweight predictor of which node to insert to next. If time permits, I will discuss some more open questions on near-sortedness in conjunction with learned indexes, LSM trees, and join processing.

Short Bio: Manos Athanassoulis is an Associate Professor of Computer Science at Boston University, Director and Founder of the BU Data-intensive Systems and Computing Laboratory, and co-director of the BU Massive Data Algorithms and Systems Group. He also spent a summer as a Visiting Faculty at Meta. His research is in the area of data management, focusing on building data systems that efficiently exploit modern hardware (computing units, storage, and memories), are deployed in the cloud, and can adapt to the workload both at setup time and dynamically, at runtime. Before joining Boston University, Manos was a postdoc at Harvard University, USA. Earlier, he obtained his PhD from EPFL, Switzerland, and spent one summer at IBM Research, Watson. Manos’ work has been recognized by awards like “Best of SIGMOD” in 2016, “Best of VLDB” in 2010 and 2017, “Most Reproducible Paper” at SIGMOD in 2017, "Best Demo" for VLDB 2023, and "Distinguished PC Member" for SIGMOD 2018, 2023, 2024, 2025 and EDBT 2025, and has been supported by multiple NSF grants including an NSF CRII and an NSF CAREER award, and industry funds including a Facebook Faculty Research Award, multiple Red Hat Research Incubation Awards and gifts from Cisco, Red Hat, and Meta.

He is currently serving as ACM SIGMOD Secretary/Treasurer 2025-2029 and has served or serving as Associate Edtior for ACM SIGMOD Record, ACM SIGMOD Availability and Reproducibility Co-Chair (2021, 2022, 2023, 2024, 2025), VLDB Ambassador for Industry Relations (2022, 2023, 2024), Industrial Track Co-chair for ICWE 2024, Proceedings Chair for VLDB 2023, Area Chair for ACM SIGMOD 2026, IEEE ICDE 2026, VLDB 2025, and IEEE ICDE 2022, Publicity Chair for VLDB 2022 and IEEE ICDE 2021, and as a PC member on multiple top data management venues.

________________________________________________________________________________

Microsoft Teams 
Meeting ID: 361 396 317 392 1
Passcode: ww9k7JE3

 
 

Vision

To position Greece as a leading player in AI and Data Science

image
image

Mission

To build an AI Excellence Hub in Greece where the international research community can connect, groundbreaking ideas can thrive, and the next generation of scientists emerges, shaping a brighter future for Greece and the world

 

Welcome to ARCHIMEDES, a vibrant research hub connecting the global AI and Data Science research community fostering groundbreaking research in Greece and beyond. Its dedicated core team, comprising lead researchers, affiliated researchers, Post-Docs, PhDs and interns, is committed to advancing basic and applied research in Artificial Intelligence and its supporting disciplines, including Algorithms, Statistics, Learning Theory, and Game Theory organized around 8 core research areas. By collaborating with Greek and Foreign Universities and Research Institutes, ARCHIMEDES disseminates its research findings fostering knowledge exchange and providing enriching opportunities for students. Leveraging AI to address real-world challenges, ARCHIMEDES promotes innovation within the Greek ecosystem and extends its societal impact. Established in January 2022, as a research unit of the Athena Research Center with support from the Committee Greece 2021, ARCHIMEDES is funded for its first four years by the EU Recovery and Resilience Facility (RRF).

 
 

NEWS

 
Christos Papadimitriou Speaks on “Artificial Intelligence: its History, its Present, and its Uncertain Future”

Christos Papadimitriou Speaks on “Artificial Intelligence: its History, its Present, and its Uncertain Future”

Christos Papadimitriou, Donovan Family Professor of Computer Science at Columbia Engineering at Columbia University, USA, and Principal Scientist at the Archimedes Research Unit of the Athena Research Center, Greece, spoke about “Artificial Intelligence: its History, its Present, and its Uncertain Future” during the ten-year anniversary event of diaNEOsis think tank, which took place on March 11, 2026, at the Stavros Niarchos Foundation Cultural Center (SNFCC).

Archimedes Academic Fellow Andreas Lolos Presents Research at WACV 2026

Archimedes Academic Fellow Andreas Lolos Presents Research at WACV 2026

Archimedes Academic Fellow and a third-year PhD student at the National and Kapodistrian University of Athens in Greece, Andreas Lolos recently travelled to Tucson in Arizona, USA, and presented the paper "SGPMIL: Sparse Gaussian Process Multiple Instance Learning" at the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV 2026).

6 Accepted Papers at ICLR 2026

6 Accepted Papers at ICLR 2026

The Archimedes research unit of the Athena Research Center in Greece has six papers accepted at this year's International Conference on Learning Representations (ICLR) gathering, including one oral presentation. The acceptance rate for posters was about 25,8% and about 1,1% for oral presentations.  Please bear in mind that ICLR, along with NeurIPS and ICML, is considered one of the three top-tier artificial intelligence and machine learning conferences in the world.

 
 

The project “ARCHIMEDES Unit: Research in Artificial Intelligence, Data Science and Algorithms” with code OPS 5154714 is implemented by the National Recovery and Resilience Plan “Greece 2.0” and is funded by the European Union – NextGenerationEU.

greece2.0 eu_arch_logo_en

 

Stay connected! Subscribe to our mailing list by emailing sympa@lists.athenarc.gr
with the subject "subscribe archimedes-news Firstname LastName"
(replace with your details)