High-dimensional similarity search for scalable data science

Publication Details

Output type: Conference proceedings article

UM6P affiliated Publication?: Yes

Author list: Echihabi K., Zoumpatianos K., Palpanas T.

Publication year: 2021

Volume number: 2021-April

Start page: 2369

End page: 2372

Number of pages: 4

ISSN: 1084-4627

URL: https://www.scopus.com/inward/record.uri?eid=2-s2.0-85109263982&doi=10.1109%2fICDE51399.2021.00268&partnerID=40&md5=101bd803bbd732acd7305a42a4442838

Languages: English (EN-GB)

Similarity search is a core operation of many critical data science applications, involving massive collections of high-dimensional objects. Similarity search finds objects in a collection close to a given query according to some definition of sameness. Objects can be data series, text, multimedia, graphs, database tables or deep network embeddings. In this tutorial, we revisit the similarity search problem in light of the recent advances in the field and the new big data landscape. We discuss key data science applications that require efficient high-dimensional similarity search, we survey the state-of-the-art high-dimensional similarity search approaches and share surprising insights about their strengths and weaknesses, and we discuss the challenges and open research problems in this area. © 2021 IEEE.
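As an illustration of the definition in the abstract (finding the objects in a collection closest to a query), the following is a minimal sketch of exact k-nearest-neighbor search by exhaustive scan. It is not taken from the tutorial: the function names `euclidean` and `knn_search`, the choice of Euclidean distance, and the sample vectors are all assumptions made for this example. The approaches the tutorial surveys exist precisely to avoid this kind of full scan on large collections.

```python
import heapq
import math


def euclidean(a, b):
    """Euclidean distance between two equal-length numeric vectors."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def knn_search(collection, query, k=1, distance=euclidean):
    """Return the k (index, distance) pairs closest to the query.

    Hypothetical helper: scans every object, so the cost grows linearly
    with both the collection size and the dimensionality.
    """
    scored = ((distance(obj, query), i) for i, obj in enumerate(collection))
    return [(i, d) for d, i in heapq.nsmallest(k, scored)]


# Illustrative data only: three 3-dimensional vectors and one query.
collection = [(0.0, 0.0, 0.0), (1.0, 1.0, 1.0), (5.0, 5.0, 5.0)]
query = (0.9, 1.1, 1.0)
nearest = knn_search(collection, query, k=2)
```

Here `nearest` holds the indices of the two closest vectors together with their distances, ordered from closest to farthest. Swapping the `distance` argument changes the "definition of sameness" without touching the search loop.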


Last updated on 2021-11-26 at 23:16