Scalable Machine Learning on High-Dimensional Vectors: From Data Series to Deep Network Embeddings

TitreScalable Machine Learning on High-Dimensional Vectors: From Data Series to Deep Network Embeddings
Publication TypeConference Paper
Year of Publication2020
AuthorsEchihabi, K, Zoumpatianos, K, Palpanas, T
Conference NameACM International Conference Proceeding Series
Mots-clésApplication requirements, Astrophysics, Deep learning, Designing techniques, Embeddings, High-dimensional, Intelligent systems, Knowledge extraction, Learning systems, Machine learning techniques, Real time systems, Research problems, Scalable machine learning, Semantics, Similarity search, Vectors, Video recording
Abstract

There is an increasingly pressing need, by several applications in diverse domains, for developing techniques able to analyze very large collections of static and streaming sequences (a.k.a. data series), predominantly in real-time. Examples of such applications come from Internet of Things installations, neuroscience, astrophysics, and a multitude of other scientific and application domains that need to apply machine learning techniques for knowledge extraction. It is not unusual for these applications, for which similarity search is a core operation, to involve numbers of data series in the order of hundreds of millions to billions, which are seldom analyzed in their full detail due to their sheer size. Such application requirements have driven the development of novel similarity search methods that can facilitate scalable analytics in this context. At the same time, a host of other methods have been developed for similarity search of high-dimensional vectors in general. All these methods are now becoming increasingly important, because of the growing popularity and size of sequence collections, as well as the growing use of high-dimensional vector representations of a large variety of objects (such as text, multimedia, images, audio and video recordings, graphs, database tables, and others) thanks to deep network embeddings. In this work, we review recent efforts in designing techniques for indexing and analyzing massive collections of data series, and argue that they are the methods of choice even for general high-dimensional vectors. Finally, we discuss the challenges and open research problems in this area. © 2020 Owner/Author.

URLhttps://www.scopus.com/inward/record.uri?eid=2-s2.0-85091520891&doi=10.1145%2f3405962.3405989&partnerID=40&md5=b3b9f8c365e940d7a488d9a9a20df419
DOI10.1145/3405962.3405989
Revues: 

Partenaires

Localisation

Suivez-nous sur

         

    

Contactez-nous

ENSIAS

Avenue Mohammed Ben Abdallah Regragui, Madinat Al Irfane, BP 713, Agdal Rabat, Maroc

  Télécopie : (+212) 5 37 68 60 78

  Secrétariat de direction : 06 61 48 10 97

        Secrétariat général : 06 61 34 09 27

        Service des affaires financières : 06 61 44 76 79

        Service des affaires estudiantines : 06 62 77 10 17 / n.mhirich@um5s.net.ma

        CEDOC ST2I : 06 66 39 75 16

        Résidences : 06 61 82 89 77

Contacts

    

    Compteur de visiteurs:640,923
    Education - This is a contributing Drupal Theme
    Design by WeebPal.