Toward Representation Independent Similarity Search Over Graphs

Bibliographic details
Title: Toward Representation Independent Similarity Search Over Graphs
Authors: Yizhou Sun, Amirhossein Aleyasin, Arash Termehchy, Jose Picado, Yodsawalai Chodpathumwan
Source: Proceedings of Workshop on GRAph Data management Experiences and Systems.
Publication info: ACM, 2014.
Publication year: 2014
Subject terms: Theoretical computer science, Empirical research, Nearest neighbor search, Leverage (statistics), Degree of similarity, Data graph, Data mining, Heuristics, External Data Representation, USable, computer.software_genre, computer, Mathematics
Description: Finding similar entities over data graphs is an important problem with many applications. Current similarity search algorithms use intuitively appealing heuristics that leverage the link information in the data graph to quantify the degree of similarity between its entities. In this paper, using examples from real-world data sets, we show that people represent the same information using data graphs with different shapes. We argue that in order for a similarity search algorithm to be usable and effective, it should be representation independent: it should return essentially the same answers for a query over different graphs that represent the same information. We formalize this property and show that the outcome of current similarity search algorithms depends highly on data representation. Hence, they may be effective on some datasets and ineffective on others. We also perform an empirical study and analyze the sensitivity of current methods to changes in data representation. Our results indicate that the output of these algorithms is highly affected by changes in data representation.
DOI: 10.1145/2621934.2621946
Access URL: https://explore.openaire.eu/search/publication?articleId=doi_________::bac4876c0ea530923245d25c429ca9ee
https://doi.org/10.1145/2621934.2621946
Accession number: edsair.doi...........bac4876c0ea530923245d25c429ca9ee
Database: OpenAIRE