Academic Journal

Outlier preservation by dimensionality reduction techniques

التفاصيل البيبلوغرافية
العنوان: Outlier preservation by dimensionality reduction techniques
المؤلفون: Onderwater, M. (Martijn)
المصدر: International Journal of Data Analysis Techniques and Strategies vol. 7 no. 3, pp. 231-252
بيانات النشر: Inderscience
سنة النشر: 2015
المجموعة: CWI's Institutional Repository (Centrum voor Wiskunde en Informatica)
مصطلحات موضوعية: dimensionality reduction, outlier detection, multidimensional scaling, principal component analysis, t-stochastic neighbourhood embedding, peeling, F1-score, Matthews Correlation, Relative Information Score, sensor network
الوصف: Sensors are increasingly part of our daily lives: motion detection, lighting control, and energy consumption all rely on sensors. Combining this information into, for instance, simple and comprehensive graphs can be quite challenging. Dimensionality reduction is often used to address this problem, by decreasing the number of variables in the data and looking for shorter representations. However, dimensionality reduction is often aimed at normal daily data, and applying it to events deviating from this daily data (so-called outliers) can affect such events negatively. In particular, outliers might go unnoticed.In this paper we show that dimensionality reduction can indeed have a large impact on outliers. To that end we apply three dimensionality reduction techniques to three real-world data sets, and inspect how well they preserve outliers. We use several performance measures to show how well these techniques are capable of preserving outliers, and we discuss the results.
نوع الوثيقة: article in journal/newspaper
وصف الملف: application/pdf
اللغة: English
Relation: https://ir.cwi.nl/pub/22628
DOI: 10.1504/IJDATS.2015.071365
الاتاحة: https://ir.cwi.nl/pub/22628
https://doi.org/10.1504/IJDATS.2015.071365
رقم الانضمام: edsbas.DBC704D1
قاعدة البيانات: BASE
الوصف
DOI:10.1504/IJDATS.2015.071365