Academic Journal

Regional soft error vulnerability and error propagation analysis for GPGPU applications

التفاصيل البيبلوغرافية
العنوان: Regional soft error vulnerability and error propagation analysis for GPGPU applications
المؤلفون: Öz, Işıl, Karadaş, Ömer Faruk
بيانات النشر: Springer
سنة النشر: 2021
مصطلحات موضوعية: Soft error reliability, GPGPU programs, Fault injection
الوصف: The wide use of GPUs for general-purpose computations as well as graphics programs makes soft errors a critical concern. Evaluating the soft error vulnerability of GPGPU programs and employing efficient fault tolerance techniques for more reliable execution become more important. Protecting only the most error-sensitive program regions maintains an acceptable reliability level by eliminating the large performance overheads due to redundant operations. Therefore, fine-grained regional soft error vulnerability analysis is crucial for the systems targeting both performance and reliability. In this work, we present a regional fault injection framework and perform a detailed error propagation analysis to evaluate the soft error vulnerability of GPGPU applications. We evaluate both intra-kernel and inter-kernel vulnerabilities for a set of programs and quantify the severity of the data corruptions by considering metrics other than SDC rates. Our experimental study demonstrates that the code regions inside GPGPU programs exhibit different characteristics in terms of soft error vulnerability and the soft errors corrupting the variables propagate into the program output in several ways. We present the potential impact of our analysis by discussing the usage scenarios after we compile our observations acquired from our empirical work. ; This work was supported by the Scientific and Technological Research Council of Turkey (TuBTAK), Grant No: 119E011.
نوع الوثيقة: article in journal/newspaper
اللغة: English
تدمد: 0920-8542
1573-0484
Relation: Journal of Supercomputing; Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı; https://doi.org/10.1007/s11227-021-04026-6; https://hdl.handle.net/11147/11400; WOS:000687489300002; 2-s2.0-85113722511; Q2; Q3
DOI: 10.1007/s11227-021-04026-6
الاتاحة: https://hdl.handle.net/11147/11400
https://doi.org/10.1007/s11227-021-04026-6
Rights: open
رقم الانضمام: edsbas.28BF9094
قاعدة البيانات: BASE
الوصف
تدمد:09208542
15730484
DOI:10.1007/s11227-021-04026-6