Academic Journal
FORTE: Few Samples for Recognizing Hand Gestures with a Smartphone-attached Radar
العنوان: | FORTE: Few Samples for Recognizing Hand Gestures with a Smartphone-attached Radar |
---|---|
المؤلفون: | Chioccarello, Stefano, Sluÿters, Arthur, Testolin, Alberto, Vanderdonckt, Jean, Lambot, Sébastien |
المساهمون: | UCL - SSH/LouRIM - Louvain Research Institute in Management and Organizations, UCL - SST/ELI - Earth and Life Institute, UCL - SST/ICTM - Institute of Information and Communication Technologies, Electronics and Applied Mathematics |
المصدر: | Proceedings of the ACM on Human-Computer Interaction, Vol. 7, no.179, p. 25 (2023) |
بيانات النشر: | Association for Computing Machinery |
سنة النشر: | 2023 |
المجموعة: | DIAL@USL-B (Université Saint-Louis, Bruxelles) |
مصطلحات موضوعية: | Gestural input, Graphical user interfaces, Interactive systems and tools, Cross-validation, Runtime environments, Radio frequency and wireless interconnect |
الوصف: | Radar sensing technologies offer several advantages over other gesture input modalities, such as the ability to reliably sense human movements, a reasonable deployment cost, insensitivity to ambient conditions such as light, temperature, and the ability to preserve anonymity. These advantages come at the price of high processing complexity mainly due to the spatio-temporal variations of gesture articulation performed by different people. Deep learning methods, such as CNN-LSTM and 3D CNN-LSTM, have a high potential to recognize radar-based gestures but usually require hundreds or thousands of labeled training samples and high processing power. Asking a lot of people to acquire a lot of gestures is particularly tedious and tiring to the point of being unrealistic. To overcome these challenges, we propose Forte, a hand gesture recognition with a few samples based on an optimized CNN architecture working on pre-processed raw data. Using a ð‘˜=5-fold cross-validation, we define and compare three alternative CNNs for recognizing hand gestures acquired in a semi-mobile context of use with a portable radar attached to a smartphone. The best CNN reaches an accuracy of 94.96% with a precision of 95.92% and a recall of 96.03% for a dataset composed of solely 5 participants producing 2 samples for 20 classes covering 1 pointing, 2 pantomimic, 3 iconic, and 14 semaphoric gestures. We suggest some implications for designing radar-based gestures and we discuss the limitations of this approach. |
نوع الوثيقة: | article in journal/newspaper |
اللغة: | English |
تدمد: | 2573-0142 |
Relation: | info:eu-repo/grantAgreement/Wallonie-Bruxelles-International//SUB/2021/519018; info:eu-repo/grantAgreement/Fonds de la Recherche Scientifique - FNRS/Mandat aspirant/40001931; info:eu-repo/grantAgreement/Fonds de la Recherche Scientifique - FNRS/Mandat aspirant/40011629; boreal:274722; http://hdl.handle.net/2078.1/274722; urn:ISSN:2573-0142; urn:EISSN:2573-0142 |
DOI: | 10.1145/3593231 |
الاتاحة: | http://hdl.handle.net/2078.1/274722 https://doi.org/10.1145/3593231 |
Rights: | info:eu-repo/semantics/openAccess |
رقم الانضمام: | edsbas.AB093472 |
قاعدة البيانات: | BASE |
تدمد: | 25730142 |
---|---|
DOI: | 10.1145/3593231 |