Semi-Active Control of a Shear Building based on Reinforcement Learning: Robustness to measurement noise and model error

التفاصيل البيبلوغرافية
العنوان: Semi-Active Control of a Shear Building based on Reinforcement Learning: Robustness to measurement noise and model error
المؤلفون: Jedlińska, Aleksandra, Pisarski, Dominik, Mikułowski, Grzegorz, Błachowski, Bartłomiej, Jankowski, Łukasz
المساهمون: Institute of Fundamental Technological Research (IPPT), Polska Akademia Nauk = Polish Academy of Sciences = Académie polonaise des sciences (PAN), The authors gratefully acknowledge the support of the National Science Centre, Poland, granted under the grant agreement 2020/39/B/ST8/02615., Maria Ganzha, Leszek Maciaszek, Marcin Paprzycki, Dominik Ślęzak
المصدر: Annals of Computer Science and Information Systems ; 18th Conference on Computer Science and Intelligence Systems ; https://hal.science/hal-04282960 ; 18th Conference on Computer Science and Intelligence Systems, Sep 2023, Warsaw, Poland. pp.1007-1010, ⟨10.15439/2023F8946⟩
بيانات النشر: HAL CCSD
سنة النشر: 2023
المجموعة: Archive ouverte HAL (Hyper Article en Ligne, CCSD - Centre pour la Communication Scientifique Directe)
مصطلحات موضوعية: structural control, semi-active control, reinforcement learning, tuned mass damper (TMD), [INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG], [SPI.MECA]Engineering Sciences [physics]/Mechanics [physics.med-ph]
جغرافية الموضوع: Warsaw, Poland
الوصف: This paper considers structural control by reinforcement learning. The aim is to mitigate vibrations of a shear building subjected to an earthquake-like excitation and fitted with a semi-active tuned mass damper (TMD). The control force is coupled with the structural response, making the problem intrinsically nonlinear and challenging to solve using classical methods. Structural control by reinforcement learning has not been extensively explored yet. Here, Deep-Q-Learning is used, which appriximates the Q-function with a neural network and optimizes initially random control sequences through interaction with the controlled system. For safety reasons, training must be performed using an inevitably inexact numerical model instead of the real structure. It is thus crucial to assess the robustness of the control with respect to measurement noise and model errors. It is verified to significantly outperform an optimally tuned conventional TMD, and the key outcome is the high robustness to measurement noise and model error.
نوع الوثيقة: conference object
اللغة: English
Relation: hal-04282960; https://hal.science/hal-04282960; https://hal.science/hal-04282960/document; https://hal.science/hal-04282960/file/8946.pdf
DOI: 10.15439/2023F8946
الاتاحة: https://hal.science/hal-04282960
https://hal.science/hal-04282960/document
https://hal.science/hal-04282960/file/8946.pdf
https://doi.org/10.15439/2023F8946
Rights: http://creativecommons.org/licenses/by/ ; info:eu-repo/semantics/OpenAccess
رقم الانضمام: edsbas.64B98761
قاعدة البيانات: BASE