Dissertation/ Thesis

BLIND SOURCE SEPARATION FOR AUTOMATIC MUSIC TRANSCRIPTION

التفاصيل البيبلوغرافية
العنوان: BLIND SOURCE SEPARATION FOR AUTOMATIC MUSIC TRANSCRIPTION
المؤلفون: Kurmangaliyev, Bauyrzhan
بيانات النشر: Nazarbayev University School of Engineering and Digital Sciences
سنة النشر: 2024
المجموعة: Nazarbayev University Repository
مصطلحات موضوعية: Type of access: Embargo, Automatic Music Transcription, Gradient Optimization, Latent Dimensionality Estimation, Nonnegative Matrix Factorization, Singular Value Thresholding
الوصف: The primary objective of this project is to develop methods aimed to the conduct the blind signal separation of musical notes with Nonnegative Matrix Factorization (NMF). This is motivated by the fact that music signals are often recorded with a single microphone, hence, there is a need to develop the Automatic Music Transcription (AMT) methods that could mitigate this assumption and produce the desirable separation result. Therefore, this project report presents the rank estimation method for determination of number of musical notes in the recording. It is motivated by the fact that most of the research works on NMF assume \emph{a priori} knowledge regarding the rank of factorization which may not be available in most of the real world scenarios. As a result, the Weighted Singular Value Thresholding based on Stein's Unbiased Risk Estimate (WSVT-SURE) in which rank estimation is performed by non-uniform shrinkage of singular values via weight vector is presented. We also introduce gradient optimization of a smooth approximation of WSVT-SURE (GWSVT-SURE) to estimate the optimal threshold parameter. In the context of AMT, the proposed algorithms allow one to estimate the number of musical note components in the recordings. The proposed algorithms have been evaluated with the polyphonic piano music excerpts. It is observed that the proposed WSVT-SURE algorithm reaches significant improvement in the estimation performance, while GWSVT-SURE shows substantial savings in the computational cost.
نوع الوثيقة: bachelor thesis
اللغة: English
Relation: Kurmangaliyev, B. (2024). Blind Source Separation for Automatic Music Transcription. Nazarbayev University School of Engineering and Digital Sciences; http://nur.nu.edu.kz/handle/123456789/7880
الاتاحة: http://nur.nu.edu.kz/handle/123456789/7880
رقم الانضمام: edsbas.A350EF83
قاعدة البيانات: BASE