APPARATUS AND METHOD FOR CONVERTING AN INFORMATION SIGNAL TO A SPECTRAL REPRESENTATION WITH VARIABLE RESOLUTION

التفاصيل البيبلوغرافية
العنوان: APPARATUS AND METHOD FOR CONVERTING AN INFORMATION SIGNAL TO A SPECTRAL REPRESENTATION WITH VARIABLE RESOLUTION
Document Number: 20090100990
تاريخ النشر: April 23, 2009
Appl. No: 11/629594
Application Filed: April 27, 2005
مستخلص: The apparatus for converting an information signal from a time to a variable spectral representation includes a means for windowing the information signal, a means for converting the windowed information signal to a spectral representation, and a means for weighting a set of information signal spectral coefficients with several sets of complex base function coefficients provided from a means for providing the sets of base function coefficients. The sets of base function coefficients are derived from base functions of various frequencies by windowing and transform, wherein several sets of base function coefficients are provided for one and the same base function for base functions of higher frequencies, wherein the windows for providing these sets are related to various time portions of the base function. The variable spectral representation exhibits variable bandwidth of the variable spectral coefficients, which are efficient and accurate to calculate and especially suited for music analysis purposes.
Inventors: Cremer, Markus (Berkeley, CA, US); Derboven, Claas (Ilmenau, DE); Streich, Sebastian (Barcelona, ES)
Claim: 1. An apparatus for converting an information signal, which is given as a series of samples, to a spectral representation with variable spectral coefficients, with a frequency value and a bandwidth being associated with a variable spectral coefficient, and with a frequency spacing of the variable spectral coefficients being variable, comprising: a window filter for windowing the information signal to obtain a windowed block of the information signal having a length in time; a converter for converting the windowed block of samples to a spectral representation having a set of information signal spectral coefficients; a provider for providing a first set of complex base function coefficients, a second set of complex base function coefficients and a third set of complex base function coefficients, wherein the base function coefficients of the first set represent a result of a first windowing and transform of a first base function, which has a frequency corresponding to a first frequency value of a first variable spectral coefficient, wherein the base function coefficients of the second set represent a result of a second windowing and transform of a second base function, which has a frequency corresponding to a second frequency value of a second variable spectral coefficient, and wherein the base function coefficients of the third set represent a result of a third windowing and transform of the second base function, which has the second frequency value, wherein the first windowing, the second windowing and the third windowing differ in that a window length of a window in the first windowing differs from a window length of a window in the second and the third windowing, and that a window position of the second window and of the third window differ with reference to the second base function; and a weighter for weighting the set of information signal spectral coefficients with the first set of base function coefficients, in order to calculate the first variable spectral coefficient, for weighting the set of information signal spectral coefficients with the second set of base function coefficients, in order to obtain the second variable spectral coefficient for a first portion of the windowed block of the information signal, and for weighting the set of information signal spectral coefficients with the third set of base function coefficients, in order to obtain the second variable spectral coefficient for a second portion of the windowed block of the information signal, which is different from the first portion of the windowed block of the information signal.
Claim: 2. The apparatus of claim 1, wherein the information signal is an audio signal with music information and the variable spectral coefficients have frequency values that are halftones of a note system.
Claim: 3. The apparatus of claim 1, wherein the weighter is formed to perform a multiplication of a matrix by the sets of base function coefficients and of a vector by the information signal spectral coefficients.
Claim: 4. The apparatus of claim 1, wherein the window filter is formed to use a rectangular window as audio window.
Claim: 5. The apparatus of claim 1, wherein the windows for the first windowing, the second windowing and the third windowing for determining the base function coefficients are rectangular windows.
Claim: 6. The apparatus of claim 1, wherein a window length of a window for determining the second set of base function coefficients and a window length of a window for determining the third set of base function coefficients are equal and half as long as a window for determining the first set of base function coefficients.
Claim: 7. The apparatus of claim 1, wherein the provider is formed to provide further sets of base function coefficients, which represent the results of further windowings of further base functions, and the number of which is twice as large as a number of sets of base function coefficients for a base function with a lower frequency value.
Claim: 8. The apparatus of claim 1, wherein the provider is formed to provide a further set of base function coefficients for a further base function having a lower frequency value than the frequency value of the first base function, wherein a further window for windowing the further base function is longer than the window for determining the first set of base function coefficients and has a window position different from a window position of the window for determining the first set of base function coefficients.
Claim: 9. The apparatus of claim 8, wherein all base functions have the same reference phase, which is in a predetermined ratio to a window position of the further window.
Claim: 10. The apparatus of claim 8, wherein the window position of an audio window for windowing the information signal coincides with the window position of the further window, and wherein the window filter is formed to window the information signal in overlapping manner.
Claim: 11. The apparatus of claim 1, wherein the window filter is formed to window the information signal so that a window position of an audio window coincides with a window position of a window for determining the first set of base function coefficients and of a window for determining the second set of base function coefficients.
Claim: 12. The apparatus of claim 1, wherein the provider is formed to provide, in a set of base function coefficients, only such base function coefficients that satisfy a criterion, and to set the base function coefficients not satisfying the criterion to zero.
Claim: 13. The apparatus of claim 12, wherein the criterion is given by the fact that a base function coefficient satisfying the criterion, summed with other base function coefficients also satisfying the criterion, is needed to achieve a predetermined percentage of an overall energy of all base function coefficients.
Claim: 14. The apparatus of claim 1, wherein the provider is formed to provide the set of base function coefficients as a result of a selection, wherein the selection at first includes a squaring and summation of all base function coefficients obtained by windowing and transform, and wherein the summation further includes a summation with reference to the size of the squared base function coefficients starting from the greatest base function coefficient, until a summed value has a predetermined percentage of a summed value for all base function coefficients obtained by windowing and transform.
Claim: 15. The apparatus of claim 14, wherein the provider is formed to provide a set of base function coefficients as a result of a scaling, wherein all base function coefficients satisfying the predetermined criterion are weighted with the result of the summation of all base function coefficients obtained by windowing and transform.
Claim: 16. The apparatus of claim 1, wherein a window for determining the third set of base function coefficients immediately follows a window for determining the second set of base function coefficients.
Claim: 17. The apparatus of claim 1, wherein the converter is formed to provide complex spectral coefficients as the set of information signal spectral coefficients.
Claim: 18. The apparatus of claim 1, wherein the converter is formed to perform a discrete Fourier transform and in particular a fast Fourier transform.
Claim: 19. The apparatus of claim 1, wherein the provider is formed to provide sets of base function coefficients so that windows for providing the sets of base function coefficients all have a length that is an integer fraction of a window length of a window for determining the first set of base function coefficients.
Claim: 20. The apparatus of claim 1, wherein the provider is formed to provide the first set of base function coefficients as a result of a windowing with the first window, which has a temporal length of 128 ms, and wherein the provider is further formed to provide the second set of base function coefficients and the third set of base function coefficients as a result of a windowing with a window having a length of 64 ms.
Claim: 21. An apparatus for providing sets of base function coefficients, comprising: a provider for providing a time representation of a first and a second base function, wherein the first base function has a first frequency value, and wherein the second base function has a second frequency value, which is higher than the first frequency value; a window filter for windowing the first base function with a first window and for windowing the second base function with a second window and a third window, wherein the third window relates to a portion of the second base function later in time than the second window; and a transformer for transforming a result of a windowing of the first base function with the first window, in order to obtain a first set of base function coefficients, for transforming a result of a windowing of the second base function with the second window, in order to obtain a second set of base function coefficients, and for windowing a result of a third windowing of the second base function with the third window, in order to obtain a third set of base function coefficients.
Claim: 22. The apparatus of claim 21, further comprising: a selector for selecting base function coefficients from a set of base function coefficients satisfying a predetermined criterion, and for setting base function coefficients not satisfying the predetermined criterion to zero.
Claim: 23. The apparatus of claim 22, wherein the selector is formed to square and sum the base function coefficients, in order to determine an overall energy of the base function coefficients, and in order to select the highest values of the base function coefficients required to obtain a predetermined percentage of the overall energy of all base function coefficients as the base function coefficients satisfying the criterion.
Claim: 24. A method of converting an information signal, which is given as a series of samples, to a spectral representation with variable spectral coefficients, with a frequency value and a bandwidth being associated with a variable spectral coefficient, and with a frequency spacing of the variable spectral coefficients being variable, comprising the steps of: windowing the information signal to obtain a windowed block of the information signal having a length in time; converting the windowed block of samples to a spectral representation having a set of information signal spectral coefficients; providing a first set of complex base function coefficients, a second set of complex base function coefficients and a third set of complex base function coefficients, wherein the base function coefficients of the first set represent a result of a first windowing and transform of a first base function, which has a frequency corresponding to a first frequency value of a first variable spectral coefficient, wherein the base function coefficients of the second set represent a result of a second windowing and transform of a second base function, which has a frequency corresponding to a second frequency value of a second variable spectral coefficient, and wherein the base function coefficients of the third set represent a result of a third windowing and transform of the second base function, which has the second frequency value, wherein the first windowing, the second windowing and the third windowing differ in that a window length of a window in the first windowing differs from a window length of a window in the second and the third windowing, and that a window position of the second window and of the third window differ with reference to the second base function; and weighting the set of information signal spectral coefficients with the first set of base function coefficients, in order to calculate the first variable spectral coefficient, weighting the set of information signal spectral coefficients with the second set of base function coefficients, in order to obtain the second variable spectral coefficient for a first portion of the windowed block of the information signal, and weighting the set of information signal spectral coefficients with the third set of base function coefficients, in order to obtain the second variable spectral coefficient for a second portion of the windowed block of the information signal, which is different from the first portion of the windowed block of the information signal.
Claim: 25. A method of providing sets of base function coefficients, comprising the steps of: providing a time representation of a first and a second base function, wherein the first base function has a first frequency value, and wherein the second base function has a second frequency value, which is higher than the first frequency value; windowing the first base function with a first window and windowing the second base function with a second window and a third window, wherein the third window relates to a portion of the second base function later in time than the second window; and transforming a result of a windowing of the first base function with the first window, in order to obtain a first set of base function coefficients, transforming a result of a windowing of the second base function with the second window, in order to obtain a second set of base function coefficients, and windowing a result of a third windowing of the second base function with the third window, in order to obtain a third set of base function coefficients.
Claim: 26. A computer program with a program code for performing, when the computer program is executed on a computer, a method of converting an information signal, which is given as a series of samples, to a spectral representation with variable spectral coefficients, with a frequency value and a bandwidth being associated with a variable spectral coefficient, and with a frequency spacing of the variable spectral coefficients being variable, comprising the steps of: windowing the information signal to obtain a windowed block of the information signal having a length in time; converting the windowed block of samples to a spectral representation having a set of information signal spectral coefficients; providing a first set of complex base function coefficients, a second set of complex base function coefficients and a third set of complex base function coefficients, wherein the base function coefficients of the first set represent a result of a first windowing and transform of a first base function, which has a frequency corresponding to a first frequency value of a first variable spectral coefficient, wherein the base function coefficients of the second set represent a result of a second windowing and transform of a second base function, which has a frequency corresponding to a second frequency value of a second variable spectral coefficient, and wherein the base function coefficients of the third set represent a result of a third windowing and transform of the second base function, which has the second frequency value, wherein the first windowing, the second windowing and the third windowing differ in that a window length of a window in the first windowing differs from a window length of a window in the second and the third windowing, and that a window position of the second window and of the third window differ with reference to the second base function; and weighting the set of information signal spectral coefficients with the first set of base function coefficients, in order to calculate the first variable spectral coefficient, weighting the set of information signal spectral coefficients with the second set of base function coefficients, in order to obtain the second variable spectral coefficient for a first portion of the windowed block of the information signal, and weighting the set of information signal spectral coefficients with the third set of base function coefficients, in order to obtain the second variable spectral coefficient for a second portion of the windowed block of the information signal, which is different from the first portion of the windowed block of the information signal.
Claim: 27. A computer program with a program code for performing, when the computer program is executed on a computer, a method of providing sets of base function coefficients, comprising the steps of: providing a time representation of a first and a second base function, wherein the first base function has a first frequency value, and wherein the second base function has a second frequency value, which is higher than the first frequency value; windowing the first base function with a first window and windowing the second base function with 2 spend window and a third window, wherein the third window relates to a portion of the second base function later in time than the second window; and transforming a result of a windowing of the first base function with the first window, in order to obtain a first set of base function coefficients, transforming a result of a windowing of the second base function with the second window, in order to obtain a second set of base function coefficients, and windowing a result of a third windowing of the second base function with the third window, in order to obtain a third set of base function coefficients.
Current U.S. Class: 84/623
Current International Class: 10
رقم الانضمام: edspap.20090100990
قاعدة البيانات: USPTO Patent Applications