University of Limerick
Browse
- No file added yet -

Blind source separation of audio streams using wavelets

Download (15.01 MB)
thesis
posted on 2023-01-30, 09:17 authored by Declan Quinn
In this thesis, Blind Source Separation is studied in the context of both instantaneous and convolutive mixtures of musical sources. In order to appreciate how the application of BSS techniques to music may differ from application to speech samples, the statistical and spectral characteristics of commercial musical recordings are determined. Furthermore, the extent to which certain assumptions underlying many BSS (or, more generally, ICA) techniques are valid is investigated. The use of short-time Fourier Transform techniques in BSS is discussed, and the use of wavelet and wavelet-packet techniques to perform BSS is investigated. The most efficient wavelets to use for the decomposition of musical sources are determined, and measures of the sparsity of transformed sample data are compared for different transform techniques. A number of strategies for reducing the computational complexity of the application of BSS techniques are developed (principally around the use of wavelet transforms). Summaries of the most popular BSS techniques (for both instantaneous and convolutive mixtures) are provided, and metrics frequently used to evaluate both signal separation and reconstruction performance are listed. To support the work of this thesis, a large library of Matlab routines has been developed. Throughout the work, clear examples are provided to support the theoretical discussion. This thesis concludes with a summary of the investigations performed, and an outline of future research directions suggested by the work contained herein.

History

Note

peer-reviewed

Language

English

Usage metrics

    Doctoral

    Categories

    No categories selected

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC