Blind source separation of audio streams using wavelets

Quinn, Declan

Quinn_2009_Blind.pdf (15.01 MB)

Blind source separation of audio streams using wavelets

thesis

posted on 2023-01-30, 09:17 authored by Declan Quinn

In this thesis, Blind Source Separation is studied in the context of both instantaneous and convolutive mixtures of musical sources. In order to appreciate how the application of BSS techniques to music may differ from application to speech samples, the statistical and spectral characteristics of commercial musical recordings are determined. Furthermore, the extent to which certain assumptions underlying many BSS (or, more generally, ICA) techniques are valid is investigated. The use of short-time Fourier Transform techniques in BSS is discussed, and the use of wavelet and wavelet-packet techniques to perform BSS is investigated. The most efficient wavelets to use for the decomposition of musical sources are determined, and measures of the sparsity of transformed sample data are compared for different transform techniques. A number of strategies for reducing the computational complexity of the application of BSS techniques are developed (principally around the use of wavelet transforms). Summaries of the most popular BSS techniques (for both instantaneous and convolutive mixtures) are provided, and metrics frequently used to evaluate both signal separation and reconstruction performance are listed. To support the work of this thesis, a large library of Matlab routines has been developed. Throughout the work, clear examples are provided to support the theoretical discussion. This thesis concludes with a summary of the investigations performed, and an outline of future research directions suggested by the work contained herein.

History

Note

peer-reviewed

Language

English

Usage metrics

Keywords

Blind Source Separation BSS techniques wireless communication

Licence

CC BY-NC-SA 1.0

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

Blind source separation of audio streams using wavelets

History

Note

Language

Usage metrics

Categories

Keywords

Licence

Exports