posted on 2023-01-30, 09:17authored byDeclan Quinn
In this thesis, Blind Source Separation is studied in the context of both instantaneous and
convolutive mixtures of musical sources. In order to appreciate how the application of
BSS techniques to music may differ from application to speech samples, the statistical and
spectral characteristics of commercial musical recordings are determined. Furthermore,
the extent to which certain assumptions underlying many BSS (or, more generally, ICA)
techniques are valid is investigated.
The use of short-time Fourier Transform techniques in BSS is discussed, and the use of
wavelet and wavelet-packet techniques to perform BSS is investigated. The most efficient
wavelets to use for the decomposition of musical sources are determined, and measures of
the sparsity of transformed sample data are compared for different transform techniques.
A number of strategies for reducing the computational complexity of the application of
BSS techniques are developed (principally around the use of wavelet transforms).
Summaries of the most popular BSS techniques (for both instantaneous and convolutive
mixtures) are provided, and metrics frequently used to evaluate both signal separation
and reconstruction performance are listed.
To support the work of this thesis, a large library of Matlab routines has been developed.
Throughout the work, clear examples are provided to support the theoretical discussion.
This thesis concludes with a summary of the investigations performed, and an outline of
future research directions suggested by the work contained herein.