The need for accurate voice source characterisation
is an established goal in speech processing research. Practical
limitations prohibit the widescale use of a glottal source/vocal tract
filter implementation for many speech processing applications. In
coding applications, for example, the transduction of the speech
signal is with non-specialist microphones under diverse and often
adverse conditions. In addition the transmission path and decoding
process introduces further phase distortion. In the case of synthesis
the accurate recording of a phase sensitive database is not overly
problematic, however the extraction of the flow waveform from
such a database is still a non-trivial task and as yet no automatic
inverse filtering technique is readily available. One possible solution
for overcoming the problem of extracting the timing events of the
glottal flow is to implement a frequency domain representation and
parameterization of the glottal flow waveform. An analytical
spectral formulation of an existing time domain glottal model is
presented.
History
Publication
Irish Signals and Systems Conference 2003 (ISSC'03);H. Mahdi, M. J. Hayes eds.;242-246