I frequently use a speech processing technique called vocoding for my
research. This technique became famous in the psychoacousticians' world
because it can simulate important aspects of cochlear implant listening.
More generally, it is a neat tool that allows us fine control over
acoustic features for perceptual experiments.
What
perceptual features of noise-, sine-, pulse-, or pointy-vocoded sound
are important in challenging acoustic environments?
How can vocoding help answer questions about energetic and informational masking?
Have a look at some example spectrograms for the utterance "Mike"
Related Work:
Ihlefeld, A., Shinn-Cunningham,
B. G., and Carlyon, R. P. (in press). "Comodulation masking release for
speech identification with cochlear implants"
Ihlefeld, A., Deeks, J. M., Axon, P. R., and Carlyon, R. P.
(2010) "Simulations of cochlear-implant speech perception in modulated
and unmodulated noise," J. Acoust. Soc. Am. 128, 870-880. pdf
Kidd, Jr., G., Streeter, T. M., Ihlefeld,
A., Maddox, R. K., and
Mason, C. R. (2009) "The intelligibility of Pointillistic Speech," J.
Acoust. Soc. Am. 126, 196-201 (EL). pdf
Ihlefeld A. and
Shinn-Cunningham B.G. (2008), "Disentangling the
effects of spatial cues on selection and formation of auditory objects
,"
J. Acoust. Soc. Am. 123, 124, 2224-2235. pdf
Ihlefeld A. and
Shinn-Cunningham B.G. (2008), "Spatial release
from energetic and informational masking in divided listening," J.
Acoust.
Soc. Am. 123, 4380. pdf
Ihlefeld A. and
Shinn-Cunningham B.G. (2008), "Spatial release
from energetic and informational masking in selective listening," J.
Acoust. Soc. Am. 123, 4369. pdf
Shinn-Cunningham B.G, Ihlefeld
A., Satyavarta, Larson E.(2005).
" Bottom-up and top-down influences on spatial unmasking," Acta
Acustica united with Acustica 91, 967. pdf