I frequently use a speech processing technique called vocoding for my
research. This technique became famous in the psychoacousticians' world
because it can simulate important aspects of cochlear implant listening.
More generally, it is a neat tool that allows us fine control over
acoustic features for perceptual experiments.
What
perceptual features of noise-, sine-, pulse-, or pointy-vocoded sound
are important in challenging acoustic environments?
How can vocoding help answer questions about energetic and informational masking?
Have a look at some example spectrograms for the utterance "Mike"
Related Work:
Kidd, Jr., G., Streeter, T. M., Ihlefeld,
A., Maddox, R. K., and
Mason, C. R. (2009) "The intelligibility of Pointillistic Speech," J.
Acoust. Soc. Am. 126, 196-201 (EL). pdf
Ihlefeld A. and
Shinn-Cunningham B.G. (2008), "Disentangling the
effects of spatial cues on selection and formation of auditory objects
,"
J. Acoust. Soc. Am. 123, 124, 2224-2235. pdf
Ihlefeld A. and
Shinn-Cunningham B.G. (2008), "Spatial release
from energetic and informational masking in divided listening," J.
Acoust.
Soc. Am. 123, 4380. pdf
Ihlefeld A. and
Shinn-Cunningham B.G. (2008), "Spatial release
from energetic and informational masking in selective listening," J.
Acoust. Soc. Am. 123, 4369. pdf
Shinn-Cunningham B.G, Ihlefeld
A., Satyavarta, Larson E.(2005).
" Bottom-up and top-down influences on spatial unmasking," Acta
Acustica united with Acustica 91, 967. pdf