This page demonstrates the tandem algorithm proposed by G. Hu and D.L. Wang. For details of this algorithm see:
Hu G. and Wang D.L. (2008): A tandem algorithm for pitch estimation and voiced speech segregation. IEEE Transactions on Audio, Speech, and Language Processing, vol. 18, pp. 2067-2079.For the second set of demos with a naturalic utterance (including both voiced and unvoiced speech), an unvoiced speech segregation algorithm is additionally used. This unvoiced segregation algorithm is described in:
Hu G. and Wang D.L. (2008): Segregation of unvoiced speech from nonspeech interference. Journal of the Acoustical Society of America, vol. 124, pp. 1306-1319.
Voiced speech segregation (for comparison with earlier systems click here)
Naturalistic speech segregation (SNR = 0 dB)
Noise | Mixture | Segregated speech |
White Noise | ![]() |
![]() |
Rock Music | ![]() |
![]() |
Electric Fan | ![]() |
![]() |
Alarm Clock | ![]() |
![]() |
  Bird Chirp with Water Flow   | ![]() |
![]() |
Wind Noise | ![]() |
![]() |
Rain | ![]() |
![]() |
Cocktail Party | ![]() |
![]() |
Playground | ![]() |
![]() |
Crowd Noise | ![]() |
![]() |