Binaural speech segregation

Sound Demo

This page demonstrates a sound separation method based on binaural cues extracted from the responses of a KEMAR dummy head. The system was evaluated across multiple source configurations in anechoic conditions.

For more details see:

N. Roman, D. L. Wang and G. J. Brown, "Speech segregation based on sound localization," J. Acoust. Soc. Am., vol. 114, pp. 2236-2252, 2003.
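As a rough illustration of what "binaural cues" can mean in practice, the sketch below estimates two common ones from a pair of ear signals: the interaural time difference (ITD), taken from the peak of the cross-correlation, and the interaural level difference (ILD), taken from the energy ratio. This is a generic broadband sketch and not the procedure of the paper above, which this page does not describe in detail. The function name `binaural_cues`, its parameters, and the 1 ms lag limit are assumptions made for the example.

```python
import numpy as np


def binaural_cues(left, right, fs, max_itd_s=1e-3):
    """Estimate (ITD in seconds, ILD in dB) from one frame of ear signals.

    Hypothetical helper for illustration only. A positive ITD means the
    left-ear signal lags the right-ear signal. A positive ILD means the
    right-ear signal carries more energy than the left-ear signal.
    """
    left = np.asarray(left, dtype=float)
    right = np.asarray(right, dtype=float)

    # Only search lags that are physically plausible for a human-sized head
    # (assumed limit: about 1 ms).
    max_lag = int(round(max_itd_s * fs))

    # Full cross-correlation: c[k] = sum_n left[n + k] * right[n].
    # Zero lag sits at index len(right) - 1 of the output.
    xcorr = np.correlate(left, right, mode="full")
    zero = len(right) - 1
    lags = np.arange(-max_lag, max_lag + 1)
    window = xcorr[zero - max_lag: zero + max_lag + 1]

    # ITD: lag of the cross-correlation peak, converted to seconds.
    itd = lags[np.argmax(window)] / fs

    # ILD: right/left energy ratio in dB (eps avoids log of zero).
    eps = 1e-12
    ild = 10.0 * np.log10((np.sum(right ** 2) + eps) / (np.sum(left ** 2) + eps))
    return itd, ild


if __name__ == "__main__":
    # Synthetic frame: the left ear receives a delayed, attenuated copy.
    rng = np.random.default_rng(0)
    fs = 16000
    source = rng.standard_normal(1024)
    right_ear = source
    left_ear = 0.5 * np.concatenate([np.zeros(5), source[:-5]])
    print(binaural_cues(left_ear, right_ear, fs))
```

A full system would typically compute such cues separately in each frequency channel and time frame, then use them to decide which time-frequency units belong to the target.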

Two-source configuration (Target at 0°; Noise at 30°)

Columns: Target, Noise, Mixture, Reconstructed target. Rows (noise type):

Pure Tone
White Noise
Noise Burst
Cocktail Party
Rock Music
Siren
Trill Telephone
Female Speech
Male Speech
Female Speech

Three-source configuration (Target at 0°; Noise 1 at -30°; Noise 2 at 30°)

Columns: Target, Noise 1, Noise 2, Mixture, Reconstructed target. Rows (noise type):

Pure Tone
White Noise
Noise Burst
Cocktail Party
Rock Music
Siren
Trill Telephone
Female Speech
Male Speech
Female Speech

Speech signals of the training set

ID	Speaker ID	Utterance	Wave Sound
S0	MKLSO	"Primitive tribes have an upbeat attitude"
S1	FCKE0	"Only the best players enjoy popularity"
S2	MCDC0	"Our aim must be to learn as much as to teach"
S3	FEAR0	"Development requires a long-term approach"
S4	FDMS0	"Poets, moreover, dwell on human passions"
S5	FETB0	"Change involves the displacement of form"
S6	FCMM0	"The system works as an impersonal mechanism"
S7	MJWS0	"Most assuredly ideas are invaluable"
S8	MRVG0	"False ideas surfeit another sector of our life"
S9	MJRH0	"But in every period it has been humanism"
