Alice Jane B. Brush - Bellevue WA, US Nissanka Arachchige Bodhi Priyantha - Redmond WA, US Jie Liu - Medina WA, US Amy K. Karlson - Bellevue WA, US Hong Lu - Lebanon NH, US
Assignee:
MICROSOFT CORPORATION - Redmond WA
International Classification:
G10L 17/00
US Classification:
704246, 704E17001
Abstract:
Functionality is described herein for recognizing speakers in an energy-efficient manner. The functionality employs a heterogeneous architecture that comprises at least a first processing unit and a second processing unit. The first processing unit handles a first set of audio processing tasks (associated with the detection of speech) while the second processing unit handles a second set of audio processing tasks (associated with the identification of speakers), where the first set of tasks consumes less power than the second set of tasks. The functionality also provides unobtrusive techniques for collecting audio segments for training purposes. The functionality also encompasses new applications which may be invoked in response to the recognition of speakers.