- Redmond WA, US Sefik Emre ESKIMEZ - Bellevue WA, US Min TANG - Yarrow Point WA, US Hemin YANG - Bellevue WA, US Zirun ZHU - Bellevue WA, US Zhuo CHEN - Woodinville WA, US Huaming WANG - Clyde Hill WA, US Takuya YOSHIOKA - Bellevue WA, US
International Classification:
G10L 21/0208 G10L 25/30 G10L 25/51 G06N 3/08
Abstract:
Systems and methods are provided for generating and operating a speech enhancement model optimized for generating noise-suppressed speech outputs for improved human listening and live captioning. A computing system obtains a speech enhancement model trained on a first training dataset to generate noise-suppressed speech outputs and an automatic speech recognition model trained on a second training dataset to generate transcription labels for spoken language utterances. A third training dataset comprising a set of spoken language utterances is applied to the speech enhancement model to obtain a first noise-suppressed speech output which is applied to the automatic speech recognition model to generate a noise-suppressed transcription output for the set of spoken language utterances. Speech enhancement model parameters are updated to optimize the speech enhancement model to generate optimized noise-suppressed speech outputs based on a comparison of the noise-suppressed transcription output and ground truth transcription labels.
A method for selecting a speech recognition result on a computing device includes receiving a first speech recognition result determined by the computing device, receiving first features, at least some of the features being determined using the first speech recognition result, determining whether to select the first speech recognition result or to wait for a second speech recognition result determined by a cloud computing service based at least in part on the first speech recognition result and the first features.
Integration Of Domain Information Into State Transitions Of A Finite State Transducer For Natural Language Processing
The invention relates to a system and method for integrating domain information into state transitions of a Finite State Transducer (“FST”) for natural language processing. A system may integrate semantic parsing and information retrieval from an information domain to generate an FST parser that represents the information domain. The FST parser may include a plurality of FST paths, at least one of which may be used to generate a meaning representation from a natural language input. As such, the system may perform domain-based semantic parsing of a natural language input, generating more robust meaning representations using domain information. The system may be applied to a wide range of natural language applications that use natural language input from a user such as, for example, natural language interfaces to computing systems, communication with robots in natural language, personalized digital assistants, question-answer query systems, and/or other natural language processing applications.
Integration Of Domain Information Into State Transitions Of A Finite State Transducer For Natural Language Processing
The invention relates to a system and method for integrating domain information into state transitions of a Finite State Transducer (“FST”) for natural language processing. A system may integrate semantic parsing and information retrieval from an information domain to generate an FST parser that represents the information domain. The FST parser may include a plurality of FST paths, at least one of which may be used to generate a meaning representation from a natural language input. As such, the system may perform domain-based semantic parsing of a natural language input, generating more robust meaning representations using domain information. The system may be applied to a wide range of natural language applications that use natural language input from a user such as, for example, natural language interfaces to computing systems, communication with robots in natural language, personalized digital assistants, question-answer query systems, and/or other natural language processing applications.
- Atlanta GA, US Yucel Saygin - Instanbul, TR Min Tang - Redmond WA, US Gokhan Tur - Los Altos CA, US
Assignee:
AT&T Intellectual Property II, L.P. - Atlanta GA
International Classification:
G10L 15/26
US Classification:
704235
Abstract:
An apparatus and a method for preserving privacy in natural language databases are provided. Natural language input may be received. At least one of sanitizing or anonymizing the natural language input may be performed to form a clean output. The clean output may be stored.
Microsoft
Principal Software Design Engineer
Nuance Communications
Senior Principal Software Engineer
Voicebox Technologies Jun 2016 - Apr 2018
Director of Cloud Asr
Voicebox Technologies Dec 2010 - Jun 2016
Speech Architect and Speech and Language Technician Lead and Senior Speech Scientist
Voicebox Technologies Jan 2007 - Dec 2010
Senior Speech Scientist
Education:
University of Colorado Boulder 2000 - 2005
Master of Science, Masters, Computer Science
Institute of Automation, Chinese Academy of Science 1996 - 1999
Masters, Master of Engineering, Computer Science, Engineering
Chinese Academy of Science 1996 - 1999
Master of Science, Masters, Computer Science
University of Science and Technology of China 1991 - 1996
Bachelors
Guilin High School 1988 - 1991
University of Science & Technology Chittagong
Skills:
Natural Language Processing Speech Recognition C++ Machine Learning C Software Engineering Algorithms Perl Python Embedded Systems Software Development Software Project Management Speech Technology Image Processing Visual Studio Product Management Software Design Agile Methodologies
"The stiffness of the silk biomaterial could be tuned to accommodate the cortical neurons and the different types of gels, maintaining both stability in culture and brain-like tissue elasticity," said the paper's first author, Min Tang-Schomer, post-doctoral scholar in biomedical engineering at Tuf
Date: Aug 12, 2014
Category: Health
Source: Google
Lab-grown brain tissue 'doughnuts' could aid research into brain injuries
The tissue maintained viability for at least nine weeks significantly longer than cultures made of collagen or hydrogel alone and also offered structural support for network connectivity that is crucial for brain activity, said Min Tang-Schomer, Tufts University researcher and the principal au
The biggest drawback in the quarter was from corn processing, said Morningstar analyst Min Tang-Varner. The segment operating margin dropped to 8% from 16% in the quarter before that because product prices havent caught up to commodity prices.
Date: May 03, 2011
Category: Business
Source: Google
Googleplus
Min Tang
Lived:
Redmond, Washington, USA Guilin, China Hefei, China Beijing, China Boulder, Colorado, USA
Work:
VoiceBox - Speech scientist Conversay
Education:
University of Science and Technology of China, Institute of Automation, Chinese Academy of Science, Colorado University at Boulder