Chin-Hui Lee - Basking Ridge NJ Qi P. Li - New Providence NJ Olivier Siohan - New Providence NJ Arun Chandrasekaran Surendran - Highland Park NJ
Assignee:
Lucent Technologies Inc. - Murray Hill NJ
International Classification:
G10L 1514
US Classification:
704246, 704250, 704256
Abstract:
A speaker verification method and apparatus which advantageously minimizes the constraints on the customer and simplifies the system architecture by using a speaker dependent, rather than a speaker independent, background model, thereby obtaining many of the advantages of using a background model in a speaker verification process without many of the disadvantages thereof. In particular, no training data (e. g. speech) from anyone other than the customer is required, no speaker independent models need to be produced, no a priori knowledge of acoustic rules are required, and, no multi-lingual phone models, dictionaries, or letter-to-sound rules are needed. Nonetheless, in accordance with an illustrative embodiment of the present invention, the customer is free to select any password phrase in any language. Specifically, and in accordance with an illustrative embodiment of the present invention, the background model comprises a hidden Markov model having a cruder acoustic resolution than the customer model, which may, for example, be achieved by providing a background model containing fewer states than the customer model.
Automatic Speech Recognition With Psychoacoustically-Based Feature Extraction, Using Easily-Tunable Single-Shape Filters Along Logarithmic-Frequency Axis
Qi P. Li - New Providence NJ Olivier Siohan - New Providence NJ Frank Kao-Ping Soong - Warren NJ
Assignee:
Lucent Technologies Inc. - Murray Hill NY
International Classification:
G10L 1502
US Classification:
704236, 704251
Abstract:
A method and apparatus for extracting speech features from a speech signal in which the linear frequency spectrum data, as generated, for example, by a conventional frequency transform, is first converted to logarithmic frequency spectrum data having frequency data distributed on a substantially logarithmic (rather than linear) frequency scale. Then, a plurality of digital auditory filters is applied to the resultant logarithmic frequency spectrum data, each of these filters having a substantially similar shape, but centered at different points on the logarithmic frequency scale. Because each of the filters have a similar shape, the feature extraction approach of the present invention advantageously can be easily modified or tuned by adjusting each of the filters in a coordinated manner, with the adjustment of only a handful of filter parameters.
Method And Apparatus For Performing Real-Time Endpoint Detection In Automatic Speech Recognition
Chin-Hui Lee - Basking Ridge NJ Qi P. Li - New Providence NJ Jinsong Zheng - Edison NJ Qiru Zhou - Scotch Plains NJ
Assignee:
Lucent Technologies Inc. - Murray Hill NJ
International Classification:
G10L 1500
US Classification:
704248, 704233, 704210, 704215, 704253
Abstract:
A method and apparatus for performing real-time endpoint detection for use in automatic speech recognition. A filter is applied to the input speech signal and the filter output is then evaluated with use of a state transition diagram (i. e. , a finite state machine). The filter is advantageously designed in light of several criteria in order to increase the accuracy and robustness of detection. The state transition diagram advantageously has three states. The endpoints which are detected may then be advantageously applied to the problem of energy normalization of the speech portion of the signal.
Method And Apparatus For Interactive Language Instruction
Katherine Grace August - Matawan NJ, US Nadine Blackwood - Matawan NJ, US Qi P. Li - New Providence NJ, US Michelle McNerney - Freehold NJ, US Chi-Lin Shih - Berkeley Heights NJ, US Arun Chandrasekaran Surendran - Highland Park NJ, US Jialin Zhong - Berkeley Heights NJ, US Qiru Zhou - Scotch Plains NJ, US
Assignee:
Lucent Technologies Inc. - Murray Hill NJ
International Classification:
G09B 5/06 G09B 17/00 G10L 15/22
US Classification:
704270, 704 8, 434167, 434185, 434319
Abstract:
A method and apparatus for interactive language instruction is provided that displays text files for processing, provide key features and functions for interactive learning, displays facial animation, and provides a workspace for language building functions. The system includes a stored set of language rules as part of the text-to-speech sub-system, as well as another stored set of rules as applied to the process of learning a language. The method implemented by the system includes digitally converting text to audible speech, providing the audible speech to a user or student (with the aid of an animated image in selected circumstances), prompting the student to replicate the audible speech, comparing the student's replication with the audible speech provided by the system, and providing feedback and reinforcement to the student by, for example, selectively recording or playing back the audible speech and the student's replication.
Qi Li - New Providence NJ, US Manli Zhu - Pearl River NY, US Joshua J. Hajicek - Astoria NY, US Uday Jain - Lawrenceville NJ, US Yan Yin - Chatham NJ, US Tom R. Burchfield - Orlando FL, US Yan Huang - New Providence NJ, US
Uday Jain - Lawrenceville NJ, US Manli Zhu - Pearl River NY, US Danny Kopit - Brooklyn NY, US Huixian Tang - Oak Ridge NJ, US Qi Li - New Providence NJ, US
Assignee:
LI Creative Technologies, Inc. - Florham Park NJ
International Classification:
1401
US Classification:
D14167
Method And Apparatus For Processing Audio And Speech Signals
A method and device for processing signals representing speech or audio via a plurality of filters that approximate behaviors of the basilar membrane of human cochlea. Each of the plurality of filters is formed from a mother filter via the dilation and a shift in time and has the similar impulse response of the basilar membrane to the frequency band for which the filter represents. Any process can be conducted and any feature can be extracted in the domain of the filters' outputs for applications, such as noise reduction, speech synthesis, coding, and speech and speaker recognition. Processed signals can be synthesized back to the time domain via an inverse cochlear transform.
Name / Title
Company / Classification
Phones & Addresses
Qi Li President
LI Creative Technologies Commercial Physical and Biological Research
30 Vreeland Rd 130, Florham Park, NJ 07932
Qi Li Chairman, Secretary, Treasurer
N&N CHILD DEVELOPMENT CLUB INTERNATIONAL INC
Qi Li President
Sino-Gain International Investment Group Inc
Qi Li President
New Sunny Nails & Spa Inc
136 Bowery, New York, NY 10013 1954 3 St S, Jacksonville, FL 32250 202 Towne Ctr Cir, Sanford, FL 32771
Jul 2014 to Aug 2014 Entry Level Mechanical EngineerStevens Institute of Technology
Feb 2014 to Jun 2014 On Campus ProjectStevens Institute of Technology
Jan 2014 to Jun 2014 Teaching Assistant of Mechanical Engineering DepartmentBall Dispenser Robot Design
Dec 2013 to Feb 2014 On Campus ProjectNavigation, Localization
Sep 2013 to Dec 2013 On Campus Project
Education:
Stevens Institute of Technology Hoboken, NJ May 2014 Master of Engineering in Mechanical EngineeringHarbin Engineering University Harbin, CN Jun 2012 Bachelor of Unclear Engineering
Skills:
Design Software: SolidWorks, Pro/Engineering, AutoCAD, MATLAB, LaTex, Protel99<br/>Languages: C++, C<br/>Hardware: 3D Printer, CNC Machine, PLC, Millling, Lathe, Drilling, and Laser Cutting<br/>Robotics: Localization, Obstacle Avoidance, Robot Programming<br/>Control Design: PID Control, Lead/Lag Control, H 2, H Infinity Design<br/>Mechatronics: Digital & Analog Circuits, Signal Conditioning, User I/O, Sensors, Motors
Lightwave Research Laboratory, Columbia University
Sep 2010 to 2000 Research AssistantColumbia University
Mar 2011 to May 2011 Teaching Assistant, Introduction to Electrical EngineeringLightwave Research Laboratory, Columbia University
Sep 2010 to May 2011 AdvisorSignal & Systems
Sep 2010 to Nov 2010 Teaching AssistantMobile Computing Systems Lab Hong Kong, Hong Kong Island Jun 2009 to May 2010 Undergraduate ResearcherOriental Cable Network
Jun 2007 to Aug 2007 Summer intern
Education:
Columbia University New York, NY Sep 2010 to 2000 Ph.D in Lightwave Research LaboratoryHong Kong University of Science and Technology Hong Kong, Hong Kong Island Sep 2006 to Jun 2010 B.Eng. in Electrical and Computer EngineeringCornell University Ithaca, NY Jan 2009 to May 2009School of Electrical and Computer Engineering
Googleplus
Qi Li (Ricky)
Education:
Lehigh University - Mechanical engineering
Relationship:
Single
Qi Li
Education:
National University of Singapore, Raffles Institution
Shanghai, ChinaArbitrator at Shanghai Arbitration Commission An accomodator and facilitator with a caring heart.
An active attorney admitted in China and New York.