Systems And Methods For Constructing An Artificially Diverse Corpus Of Training Data Samples For Training A Contextually-Biased Model For A Machine Learning-Based Dialogue System
- Ann Arbor MI, US Stefan Larson - Ann Arbor MI, US Christopher Clarke - Ann Arbor MI, US Kevin Leach - Ann Arbor MI, US Jonathan K. Kummerfeld - Ann Arbor MI, US Parker Hill - Ann Arbor MI, US Johann Hauswald - Ann Arbor MI, US Michael A. Laurenzano - Ann Arbor MI, US Lingjia Tang - Ann Arbor MI, US Jason Mars - Ann Arbor MI, US
International Classification:
G06F 40/35 G06N 5/04 G06N 20/00 G06F 40/284
Abstract:
Systems and methods for constructing an artificially diverse corpus of training data includes evaluating a corpus of utterance-based training data samples, identifying a slot replacement candidate; deriving distinct skeleton utterances that include the slot replacement candidate, wherein deriving the distinct skeleton utterances includes replacing slots of each of the plurality of distinct utterance training samples with one of a special token and proper slot classification labels; selecting a subset of the distinct skeleton utterances; converting each of the distinct skeleton utterances of the subset back to distinct utterance training samples while still maintaining the special token at a position of the slot replacement candidate; altering a percentage of the distinct utterance training samples with a distinct randomly-generated slot token value at the position of the slot replacement candidate; and constructing the artificially diverse corpus of training samples based on a collection of the percentage of the distinct utterance training samples.
Systems And Methods For Machine Learning Based Multi Intent Segmentation And Classification
- Ann Arbor MI, US Parker Hill - Ann Arbor MI, US Kevin Leach - Ann Arbor MI, US Sean Stapleton - Ann Arbor MI, US Jonathan K. Kummerfeld - Ann Arbor MI, US Johann Hauswald - Ann Arbor MI, US Michael A. Laurenzano - Ann Arbor MI, US Lingjia Tang - Ann Arbor MI, US Jason Mars - Ann Arbor MI, US
International Classification:
G06F 40/30 G06N 20/00 G06N 7/00 G06F 40/284
Abstract:
Systems and methods for synthesizing training data for multi-intent utterance segmentation include identifying a first corpus of utterances comprising a plurality of distinct single-intent in-domain utterances; identifying a second corpus of utterances comprising a plurality of distinct single-intent out-of-domain utterances; identifying a third corpus comprising a plurality of distinct conjunction terms; forming a multi-intent training corpus comprising synthetic multi-intent utterances, wherein forming each distinct multi-intent utterance includes: selecting a first distinct in-domain utterance from the first corpus of utterances; probabilistically selecting one of a first out-of-domain utterance from the second corpus and a second in-domain utterance from the first corpus; probabilistically selecting or not selecting a distinct conjunction term from the third corpus; and forming a synthetic multi-intent utterance including appending the first in-domain utterance with one of the first out-of-domain utterance from the second corpus of utterances and the second in-domain utterance from the first corpus of utterances.
Systems And Methods For Machine Learning-Based Multi-Intent Segmentation And Classification
- Ann Arbor MI, US Parker Hill - Ann Arbor MI, US Kevin Leach - Ann Arbor MI, US Sean Stapleton - Ann Arbor MI, US Jonathan K. Kummerfeld - Ann Arbor MI, US Johann Hauswald - Ann Arbor MI, US Michael Laurenzano - Ann Arbor MI, US Lingjia Tang - Ann Arbor MI, US Jason Mars - Ann Arbor MI, US
International Classification:
G06F 40/30 G06F 40/284 G06N 20/00 G06N 7/00
Abstract:
Systems and methods for synthesizing training data for multi-intent utterance segmentation include identifying a first corpus of utterances comprising a plurality of distinct single-intent in-domain utterances; identifying a second corpus of utterances comprising a plurality of distinct single-intent out-of-domain utterances; identifying a third corpus comprising a plurality of distinct conjunction terms; forming a multi-intent training corpus comprising synthetic multi-intent utterances, wherein forming each distinct multi-intent utterance includes: selecting a first distinct in-domain utterance from the first corpus of utterances; probabilistically selecting one of a first out-of-domain utterance from the second corpus and a second in-domain utterance from the first corpus; probabilistically selecting or not selecting a distinct conjunction term from the third corpus; and forming a synthetic multi-intent utterance including appending the first in-domain utterance with one of the first out-of-domain utterance from the second corpus of utterances and the second in-domain utterance from the first corpus of utterances.
Suburban Radiologic AssocsSuburban Radiological Consultants Ltd 6545 France Ave S STE 125, Minneapolis, MN 55435 (952)8379700 (phone), (952)8379701 (fax)
Education:
Medical School University of Minnesota Medical School at Minneapolis Graduated: 1990
Languages:
English
Description:
Dr. Leach graduated from the University of Minnesota Medical School at Minneapolis in 1990. He works in Edina, MN and specializes in Diagnostic Radiology. Dr. Leach is affiliated with Fairview Lakes Medical Center, Fairview Ridges Hospital, Fairview Southdale Hospital and University Of Minnesota Masonic Childrens Hospital.
Clinc, Inc.
Senior Research Fellow at the University of Michigan and Research Scientist
University of Virginia Jan 2017 - Aug 2017
Research Scientist
University of Michigan Jan 2017 - Aug 2017
Senior Research Fellow
University of Virginia Aug 1, 2013 - Dec 2016
Phd Candidate
Grammatech May 2016 - Sep 2016
Research Scientist Intern
Education:
University of Virginia 2013 - 2016
Doctorates, Doctor of Philosophy, Computer Engineering, Philosophy
George Mason University 2011 - 2013
Master of Science, Masters, Computer Science
University of Virginia 2007 - 2011
Bachelors, Bachelor of Science, Computer Engineering
Skills:
Latex Microsoft Office Programming Computer Science Java Php Linux Computer Security Computer Architecture Compilers Reverse Engineering C++ Python C/C++ Stl Windows Cil Machine Learning Algorithms C Data Analysis Matlab
Interests:
Scientific Research Weightlifting Web Development Piano Debugging Transparency
Teaching Leadership Microsoft Office Data Analysis Team Building Litigation Legal Writing Research Microsoft Excel New Business Development Public Speaking Healthcare Legal Research Customer Service Strategic Planning Insurance Sales Mediation
2007 to Present Security SpecialistDaimlerChrysler Financial Farmington Hills, MI 2003 to 2007 Executive Protection SpecialistsDaimlerChrysler Financial Farmington Hills, MI 1999 to 2003 Security SpecialistChrysler Financial Farmington Hills, MI 1988 to 1999 Records Clerk/Group Leader
Education:
Northwood University 2009 Bachelor in Business ManagementOakland Community College 1988 to 2007 Automotive Design
Skills:
Administering programs Planning agendas/meetings Planning organizational needs, Analyzing data Writing reports Budgeting expenses, Coaching managing people Investigating problems Teaching/instructing/training individuals Enduring long hours Inspecting physical objects Operating equipment Reporting and summarizing information Delegating responsibilities Determining and defining a problem Analyzing problems Recommending courses of action Conducting interviews Thinking of creative ideas Providing discipline when necessary enforcing rules and regulations Developing a climate of enthusiasm, teamwork, and cooperation Skillfully applying professional knowledge
Googleplus
Kevin Leach
Work:
George Mason University - Graduate Research Assistant (2011)
Education:
George Mason University - Computer Science, University of Virginia - Computer Engineering
Christ the King-Saint Thomas the Apostle School Minneapolis MN 1982-1983, Northgate Elementary School Bloomington MN 1983-1987, Oak Grove Intermediate School Bloomington MN 1987-1989, Olson Middle School Bloomington MN 1989-1991
Bolton, OntarioPresident at Trump Systems Inc. IT Infrastructure and Security Professional, President of Trump Systems Inc., SMB Chair for IAMCP Canada