Liwei Ren - Sunnyvale CA, US Dehua Tan - Milpitas CA, US Fei Huang - Fremont CA, US Shu Huang - San Jose CA, US Aiguo Dong - Mountain View CA, US
Assignee:
Trend Micro, Inc. - Tokyo
International Classification:
G06F 17/30 G06F 7/00
US Classification:
707 5
Abstract:
A system and a method generate at least one signature associated with a document. In one embodiment, a document comprised of text is received and parsed to generate a token set. The token set includes a plurality of tokens. Each token corresponds to the text in the document that is separated by a predefined character characteristic. A score is calculated for each token in the token set based on a frequency and distribution of the text in the document. Each token is then ranked based on the calculated score. A subset of the ranked tokens is selected and a signature is generated for each occurrence of the selected tokens. The selected list of signatures is then output.
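The pipeline in this abstract (parse, score, rank, select, sign) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the patent: the function name generate_signatures, the scoring formula (frequency weighted by how far apart the first and last occurrences are), the top_n and window parameters, and the use of a truncated SHA-1 digest as the signature.

```python
import hashlib
import re
from collections import defaultdict


def generate_signatures(text, top_n=3, window=20):
    """Return (token, position, signature) tuples for the top-ranked tokens."""
    # Parse: tokens are runs of word characters, so any non-word
    # character acts as the separator.
    occurrences = defaultdict(list)
    for match in re.finditer(r"\w+", text.lower()):
        occurrences[match.group()].append(match.start())

    # Score: frequency weighted by spread (distance between the first and
    # last occurrence, relative to document length). Assumed formula.
    length = max(len(text), 1)
    scores = {}
    for token, positions in occurrences.items():
        spread = (positions[-1] - positions[0]) / length
        scores[token] = len(positions) * (1.0 + spread)

    # Rank by descending score (ties broken alphabetically) and keep a subset.
    ranked = sorted(scores, key=lambda t: (-scores[t], t))
    selected = ranked[:top_n]

    # Sign: hash a fixed-size window of text starting at each occurrence
    # of every selected token.
    signatures = []
    for token in selected:
        for pos in occurrences[token]:
            chunk = text[pos:pos + window].lower()
            digest = hashlib.sha1(chunk.encode("utf-8")).hexdigest()[:16]
            signatures.append((token, pos, digest))
    return signatures
```

Calling generate_signatures("alpha beta alpha gamma alpha beta", top_n=1) selects the token "alpha" and emits one signature per occurrence, three in total.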
Liwei Ren - Sunnyvale CA, US Shu Huang - San Jose CA, US Fei Huang - Fremont CA, US Aiguo Dong - Mountain View CA, US Dehua Tan - Milpitas CA, US
Assignee:
Trend Micro Incorporated - Tokyo
International Classification:
G06F 17/30
US Classification:
707780, 707917
Abstract:
A system generates an output of documents within a particular relevance range. The system receives an initial document comprising text, a list of documents for matching, each document comprising text, and a minimum substring match length. The system normalizes the text of the documents of the list of documents. The system searches for common sub-strings between the text of the initial document and the text of each document of the list of documents. The system calculates a match percentage based on the common sub-strings found and outputs documents having a match percentage corresponding to a predetermined value. Also disclosed is a process for generating an output of documents within a particular relevance range.
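A rough sketch of the matching step described above follows. It is not the patented search: it substitutes the matching blocks reported by Python's difflib.SequenceMatcher for the common sub-string search, and it assumes that the match percentage is the share of the normalized initial document covered by blocks at least min_len characters long. The function names and the normalization rule (lowercase, punctuation and whitespace collapsed to single spaces) are also assumptions.

```python
import re
from difflib import SequenceMatcher


def normalize(text):
    """Lowercase and collapse punctuation and whitespace to single spaces."""
    lowered = re.sub(r"[^a-z0-9]+", " ", text.lower())
    return " ".join(lowered.split())


def match_percentage(initial, candidate, min_len=5):
    """Percentage of the initial text covered by long enough common blocks."""
    a, b = normalize(initial), normalize(candidate)
    if not a:
        return 0.0
    # autojunk=False disables difflib's popularity heuristic on long inputs.
    blocks = SequenceMatcher(None, a, b, autojunk=False).get_matching_blocks()
    matched = sum(block.size for block in blocks if block.size >= min_len)
    return 100.0 * matched / len(a)


def documents_in_range(initial, documents, low, high, min_len=5):
    """Names of documents whose match percentage lies in [low, high]."""
    return [name for name, text in documents.items()
            if low <= match_percentage(initial, text, min_len) <= high]
```

For example, comparing "The quick brown fox" with "the QUICK brown cat" leaves one shared block, "the quick brown " (16 of 19 normalized characters), so the document would fall inside a range of 80 to 100 percent.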
Document Matching Engine Using Asymmetric Signature Generation
An automated method of matching an input document to a set of documents from a document repository. A signature database is stored, the signature database including a document identifier and signatures generated by a first signature generator for each of the set of documents. The input document is received and signatures are generated for the input document using a second signature generator, and the signature database is searched using the signatures generated for the input document. The first and second signature generators are configured such that different numbers of signatures are generated for a same document. Other embodiments, aspects and features are also disclosed.
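The asymmetric arrangement can be pictured with a small sketch in which the repository side and the query side use different generators over the same text. The abstract does not say which side produces more signatures or how signatures are built, so the choices here are assumptions: signatures are truncated SHA-256 digests of three-word shingles, the repository generator keeps every shingle, and the query generator keeps every second one. All function names are hypothetical.

```python
import hashlib
from collections import Counter


def _shingle_hashes(text, k=3):
    """Hash every run of k consecutive words."""
    words = text.lower().split()
    shingles = (" ".join(words[i:i + k])
                for i in range(max(len(words) - k + 1, 0)))
    return [hashlib.sha256(s.encode("utf-8")).hexdigest()[:12] for s in shingles]


def repository_signatures(text):
    """First generator: one signature per shingle (dense)."""
    return _shingle_hashes(text)


def query_signatures(text, stride=2):
    """Second generator: only every stride-th shingle (sparse)."""
    return _shingle_hashes(text)[::stride]


def build_database(documents):
    """Map each repository signature to the set of document identifiers."""
    database = {}
    for doc_id, text in documents.items():
        for sig in repository_signatures(text):
            database.setdefault(sig, set()).add(doc_id)
    return database


def match(database, text):
    """Return (doc_id, hit_count) pairs, best match first."""
    hits = Counter()
    for sig in query_signatures(text):
        for doc_id in database.get(sig, ()):
            hits[doc_id] += 1
    return hits.most_common()
```

With a six-word document the dense generator yields four signatures and the sparse generator two, so the same document produces different signature counts on each side while lookups still succeed.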
Liwei Ren - Sunnyvale CA, US Shu Huang - San Jose CA, US
Assignee:
Trend Micro Incorporated - Tokyo
International Classification:
G06F 7/00 G06F 17/00
US Classification:
726 26, 707673, 707675, 707602, 704 7, 704 9
Abstract:
A system (and a method) is disclosed for fingerprinting based entity extraction using a rolling hash technique. The system is configured to receive an input stream of a predetermined length comprising characters, and a hash table having indexed entries. The system isolates, through a defined fixed window length, a set of characters of the input stream. A hash key is generated and used to index into the hash table. The system compares the isolated set of characters of the input stream with the entry corresponding to the index into the hash table to determine whether there is an exact match with the entry. The system slides the fixed window length one character to isolate another set of characters of the input stream in response to no exact match from the comparison. Alternatively, the system stores the input stream in response to an exact match from the comparison.
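The sliding-window lookup can be sketched with a polynomial rolling hash in the style of Rabin-Karp. The constants BASE and MOD, the function names, and the decision to record every exact match and keep sliding are assumptions for illustration; the abstract only states that a hash key indexes the table, that the window slides by one character on a miss, and that the input is stored on an exact match.

```python
BASE = 257
MOD = 1_000_003


def window_hash(s):
    """Polynomial hash of a string, computed from scratch."""
    h = 0
    for ch in s:
        h = (h * BASE + ord(ch)) % MOD
    return h


def build_table(entities, window):
    """Index known entities (all of the fixed window length) by hash key."""
    table = {}
    for entity in entities:
        if len(entity) != window:
            raise ValueError("every entity must have the fixed window length")
        table.setdefault(window_hash(entity), []).append(entity)
    return table


def extract(stream, table, window):
    """Return (position, entity) for each exact match in the stream."""
    found = []
    if len(stream) < window:
        return found
    high = pow(BASE, window - 1, MOD)  # weight of the outgoing character
    h = window_hash(stream[:window])
    for start in range(len(stream) - window + 1):
        candidate = stream[start:start + window]
        # The hash key selects a bucket; the string comparison confirms
        # an exact match and rules out collisions.
        if candidate in table.get(h, ()):
            found.append((start, candidate))
        # Slide one character: drop the leftmost, append the next.
        if start + window < len(stream):
            h = ((h - ord(stream[start]) * high) * BASE
                 + ord(stream[start + window])) % MOD
    return found
```

The rolling update makes each slide constant-time, so the whole scan costs time proportional to the stream length rather than stream length times window length for the hashing step.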
Fei Huang - Fremont CA, US Shu Huang - San Jose CA, US Liwei Ren - Sunnyvale CA, US
Assignee:
Trend Micro Incorporated - Tokyo
International Classification:
G06F 7/04
US Classification:
726 27, 726 2
Abstract:
A system and a method are disclosed for sensitive document management. The system includes one or more agents, a behavior analysis engine, a local policy engine, and a local matching service. The method identifies whether a document is sensitive, identifies behaviors applied to the document, determines whether the document contains sensitive information and determines whether to allow the identified behavior to continue based on security policies.
Graphical User Interface Based Sensitive Information And Internal Information Vulnerability Management System
Shu Huang - San Jose CA, US Fei Huang - Fremont CA, US Liwei Ren - Sunnyvale CA, US Aiguo Dong - Mountain View CA, US
Assignee:
Trend Micro Incorporated - Tokyo
International Classification:
H04L 9/00 G06F 12/14
US Classification:
709224, 726 22, 713156
Abstract:
A system and method provide a graphical user interface (GUI) for users to monitor and manage sensitive information within an enterprise network. The GUI can provide users with information, such as the presence of input/output devices (I/O devices), the location of documents containing sensitive information (sensitive documents), and the status of local security policy. The GUI can also provide users with real-time information, such as the occurrence of local security policy violations, the life-cycle of sensitive documents, and the dynamic flow of sensitive information within the enterprise network.
Liwei Ren - Sunnyvale CA, US Dehua Tan - Milpitas CA, US Fei Huang - Sunnyvale CA, US Shu Huang - San Jose CA, US Aiguo Dong - Mountain View CA, US
Assignee:
Trend Micro Incorporated - Tokyo
International Classification:
G06F 17/30
US Classification:
707694
Abstract:
A system and a method generate at least one signature associated with a document. In one embodiment, a document comprised of text is received and parsed to generate a token set. The token set includes a plurality of tokens. Each token corresponds to the text in the document that is separated by a predefined character characteristic. A score is calculated for each token in the token set based on a frequency and distribution of the text in the document. Each token is then ranked based on the calculated score. A subset of the ranked tokens is selected and a signature is generated for each occurrence of the selected tokens. The selected list of signatures is then output.
Two Tiered Architecture Of Named Entity Recognition Engine
A system (and a method) is disclosed to extract entity values from texts. The system receives, at a first tier entity recognition engine, an input data stream having a plurality of entities. The first tier entity recognition engine marks entities of the plurality of entities that match a regular expression and transmits the input data stream with the marked entities to a second tier entity recognition engine. The second tier entity recognition engine receives the input data stream and identifies unmarked entities in it. The second tier entity recognition engine determines whether the unmarked entities comprise a predetermined data format, and if so, outputs those unmarked entities of the plurality of entities that comprise the predetermined data format.
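The division of labor between the two tiers can be sketched as follows. The specific choices are assumptions made for illustration and do not come from the patent: the first tier marks e-mail addresses with a regular expression, and the second tier treats a valid YYYY-MM-DD calendar date as the predetermined data format, validating it with datetime.strptime so that impossible dates are rejected.

```python
import re
from datetime import datetime

# Tier 1: entities that a regular expression alone can recognize.
TIER1_PATTERNS = {
    "EMAIL": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
}


def first_tier(text):
    """Mark regex-recognizable entities as (start, end, label) spans."""
    marks = []
    for label, pattern in TIER1_PATTERNS.items():
        for m in pattern.finditer(text):
            marks.append((m.start(), m.end(), label))
    return sorted(marks)


def second_tier(text, marks):
    """Check unmarked tokens against a format that needs validation."""
    results = []
    for m in re.finditer(r"\S+", text):
        # Skip any token that overlaps a span already marked by tier 1.
        if any(start < m.end() and m.start() < end for start, end, _ in marks):
            continue
        token = m.group().strip(".,;:()")
        try:
            datetime.strptime(token, "%Y-%m-%d")
        except ValueError:
            continue  # not a real calendar date
        results.append(token)
    return results
```

On the text "Mail bob@example.com before 2024-02-30 or by 2024-03-15." the first tier marks the e-mail address, and the second tier outputs only 2024-03-15, because 2024-02-30 has the right shape but is not a valid date.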
MetroHealth Medical Center Physical Medicine & Rehabilitation 2500 Metrohealth Dr, Cleveland, OH 44109 (216) 778-4414 (phone), (216) 778-7766 (fax)
Education:
Medical School Natl Taiwan Univ Coll of Med, Taipei, Taiwan (385 02 Prior 1/71) Graduated: 1970
Procedures:
Neurological Testing Occupational Therapy Evaluation Physical Medicine and Rehabilitation, Tests and Measurements Physical Therapy Physical Therapy Evaluation
Languages:
Chinese English Spanish
Description:
Dr. Huang graduated from the Natl Taiwan Univ Coll of Med, Taipei, Taiwan (385 02 Prior 1/71) in 1970. She works in Cleveland, OH and specializes in Physical Medicine & Rehabilitation. Dr. Huang is affiliated with MetroHealth Medical Center.
Genentech
Intern
San Francisco State University Jan 2014 - May 2016
Gc and Ms Technician and Undergraduate Researcher
Sri International Jun 2015 - Aug 2015
Reu Student Researcher
Education:
San Francisco State University 2011 - 2016
Bachelors, Biochemistry
Qualcomm Jun 2016 - Sep 2018
Senior Engineer
Qualcomm Nov 2013 - Jun 2016
Engineer
Entropic Communications Jun 2011 - Sep 2012
Intern
Intel Corporation Jun 2011 - Sep 2012
Digital Design Engineer
Education:
Uc San Diego 2011 - 2013
Master of Science, Masters, Design
Southeast University 2007 - 2011
Bachelors, Bachelor of Science, Communications, Engineering, Electronics
Wenzhou Middle School
Skills:
C C++ Spices Multisim Quartus Vcs Astro Dc Primetime Matlab Verilog Vhdl Cadence Virtuoso Ads Asic
Interests:
Hardware Design Analog Circuit Design Mixed Signal Design
Fremont Group Foundation Jun 2009 - Oct 2019
Senior Director of Client Relations and Vice President and Director
Fremont Group Jun 2009 - Oct 2019
Senior Director and Head of Client Relations
Fremont Group Sep 2000 - Mar 2008
Director of Tax Compliance
Fremont Group Sep 2000 - Mar 2008
Vice President and Director
Education:
Golden Gate University
Skills:
Investments Strategic Financial Planning Wealth Management Alternative Investments Due Diligence Finance