Liwei Ren - Sunnyvale CA, US Dehua Tan - Milpitas CA, US Fei Huang - Fremont CA, US Shu Huang - San Jose CA, US Aiguo Dong - Mountain View CA, US
Assignee:
Trend Micro, Inc. - Tokyo
International Classification:
G06F 17/30 G06F 7/00
US Classification:
707 5
Abstract:
A system and a method generates at least one signature associated with document. In one embodiment, a document comprised of text is received and parsed to generate a token set. The token set includes a plurality of tokens. Each token corresponds to the text in the document that is separated by a predefined character characteristic. A score is calculated for each token in the token set based on a frequency and distribution of the text in the document. Each token is then ranked based on the calculated score. A subset of the ranked tokes is selected and a signature is generated for each occurrence of the selected tokens. The selected list of signatures is then output.
Liwei Ren - Sunnyvale CA, US Shu Huang - San Jose CA, US Fei Huang - Fremont CA, US Aiguo Dong - Mountain View CA, US Dehua Tan - Milpitas CA, US
Assignee:
Trend Micro Incorporated - Tokyo
International Classification:
G06F 17/30
US Classification:
707780, 707917
Abstract:
A system generates an output of documents having with a particular relevance range. The system receives an initial document comprising text, a list of documents for matching, each document comprising text, and a minimum substring match length. The system normalizes the text of the documents of the list of documents. The system searches common sub-strings between the text of the initial document and the text of each document of the list of documents. The system calculates a match percentage based on the search common sub-strings and outputs documents having a match percentage corresponding to a predetermined value. Also disclosed is a process for generating an output of documents within a particular relevance range.
Document Matching Engine Using Asymmetric Signature Generation
An automated method of matching an input document to a set of documents from a document repository. A signature database is stored, the signature database including a document identifier and signatures generated by a first signature generator for each of the set of documents. The input document is received and signatures are generated for the input document using a second signature generator, and the signature database is searched using the signatures generated for the input document. The first and second signature generators are configured such that different numbers of signatures are generated for a same document. Other embodiments, aspects and features are also disclosed.
Liwei Ren - Sunnyvale CA, US Shu Huang - San Jose CA, US
Assignee:
Trend Micro Incorporated - Tokyo
International Classification:
G06F 7/00 G06F 17/00
US Classification:
726 26, 707673, 707675, 707602, 704 7, 704 9
Abstract:
A system (and a method) is disclosed for fingerprinting based entity extraction using a rolling hash technique. The system is configured to receive an input stream of a predetermined length comprising characters, and a hash table having indexed entries. The system isolates, through a defined fixed window length, a set of characters of the input stream. A hash key is generated and used to index into the hash table. The system compares the isolated set of characters of the input stream with the entry corresponding to the index into the hash table to determine whether there is an exact match with the entry. The system slides the fixed window length one character to isolate another set of characters of the input stream in response to no exact match from the comparison. Alternatively, the system stores the input stream in response to an exact match from the comparison.
Fei Huang - Fremont CA, US Shu Huang - San Jose CA, US Liwei Ren - Sunnyvale CA, US
Assignee:
Trend Micro Incorporated - Tokyo
International Classification:
G06F 7/04
US Classification:
726 27, 726 2
Abstract:
A system and a method are disclosed for sensitive document management. The system includes one or more agents, a behavior analysis engine, a local policy engine, and a local matching service. The method identifies whether a document is sensitive, identifies behaviors applied to the document, determines whether the document contains sensitive information and determines whether to allow the identified behavior to continue based on security policies.
Graphical User Interface Based Sensitive Information And Internal Information Vulnerability Management System
Shu Huang - San Jose CA, US Fei Huang - Fremont CA, US Liwei Ren - Sunnyvale CA, US Aiguo Dong - Mountain View CA, US
Assignee:
Trend Micro Incorporated - Tokyo
International Classification:
H04L 9/00 G06F 12/14
US Classification:
709224, 726 22, 713156
Abstract:
A system and method provides a graphical user interface (GUI) for users to monitor and manage sensitive information within an enterprise network. The GUI can provide users with information, such as the presence of input/output devices (I/O device), the location of documents containing sensitive information (sensitive documents), and the status of local security policy. The GUI can also provide users with real-time information, such as the occurrence of local security policy violations, the life-cycle of sensitive documents, and the sensitive information dynamic flow within the enterprise network.
Liwei Ren - Sunnyvale CA, US Dehua Tan - Milpitas CA, US Fei Huang - Sunnyvale CA, US Shu Huang - San Jose CA, US Aiguo Dong - Mountain View CA, US
Assignee:
Trend Micro Incorporated - Tokyo
International Classification:
G06F 17/30
US Classification:
707694
Abstract:
A system and a method generates at least one signature associated with document. In one embodiment, a document comprised of text is received and parsed to generate a token set. The token set includes a plurality of tokens. Each token corresponds to the text in the document that is separated by a predefined character characteristic. A score is calculated for each token in the token set based on a frequency and distribution of the text in the document. Each token is then ranked based on the calculated score. A subset of the ranked tokes is selected and a signature is generated for each occurrence of the selected tokens. The selected list of signatures is then output.
Two Tiered Architecture Of Named Entity Recognition Engine
A system (and a method) is disclosed to extract entity values from texts. The system receives, at a first tier entity recognition engine, an input data string having a plurality of entities. The first tier entity recognition engine marks entities of the plurality of entities that are regular expression and transmits the input data stream with the marked entities to a second tier entity recognition engine. The second tier entity recognition engine receives the input data stream and identifies unmarked entities in the input data stream received at the second tier entity recognition engine. The second tier entity recognition engine determines whether the unmarked entities comprise a predetermined data format, and if so, outputs those unmarked entities of the plurality of entities that comprise the predetermined data format.
Metrohealth Medical Center Physical Medicine & Rehabilitation 2500 Metrohealth Dr, Cleveland, OH 44109 (216)7784414 (phone), (216)7787766 (fax)
Education:
Medical School Natl Taiwan Univ Coll of Med, Taipei, Taiwan (385 02 Prior 1/71) Graduated: 1970
Procedures:
Neurological Testing Occupational Therapy Evaluation Physical Medicine and Rehabilitation, Tests and Measurements Physical Therapy Physical Therapy Evaluation
Languages:
Chinese English Spanish
Description:
Dr. Huang graduated from the Natl Taiwan Univ Coll of Med, Taipei, Taiwan (385 02 Prior 1/71) in 1970. She works in Cleveland, OH and specializes in Physical Medicine & Rehabilitation. Dr. Huang is affiliated with MetroHealth Medical Center.
Amd
Senior Tax Analyst
Ey
Tax Senior
Ey Oct 2017 - Sep 2019
Tax Staff
Rina Accountancy Corporation Jun 2017 - Aug 2017
Accounting Intern
Ey Jan 2017 - Mar 2017
Tax Intern
Education:
California Polytechnic State University - San Luis Obispo 2013 - 2017
Bachelors, Bachelor of Science, Business Administration
Oakland Technical High School 2013
Oakland Technical Senior High 2009 - 2013
Skills:
Teamwork Microsoft Excel Public Speaking Powerpoint Time Management Microsoft Office Customer Service Accounting Event Planning Social Media Facebook Access Leadership Quickbooks Shopify Project Planning Google Analytics Microsoft Publisher Microsoft Access