International Business Machines Corporation - Armonk NY
International Classification:
G06F 17/20
US Classification:
704/1, 704/9
Abstract:
A technique for identifying a language in which a computer document is written. Words from the document are compared to words in a plurality of word tables. Each of the word tables is associated with a respective candidate language and contains a selection of the most frequently used words in the language. The words in each word table are selected based on the frequency of occurrence in a candidate language so that each word table covers an equivalent percentage of the associated candidate language. A count is accumulated for each candidate language each time one of the plurality of words from the document is present in the associated word table. In the simple counting embodiment of the invention, the count is incremented by one. The language of the document is identified as the language associated with the count having the highest value.
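The simple counting embodiment described above can be sketched in Python as follows. This is only an illustration: the word tables are tiny hypothetical samples rather than real frequency data, and the names `WORD_TABLES` and `identify_language` are assumptions made for the example, not terms from the patent.

```python
import re

# Hypothetical miniature word tables. Real tables would hold the most
# frequently used words of each language, sized so that each table covers
# an equivalent percentage of typical text in its language.
WORD_TABLES = {
    "english": {"the", "of", "and", "to", "a", "in", "is", "that"},
    "spanish": {"de", "la", "que", "el", "en", "y", "a", "los"},
    "german": {"der", "die", "und", "in", "den", "von", "zu", "das"},
}


def identify_language(text, word_tables=WORD_TABLES):
    """Return (best_language, counts) using simple counting."""
    counts = {language: 0 for language in word_tables}
    # Split the document into lowercase alphabetic words.
    for word in re.findall(r"[^\W\d_]+", text.lower()):
        for language, table in word_tables.items():
            if word in table:
                counts[language] += 1  # increment by one per match
    # The language with the highest count wins (first one wins on a tie).
    best = max(counts, key=counts.get)
    return best, counts
```

Calling `identify_language("La casa de los abuelos")` would credit Spanish for "la", "de" and "los". A word shared by several tables, such as "a" or "in" here, increments every language that lists it.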
Natural Language Determination Using Correlation Between Common Words
Michael John Martino - Austin TX Robert Charles Paulsen - Austin TX
Assignee:
International Business Machines Corporation - Armonk NY
International Classification:
G06F 17/28, G06F 17/21
US Classification:
704/8
Abstract:
The language in which a computer document is written is identified. A plurality of words from the document are compared to words in a word list associated with a candidate language. The words in the word list are a selection of the most frequently used words in the candidate language. A count of matches between words in the document and words in the word list is accumulated for each word in the word list to produce a sample count. The sample count is correlated to a reference count for the candidate language to produce a correlation score for the candidate language. The language of the document is identified based on the correlation score. Generally, there are a plurality of candidate languages. Thus, the comparing, accumulating, correlating and identifying processes are practiced for each language. The language of the document is identified as the candidate language having a reference count which generates the highest correlation score.
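The correlation step can be illustrated with a short Python sketch. The abstract does not say which correlation measure is used, so a Pearson correlation coefficient is assumed here. The reference counts are invented placeholder numbers, and all function and variable names are hypothetical.

```python
import math
import re

# Hypothetical reference counts (occurrences per 1,000 words).
# These numbers are placeholders, not measured frequencies.
REFERENCE = {
    "english": {"the": 62, "of": 36, "and": 29, "to": 26, "a": 23, "in": 21},
    "spanish": {"de": 65, "la": 41, "que": 30, "el": 29, "en": 27, "y": 26},
}


def pearson(xs, ys):
    """Pearson correlation; returns 0.0 if either series is constant."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0
    return cov / math.sqrt(var_x * var_y)


def identify_by_correlation(text, reference=REFERENCE):
    """Return (best_language, scores) from per-word sample counts."""
    words = re.findall(r"[^\W\d_]+", text.lower())
    scores = {}
    for language, ref_counts in reference.items():
        word_list = list(ref_counts)
        # Sample count: one match count per word in the word list.
        sample = [words.count(w) for w in word_list]
        expected = [ref_counts[w] for w in word_list]
        scores[language] = pearson(sample, expected)
    return max(scores, key=scores.get), scores
```

Unlike simple counting, this scores how closely the pattern of matches follows the expected pattern, so a document that hits one common word many times but none of the others scores lower than one whose matches follow the reference distribution.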
Natural Language Determination Using Partial Words
Michael John Martino - Austin TX Robert Charles Paulsen - Austin TX
Assignee:
International Business Machines Corporation - Armonk NY
International Classification:
G06F 17/28, G06F 17/21
US Classification:
704/9
Abstract:
The short and truncated words of a document are compared to word tables of the most frequently used words in each of the respective candidate languages to identify the language in which the document is written. First, a plurality of words from a document is read into a computer memory. Then, words within the plurality of words which exceed a predetermined length are truncated to produce a set of short and truncated words. The set of short and truncated words is compared to words in a plurality of word tables. Each word table is associated with a respective candidate language and contains a selection of the most frequently used words in that language. Although the most frequently used words in most languages tend to be short, those which exceed the predetermined length may be truncated in the word tables. A respective count is accumulated for each candidate language each time one of the set of short and truncated words from the document matches a word in a word table associated with the candidate language. In some embodiments, the count may be weighted by factors related to the frequency of occurrence of the words in the respective candidate languages.
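A rough Python sketch of the truncation idea follows. The predetermined length of four letters, the sample words and the helper names are all assumptions chosen for illustration. The unweighted count is shown, and the optional frequency weighting is left out.

```python
import re

MAX_LEN = 4  # hypothetical "predetermined length"


def shorten(word, max_len=MAX_LEN):
    """Keep short words as-is; truncate longer ones to max_len letters."""
    return word[:max_len]


def build_table(frequent_words, max_len=MAX_LEN):
    """Store frequent words in truncated form, as the document words will be."""
    return {shorten(w, max_len) for w in frequent_words}


# Hypothetical sample tables; real ones would come from frequency data.
TABLES = {
    "english": build_table(["the", "and", "that", "which", "would", "there"]),
    "german": build_table(["der", "und", "nicht", "werden", "einen", "auch"]),
}


def count_partial_matches(text, tables=TABLES, max_len=MAX_LEN):
    """Count, per language, document words whose short form is in the table."""
    counts = dict.fromkeys(tables, 0)
    for word in re.findall(r"[^\W\d_]+", text.lower()):
        key = shorten(word, max_len)
        for language, table in tables.items():
            if key in table:
                counts[language] += 1
    return counts
```

Because both sides are truncated the same way, a table entry such as "werd" matches "werden" in the document, and it would also match any other word with the same first four letters.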
Avoiding Cache Collisions Between Frequently Accessed, Pinned Routines Or Data Structures
Manas Mandal - Austin TX Michael John Martino - Austin TX Bruce Lee Worthington - Austin TX
Assignee:
International Business Machines Corporation - Armonk NY
International Classification:
G06F 12/08
US Classification:
711/118
Abstract:
The performance of a computer system having a faster memory unit and a slower memory unit is improved. Memory locations of the faster memory unit are shared by a plurality of memory locations of the slower memory unit. The frequently accessed routines and data structures in the system are identified. The size of each frequently accessed routine is determined. Each routine is associated with a Moment Value computed according to a size of each routine and a frequency of access of the routine. The Moment Values and the associated routines are sorted in descending order in a sorted Moment Value list so that the routine with the largest Moment Value is first in the sorted Moment Value list. The associated routines are arranged in the order of decreasing Moment Value at memory locations in the slower memory unit of the computer. The performance of the program running on the computer system is improved by reducing contention for faster memory space among the frequently accessed routines.
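The ordering step can be sketched in Python. The abstract only says the Moment Value is computed from a routine's size and access frequency, so the product of the two is used here purely as an assumption. The `Routine` type, the `layout` function and the sample figures are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Routine:
    name: str
    size: int      # bytes occupied by the routine
    accesses: int  # observed access count


def moment_value(routine):
    # ASSUMPTION: size multiplied by access frequency. The abstract does
    # not give the formula, only that both quantities are inputs.
    return routine.size * routine.accesses


def layout(routines, base_address=0):
    """Place routines back to back in order of decreasing Moment Value.

    Returns a list of (name, start_address, moment_value) tuples.
    """
    ordered = sorted(routines, key=moment_value, reverse=True)
    placements = []
    address = base_address
    for routine in ordered:
        placements.append((routine.name, address, moment_value(routine)))
        address += routine.size
    return placements
```

For example, `layout([Routine("memcpy", 256, 9000), Routine("sched", 1024, 4000), Routine("log", 512, 100)])` places `sched` first, then `memcpy`, then `log`. Packing the hottest routines contiguously is one way to keep them from mapping onto the same faster-memory locations, provided their combined size fits in the faster memory.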
Word Storage Table For Natural Language Determination
Michael John Martino - Austin TX Robert Charles Paulsen - Austin TX
Assignee:
International Business Machines Corporation - Armonk NY
International Classification:
G06F 17/21, G06F 17/28
US Classification:
704/1
Abstract:
A language in which a document is written is identified through the use of sets of most frequently used words in each of a plurality of candidate languages. Each set of most frequently used words is stored in a respective set of word tables for a respective candidate language according to letter pairs in each set of most frequently used words. In the preferred embodiment, each word table is an N × N bit table, where each bit represents a given letter pair at a particular place in one of the most frequently used words in one of the candidate languages. Words from the document are compared to the most frequently used words stored in the word tables. A count of the number of matches between the words from the document and the words stored in each respective set of word tables is kept for each respective language. The language of the document is identified as the respective candidate language having the greatest number of matches.
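One possible reading of the letter-pair storage is sketched below in Python. It assumes a 26-letter alphabet (N = 26), one N × N bit table per pair of adjacent letter positions, and each table row held as an integer bitmask. The abstract does not specify which letter pairs are used, so the adjacent-pair choice, the `PairTables` class and the sample words are all assumptions.

```python
import string

ALPHABET = string.ascii_lowercase
N = len(ALPHABET)
INDEX = {ch: i for i, ch in enumerate(ALPHABET)}


class PairTables:
    """A set of N x N bit tables for one candidate language."""

    def __init__(self, words, max_len=8):
        self.max_len = max_len
        # tables[pos][row] is a bitmask over columns; bit `col` is set when
        # some stored word has letter `row` at pos and letter `col` at pos+1.
        self.tables = [[0] * N for _ in range(max_len - 1)]
        self.single = 0  # bitmask for one-letter words
        for word in words:
            self.store(word)

    def _usable(self, word):
        return 0 < len(word) <= self.max_len and all(ch in INDEX for ch in word)

    def store(self, word):
        if not self._usable(word):
            return
        if len(word) == 1:
            self.single |= 1 << INDEX[word]
            return
        for pos in range(len(word) - 1):
            row, col = INDEX[word[pos]], INDEX[word[pos + 1]]
            self.tables[pos][row] |= 1 << col

    def matches(self, word):
        """True if every adjacent letter pair of `word` has its bit set."""
        if not self._usable(word):
            return False
        if len(word) == 1:
            return bool((self.single >> INDEX[word]) & 1)
        return all(
            (self.tables[pos][INDEX[word[pos]]] >> INDEX[word[pos + 1]]) & 1
            for pos in range(len(word) - 1)
        )


def count_matches(words, tables_by_language):
    """Count matching document words for each candidate language."""
    return {
        language: sum(tables.matches(w) for w in words)
        for language, tables in tables_by_language.items()
    }
```

This layout trades exactness for compactness: in this sketch a prefix of a stored word (for example "tha" when "that" is stored) also passes the test, so it behaves like an approximate membership check rather than an exact dictionary.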
Determining A Natural Language Shift In A Computer Document
Michael John Martino - Austin TX Robert Charles Paulsen - Austin TX
Assignee:
International Business Machines Corporation - Armonk NY
International Classification:
G06F 17/27, G06F 17/21
US Classification:
704/8
Abstract:
Language shift points in a computer document written in a plurality of natural languages are determined. An interval is defined on and moved through a text document in a computer memory; the interval contains a portion of the text in the document. As the interval is moved through the document, a probability that the text in the interval is written in each of a plurality of candidate languages is determined for each position of the interval. For the first position of the interval, generally the beginning of the document, a first candidate language is classified as the current language if it has the highest probability of all the candidate languages within the interval. A language shift point in the document is identified where the relative probability of a second candidate language is higher than that of the current language at a new position of the interval. At this point, the second candidate language is classified as the current language in the document after the language shift point.
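The moving-interval idea can be sketched in Python as below. The abstract does not say how the per-language probability is computed, so each language's share of word-table matches inside the interval is used as a stand-in. The word tables, the window size of six words and the function names are hypothetical.

```python
# Hypothetical miniature word tables with no overlapping entries.
TABLES = {
    "english": {"the", "of", "and", "to", "in", "is", "it", "that"},
    "spanish": {"de", "la", "que", "el", "en", "los", "y", "se"},
}


def window_scores(words, tables):
    """Relative score per language for one position of the interval."""
    hits = {lang: sum(w in table for w in words) for lang, table in tables.items()}
    total = sum(hits.values())
    # ASSUMPTION: share of matches stands in for the probability.
    return {lang: (h / total if total else 0.0) for lang, h in hits.items()}


def find_shifts(words, tables=TABLES, window=6):
    """Return (initial_language, [(window_start, new_language), ...])."""
    initial = current = None
    shifts = []
    last_start = max(0, len(words) - window)
    for start in range(last_start + 1):
        scores = window_scores(words[start:start + window], tables)
        best = max(scores, key=scores.get)
        if current is None:
            # First position: the highest-scoring language becomes current.
            initial = current = best
        elif best != current and scores[best] > scores[current]:
            # A second language now outscores the current one: shift point.
            shifts.append((start, best))
            current = best
    return initial, shifts
```

The position reported is the start of the interval in which the second language first overtakes the current one, so the actual boundary lies somewhere inside that interval. A smaller window localizes the shift more precisely at the cost of noisier scores.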
Fast, Efficient Hardware Mechanism For Natural Language Determination
Michael John Martino - Austin TX Robert Charles Paulsen - Austin TX
Assignee:
International Business Machines Corporation - Armonk NY
International Classification:
G06F 17/27
US Classification:
704/9
Abstract:
A language in which a document is written is identified by comparing the words of a document to the most frequently used words in a plurality of candidate languages. The words are stored in a plurality of sets of word tables, each set of word tables for storing a selected set of most frequently used words in a respective candidate language according to letter pairs in the words. In the preferred embodiment, each of the word tables is an N × N bit table, where each bit represents a given letter pair at a particular place in one of the most frequently used words in a respective candidate language. A set of table access registers is used for accessing a respective set of word tables to compare words from the document to words stored in the word tables; each table access register accesses word tables for a respective candidate language. One or more word counting registers count a number of matches for a respective candidate language. A comparator selects a candidate language which corresponds to the word counting register having the highest count as the language in which the document is written.
System For Searching Internet Using Automatic Relevance Feedback
Robert Charles Paulsen - Austin TX Michael John Martino - Austin TX
Assignee:
International Business Machines Corporation - Armonk NY
International Classification:
G06F 17/30
US Classification:
707/6
Abstract:
A method of retrieving documents from a document database is disclosed. A set of documents is retrieved according to a first search statement. A signature is developed for a first retrieved document, and preferably for other documents, by searching for words in the first document and removing common words which occur at a relatively high frequency in the natural language in which the first document is written. The document for which the signature was developed is displayed. Responsive to a user indication that a second search is to be made, a second search statement is derived from the signature of the document. In the preferred embodiment, a "spectrum" of documents is prepared and presented to the user. The signatures of a plurality of documents from the documents retrieved according to the first search statement are developed by searching for words in the documents and removing common words which occur at a relatively high frequency in the natural language in which the documents are written. The spectrum of documents is selected so that the document signatures differ by at least a predetermined amount.
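The signature and spectrum steps can be illustrated with a small Python sketch. The abstract does not define how the difference between signatures is measured, so a Jaccard distance over word sets is assumed. The common-word list, the 0.5 threshold and the function names are hypothetical.

```python
import re

# Hypothetical list of high-frequency English words to strip out.
COMMON_WORDS = {"the", "of", "and", "a", "in", "to", "is", "for", "on", "with"}


def signature(text, common=COMMON_WORDS):
    """Words of the document with the common words removed."""
    words = re.findall(r"[^\W\d_]+", text.lower())
    return {w for w in words if w not in common}


def second_search_statement(sig):
    """Derive a follow-up query from a signature (here: its words, sorted)."""
    return " ".join(sorted(sig))


def distance(sig_a, sig_b):
    # ASSUMPTION: Jaccard distance as the measure of signature difference.
    union = sig_a | sig_b
    return 1 - len(sig_a & sig_b) / len(union) if union else 0.0


def spectrum(documents, min_difference=0.5):
    """Pick documents whose signatures differ by at least min_difference."""
    chosen = []
    for doc in documents:
        sig = signature(doc)
        if all(distance(sig, signature(other)) >= min_difference for other in chosen):
            chosen.append(doc)
    return chosen
```

Given the three documents "the cat and the dog", "a cat and a dog" and "language of the document", the spectrum keeps the first and third: the second has the same signature as the first once common words are removed.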
Cholelithiasis or Cholecystitis, Diverticulitis, Diverticulosis, Gastric Cancer, Intestinal Obstruction
Languages:
English, Spanish
Description:
Dr. Martino graduated from the Univ Di Roma La Sapienza, Fac Di Med E Chirurgia, Roma, Italy in 1983. He works in West Paterson, NJ and 1 other location and specializes in Gastroenterology. Dr. Martino is affiliated with Chilton Medical Center, St Josephs Regional Medical Center and St Josephs Wayne Hospital.
James K Cardi MD Inc, 677 Atwood Ave, Cranston, RI 02920; (401) 942-6500 (phone), (401) 942-6505 (fax)
Languages:
English
Description:
Mr. Martino works in Cranston, RI and specializes in Internal Medicine. Mr. Martino is affiliated with Our Lady Of Fatima Hospital and Roger Williams Medical Center.
Name / Title
Company / Classification
Phones & Addresses
Michael Martino Chief Comp, Chief Compliance Off
FOUR POINTS CAPITAL PARTNERS LLC Investor
232 Madison Ave C/O Michael Martino, New York, NY 10016; 232 Madison Ave, New York, NY 10016; 14 Wall St, New York, NY 10005; PO Box 820831, Houston, TX 77282
Random Certainty Entertainment - CEO/Founder (2006), Madison Media Institute - EMB Slave (2011)
Education:
EMB - Entertainment Media Business, Wilmot Union High School - Rebellion, MMI - Recording and Sound Technology, Life - Failing often to succeed more
Relationship:
In a relationship
About:
I am CEO and Founder of Random Certainty Entertainment - a revolutionary entertainment company founded on innovation and networking. I am a recording artist and am currently enrolled and employed at M...
Tagline:
An Embodiment of Chaos and Creation
Bragging Rights:
EMB Slave, Work with Martin Atkins, Opened for international Horrorcore artist.
Michael Martino
Education:
Loyola University Maryland - BBA - Accounting, Baruch College - NYC - MBA - Finance & Investments, University of Tampa - MSA - Accounting
Michael Martino
Work:
Self Employed - Personal Trainer (2011)
Tagline:
Certified Personal Trainer, Actor, Model
Michael Martino
Education:
Framingham North
Michael Martino (Michael ...
Michael Martino
Tagline:
Father, stepfather, husband, government PR guy and author of www.drymartino.com
Michael Martino
Michael Martino
News
Crews continue to battle wind-driven brush fire on New York’s Long Island
As of Sunday morning, three of the fires had been contained, while one was still burning in the hamlet of Westhampton, according to Michael Martino, a spokesperson for Suffolk county executive Ed Romaine.
Bennett beat out Michael Martino, an inspector for the Nevada commission; Jeffrey C. Mullen, the executive director of the Tennessee commission; and Andy Foster, the executive officer of the California State Athletic Commission, although Foster removed himself from the running earlier in the week.
Date: Apr 18, 2014
Category: U.S.
Source: Google
US Supreme Court declines case over mountaintop cross
The Thomas More Law Center filed a friend of the court brief on behalf of the families of Navy Adm. Jeremiah Denton and Marine Capt. Michael Martino. Denton was a prisoner of war for nearly eight years during the Vietnam War; Martino was killed in Iraq in 2005.
Hedge fund Mason Capital Management's managing member Michael Martino wrote to J. Crew's board on Feb. 11 and asked it to hold out for a higher offer. The firm has a 7.5 percent stake in J. Crew, which is based in New York. They said they would vote against the offer, according to an SEC filing.