Ce Liu - Arlington MA, US Michael Rubinstein - Somerville MA, US
Assignee:
Microsoft Corporation - Redmond WA
International Classification:
G06K 9/68 G06K 9/46
US Classification:
382170, 382195
Abstract:
Techniques for semantically annotating images in a plurality of images, each image in the plurality of images comprising at least one image region. The techniques include identifying at least two similar images including a first image and a second image, identifying corresponding image regions in the first image and the second image, and assigning, using at least one processor, annotations to image regions in one or more images in the plurality of images by using a metric of fit indicative of a degree of match between the assigned annotations and the corresponding image regions. The metric of fit may depend on at least one annotation for each image in a subset of the plurality of images and the identified correspondence between image regions in the first image and the second image.
Michael Rubinstein - Somerville MA, US Eugene Inghaw Shih - Brookline MA, US John V. Guttag - Lexington MA, US Frederic Durand - Somerville MA, US William T. Freeman - Acton MA, US
Assignee:
Massachusetts Institute of Technology - Cambridge MA
International Classification:
G06T 7/00
US Classification:
382128
Abstract:
In one embodiment, a method of amplifying temporal variation in at least two images comprises examining pixel values of the at least two images. The temporal variation of the pixel values between the at least two images can be below a particular threshold. The method can further include applying signal processing to the pixel values.
Michael Rubinstein - Somerville MA, US Neal Wadhwa - Cambridge MA, US Fredo Durand - Boston MA, US William T. Freeman - Acton MA, US
Assignee:
Massachusetts Institute of Technology - Cambridge MA
International Classification:
G06K 9/48
US Classification:
382197
Abstract:
In one embodiment, a method of amplifying temporal variation in at least two images includes converting two or more images to a transform representation. The method further includes, for each spatial position within the two or more images, examining a plurality of coefficient values. The method additionally includes calculating a first vector based on the plurality of coefficient values. The first vector can represent change from a first image to a second image of the at least two images describing deformation. The method also includes modifying the first vector to create a second vector. The method further includes calculating a second plurality of coefficients based on the second vector.
Michael Rubinstein - Somerville MA, US Frederic Durand - Boston MA, US William T. Freeman - Acton MA, US Hao-Yu Wu - Cambridge MA, US Eugen Inghaw Shih - Brookline MA, US John V. Guttag - Lexington MA, US
Assignee:
Massachusetts Institute of Technology - Cambridge MA
International Classification:
G06K 9/48
US Classification:
382197
Abstract:
In one embodiment, a method of amplifying temporal variation in at least two images includes converting two or more images to a transform representation. The method further includes, for each spatial position within the two or more images, examining a plurality of coefficient values. The method additionally includes calculating a first vector based on the plurality of coefficient values. The first vector can represent change from a first image to a second image of the at least two images describing deformation. The method also includes modifying the first vector to create a second vector. The method further includes calculating a second plurality of coefficients based on the second vector.
- Mountain View CA, US Michael Rubinstein - Natick MA, US Ariel Ephrat - Efrat, IL William Freeman - Acton MA, US Oran Lang - Givatayim, IL Kevin William Wilson - Cambridge MA, US Tali Dekel - Arlington MA, US Avinatan Hassidim - Petah Tikva, IL
Methods, systems, and apparatus, including computer programs encoded on computer storage media, for audio-visual speech separation. A method includes: obtaining, for each frame in a stream of frames from a video in which faces of one or more speakers have been detected, a respective per-frame face embedding of the face of each speaker; processing, for each speaker, the per-frame face embeddings of the face of the speaker to generate visual features for the face of the speaker; obtaining a spectrogram of an audio soundtrack for the video; processing the spectrogram to generate an audio embedding for the audio soundtrack; combining the visual features for the one or more speakers and the audio embedding for the audio soundtrack to generate an audio-visual embedding for the video; determining a respective spectrogram mask for each of the one or more speakers; and determining a respective isolated speech spectrogram for each speaker.
- Cambridge MA, US Frederic Durand - Somerville MA, US Tianfan Xue - Cambridge MA, US Michael Rubinstein - Somerville MA, US Neal Wadhwa - Mountain View CA, US
International Classification:
G06T 7/20 G02B 27/54 G01P 5/00
Abstract:
An apparatus according to an embodiment of the present invention enables measurement and visualization of a refractive field such as a fluid. An embodiment device obtains video captured by a video camera with an imaging plane. Representations of apparent motions in the video are correlated to determine actual motions of the refractive field. A textured background of the scene can be modeled as stationary, with a refractive field translating between background and video camera. This approach offers multiple advantages over conventional fluid flow visualization, including an ability to use ordinary video equipment outside a laboratory without particle injection. Even natural backgrounds can be used, and fluid motion can be distinguished from refraction changes. Embodiments can render refractive flow visualizations for augmented reality, wearable devices, and video microscopes.
- Cambridge MA, US Michael Rubinstein - Somerville MA, US Eugene Inghaw Shih - Brookline MA, US John V. Guttag - Lexington MA, US Frederic Durand - Somerville MA, US William T. Freeman - Acton MA, US Neal Wadhwa - Mountain View CA, US
Assignee:
Quanta Computer, Inc. - Kuei Shan Hsiang
International Classification:
G06T 7/00
Abstract:
In an embodiment, a method converts two images to a transform representation in a transform domain. For each spatial position, the method examines coefficients representing a neighborhood of the spatial position that is spatially the same across each of the two images. The method calculates a first vector in the transform domain based on first coefficients representing the spatial position, the first vector representing change from a first to second image of the two images describing deformation. The method modifies the first vector to create a second vector in the transform domain representing amplified movement at the spatial position between the first and second images. The method calculates second coefficients based on the second vector of the transform domain. From the second coefficients, the method generates an output image showing motion amplified according to the second vector for each spatial position between the first and second images.
Riesz Pyramids For Fast Phase-Based Video Magnification
- Cambridge MA, US Michael Rubinstein - Newton MA, US Frederic Durand - Somerville MA, US William T. Freeman - Acton MA, US
International Classification:
H04N 5/14 H04N 5/213 H04N 5/20
Abstract:
Some embodiments are directed to a method, corresponding system, and corresponding apparatus for rendering a video and/or image display to amplify small motions through video magnification. Some embodiments include a new compact image pyramid representation, the Riesz pyramid, that may be used for real-time, high-quality phase-based video magnification. Some embodiments are less overcomplete than even the smallest two orientation, octave-bandwidth complex steerable pyramid. Some embodiments are implemented using compact, efficient linear filters in the spatial domain. Some embodiments produce motion magnified videos that are of comparable quality to those using the complex steerable pyramid. In some embodiments, the Riesz pyramid is used with phase-based video magnification. The Riesz pyramid may phase-shift image features along their dominant orientation, rather than along every orientation like the complex steerable pyramid.
Medicine Doctors
Dr. Michael P Rubinstein, Fullerton CA - MD (Doctor of Medicine)
Orthopedic Hand Surgery Orthopedic Surgery Sports Medicine
Address:
Fullerton Orthopaedic Surgery Medical Group, Inc. 101 Laguna Rd Suite A, Fullerton, CA 92835 (714)8790050 (Phone), (714)8790249 (Fax)
Fullerton Orthopaedic Surgery Medical Group Inc 17021 Yorba Linda Blvd Suite 100, Yorba Linda, CA 92886 (714)9966440 (Phone), (714)9965831 (Fax)
Procedures:
Arthritic Hand Reconstruction Arthroscopy, Wrist Carpal Bones & Hand Fractures Carpal Tunnel Surgery Elbow Ligament Reconstruction Elbow Reconstruction & Arthroscopic Elbow/Shoulder Surgery Hand Reconstruction Hand, Elbow Shoulder and Micro Vascular Surgery Shoulder joint replacement Shoulder Reconstruction Shoulder Surgery Tendon Repairs Wrist ligament reconstructions
Conditions:
Arthritis Carpal Tunnel Surgery Fractures Hand & Wrist - Upper Extremities Hand conditions Hand fracture Minimally Invasive Rotator Cuff Repair Shoulder arthritis SHOULDER ARTHROSCOPY Sports Medicine Tennis elbow Trigger Finger Wrist Arthritis Wrist Pain
Certifications:
Hand Surgery Orthopedic Surgery
Awards:
Healthgrades Honor Roll
Languages:
English Spanish
Hospitals:
Fullerton Orthopaedic Surgery Medical Group, Inc. 101 Laguna Rd Suite A, Fullerton, CA 92835
Fullerton Orthopaedic Surgery Medical Group Inc 17021 Yorba Linda Blvd Suite 100, Yorba Linda, CA 92886
Placentia - Linda Hospital 1301 Rose Drive, Placentia, CA 92870
St.Jude Medical Center 101 East Valencia Mesa Drive, Fullerton, CA 92835
Education:
Medical School New York Medical College Graduated: 1982 Medical School Lennox Hill Hospital Graduated: 1983 Medical School Lennox Hill Hospital Graduated: 1987 Medical School Harvard Graduated: 1978
Michael Rubinstein, Yorba Linda CA
Work:
Fullerton Orthopedic Surgery
17021 Yorba Linda Blvd, Yorba Linda, CA 92886 Fullerton Orthopedic Surgery
101 Laguna Rd, Fullerton, CA 92835 Orthopaedic Consultants
1275 N Rose Dr, Placentia, CA 92870
Name / Title
Company / Classification
Phones & Addresses
Michael P. Rubinstein Chief Of Surgery Services
Tenet Healthsystem Medical, Inc Specialty Hospital Medical Doctor's Office General Hospital
1301 N Rose Dr, Placentia, CA 92870 (714)9932000
Michael P. Rubinstein Orthopaedic Surgeon, Surgery-Orthopedic
Fullerton Orthopedic Surgery Medical Group, Inc Orthopedic Surgery Group · Offices of Physicians (except Mental Health Specialists)
101 Laguna Rd, Fullerton, CA 92835 PO Box 5474, Fullerton, CA 92838 (714)8790050
Michael Paul Rubinstein
Michael Rubinstein MD Emergency Medicine · Orthopedics · Hand Doctor · Internist
101 Laguna Rd, Fullerton, CA 92835 (714)8790050
Michael P. Rubinstein Owner
Michael Hall Medical Doctor's Office
18300 Yorba Linda Blvd, Yorba Linda, CA 92886
Michael P. Rubinstein President, Medical Doctor, Owner
MICHAEL P. RUBINSTEIN, M.D., INC Medical Doctor's Office
27015 Glaramara Ln, Yorba Linda, CA 92887 17021 Yorba Linda Blvd, Yorba Linda, CA 92886
Joining Davis on the Siggraph paper are Frdo Durand and Bill Freeman, both MIT professors of computer science and engineering; Neal Wadhwa, a graduate student in Freemans group; Michael Rubinstein of Microsoft Research, who did his Ph.D. with Freeman; and Gautham Mysore of Adobe Research.
Date: Aug 05, 2014
Source: Google
MIT researchers can listen to your conversation by watching your potato chip bag
This particular study grew out of an earlier experiment at MIT, led by Michael Rubinstein, now a postdoctoral researcher at Microsoft Research New England. In 2012, Rubinstein amplified tiny variations in video to detect things like the skin color change caused by the pumping of blood. Studying the
Date: Aug 04, 2014
Category: Sci/Tech
Source: Google
Researchers get audio information out of visual data
Davis' co-authors of the Siggraph paper are Frdo Durand and Bill Freeman, both MIT professors of computer science and engineering; Neal Wadhwa, a graduate student in Freemans group; Michael Rubinstein of Microsoft Research, who did his PhD with Freeman; and Gautham Mysore of Adobe Research.