Kenneth M. Karakotsios - San Jose CA, US; Kah Kuen Fu - Sunnyvale CA, US; Volodymyr V. Ivanchenko - Mountain View CA, US; Mingjing Huang - San Francisco CA, US
International Classification:
G06K 9/00
US Classification:
382/118
Abstract:
The computational resources needed to perform processes such as image recognition can be reduced by determining appropriate frames of image information to use for the processing. In some embodiments, infrared imaging can be used to determine when a person is looking substantially towards a device, such that an image frame captured at that time will likely be adequate for facial recognition. In other embodiments, sound triangulation or motion sensing can be used to assist in determining which captured image frames to discard and which to select for processing based on any of a number of factors indicative of a proper frame for processing.
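The frame-selection idea in this abstract can be illustrated with a minimal sketch. The `FrameInfo` fields and the `max_motion` threshold are hypothetical names chosen for illustration; the abstract does not specify how the infrared or motion signals are represented.

```python
from dataclasses import dataclass


@dataclass
class FrameInfo:
    index: int
    facing_device: bool  # hypothetical flag, e.g. derived from infrared imaging
    motion: float        # hypothetical device-motion magnitude from a motion sensor


def select_frames(frames, max_motion=0.2):
    """Return indices of frames worth passing to facial recognition.

    A frame is kept only when the person appears to face the device and
    the device is steady enough; all other frames are discarded unprocessed.
    """
    return [f.index for f in frames if f.facing_device and f.motion <= max_motion]


frames = [
    FrameInfo(0, False, 0.0),  # person not looking at the device
    FrameInfo(1, True, 0.5),   # looking, but too much motion (likely blur)
    FrameInfo(2, True, 0.1),   # looking and steady
]
selected = select_frames(frames)
```

Only the selected frames would then be sent to the expensive recognition step, which is where the resource savings would come from.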
Kenneth M. Karakotsios - San Jose CA, US; Volodymyr V. Ivanchenko - Mountain View CA, US
International Classification:
G06K 9/00
US Classification:
382/103, 382/115
Abstract:
A user can be identified and/or authenticated to an electronic device by analyzing aspects of a motion or gesture made by that user. At least one imaging element of the device can capture image information including the motion or gesture, and can determine time-dependent information about that motion or gesture in two or three dimensions of space. The time-dependent information can be used to identify varying speeds, motions, and other such aspects that are indicative of a particular user. The way in which a gesture or motion is made, in addition to the motion or gesture itself, can be used to authenticate an individual user. While other persons can learn the basic gesture or motion, the way in which each person makes that gesture or motion will generally be at least slightly different, which can be used to prevent unauthorized access to sensitive information, protected functionality, or other such content.
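As a rough illustration of using time-dependent information, the sketch below compares the speed profile of an attempted gesture with an enrolled one. The sample format `(t, x, y)`, the function names, and the 25% tolerance are assumptions made for this example, not details from the abstract, and the comparison assumes both gestures have the same number of samples.

```python
import math


def speed_profile(samples):
    """Speeds between consecutive (t, x, y) samples of a tracked gesture."""
    speeds = []
    for (t0, x0, y0), (t1, x1, y1) in zip(samples, samples[1:]):
        dt = t1 - t0
        if dt <= 0:
            raise ValueError("timestamps must be strictly increasing")
        speeds.append(math.hypot(x1 - x0, y1 - y0) / dt)
    return speeds


def matches_enrolled(enrolled, attempt, tolerance=0.25):
    """True when the attempt's speeds stay close to the enrolled speeds.

    Closeness is the mean absolute speed difference, relative to the
    enrolled gesture's mean speed (an illustrative criterion only).
    """
    a, b = speed_profile(enrolled), speed_profile(attempt)
    if not a or len(a) != len(b):
        return False
    mean_speed = sum(a) / len(a)
    mean_diff = sum(abs(p - q) for p, q in zip(a, b)) / len(a)
    return mean_speed > 0 and mean_diff <= tolerance * mean_speed


enrolled = [(0, 0, 0), (1, 3, 4), (2, 3, 14)]   # speeds 5 and 10
same_path_slower = [(0, 0, 0), (2, 3, 4), (4, 3, 14)]  # same shape, half speed
```

Here `same_path_slower` traces the same shape as `enrolled` but is rejected because its timing differs, which mirrors the point that the way a gesture is made matters as well as the gesture itself.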
Howard D. LOOK - Palo Alto CA, US; Leo B. BALDWIN - Livermore CA, US; Kenneth M. KARAKOTSIOS - San Jose CA, US; Dennis HODGE - Bellevue WA, US; Isaac S. NOBLE - Soquel CA, US; Volodymyr V. IVANCHENKO - Mountain View CA, US; Jeffrey P. BEZOS - Greater Seattle Area WA, US
Assignee:
Amazon Technologies, Inc. - Reno NV
International Classification:
G09G 5/00 G06T 15/80
US Classification:
345/426, 345/619, 345/419
Abstract:
Image information displayed on an electronic device can be modified based at least in part upon a relative position of a user with respect to a device. Mapping, topological or other types of positional data can be used to render image content from a perspective that is consistent with a viewing angle for the current relative position of the user. As that viewing angle changes, as a result of movement of the user and/or the device, the content can be re-rendered or otherwise updated to display the image content from a perspective that reflects the change in viewing angle. Simulations of effects such as parallax and occlusions can be used with the change in perspective to provide a consistent user experience that provides a sense of three-dimensional content even when that content is rendered on a two-dimensional display. Lighting, shading and/or other effects can be used to enhance the experience.
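The parallax effect mentioned in this abstract follows from simple projection geometry. The sketch below is one possible formulation under stated assumptions: the screen is a flat plane, the viewer is offset sideways from the screen center, and a content layer is imagined at some depth behind the screen. The function and parameter names are hypothetical.

```python
def parallax_shift(viewer_offset, viewer_distance, depth_behind_screen):
    """On-screen shift of a point imagined behind the display plane.

    With the viewer at lateral offset `viewer_offset` and distance
    `viewer_distance` in front of the screen, the line of sight to a point
    `depth_behind_screen` behind the screen crosses the screen at
    offset * depth / (distance + depth). All lengths share one unit.
    """
    if viewer_distance <= 0:
        raise ValueError("viewer_distance must be positive")
    if depth_behind_screen < 0:
        raise ValueError("depth_behind_screen must be non-negative")
    return viewer_offset * depth_behind_screen / (viewer_distance + depth_behind_screen)


near_layer = parallax_shift(10, 50, 0)    # content on the screen plane
mid_layer = parallax_shift(10, 50, 50)
far_layer = parallax_shift(10, 50, 150)
```

Layers farther behind the screen shift more as the viewer moves, so redrawing each layer with its own shift gives the impression of depth on a two-dimensional display.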
Volodymyr V. Ivanchenko - Mountain View CA, US; Geoffrey Scott Heller - Seattle WA, US; Daniel Bibireata - Seattle WA, US
Assignee:
Amazon Technologies, Inc. - Reno NV
International Classification:
G06K 9/48 G06K 9/34
US Classification:
382/176, 382/200
Abstract:
Various approaches for providing textual information to an application, system, or service are disclosed. In particular, various embodiments enable a user to capture an image with a camera of a portable computing device. The computing device is capable of taking the image and processing it to recognize, identify, and/or isolate the text in order to forward the text to an application or function. The application or function can then utilize the text to perform an action in substantially real-time. The text may include an email, phone number, URL, an address, and the like and the application or function may be dialing the phone number, navigating to the URL, opening an address book to save contact information, displaying a map to show the address, and so on. Adaptive thresholding can be used to account for variations across an image, in order to improve the accuracy and efficiency of text recognition processes.
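The abstract does not say which adaptive thresholding variant is used. As an illustration only, the sketch below implements a common local-mean variant: each pixel is compared with the average of its own neighborhood rather than with one global cutoff, so uneven lighting across the image matters less. The function name, `window`, and `offset` are assumptions.

```python
def adaptive_threshold(image, window=3, offset=0):
    """Binarize a grayscale image (list of rows of ints) by local mean.

    A pixel becomes 1 (dark foreground, such as text) when it is darker
    than the mean of its window x window neighborhood minus `offset`.
    """
    height = len(image)
    width = len(image[0]) if height else 0
    # Integral image with a zero border: integral[y][x] = sum of image[:y][:x]
    integral = [[0] * (width + 1) for _ in range(height + 1)]
    for y in range(height):
        row_sum = 0
        for x in range(width):
            row_sum += image[y][x]
            integral[y + 1][x + 1] = integral[y][x + 1] + row_sum
    half = window // 2
    result = [[0] * width for _ in range(height)]
    for y in range(height):
        y0, y1 = max(0, y - half), min(height, y + half + 1)
        for x in range(width):
            x0, x1 = max(0, x - half), min(width, x + half + 1)
            area = (y1 - y0) * (x1 - x0)
            total = (integral[y1][x1] - integral[y0][x1]
                     - integral[y1][x0] + integral[y0][x0])
            # Compare pixel * area with the window sum to avoid division
            if image[y][x] * area < total - offset * area:
                result[y][x] = 1
    return result


sample = [
    [100, 100, 100],
    [100, 20, 100],
    [100, 100, 100],
]
binary = adaptive_threshold(sample)
```

The integral image keeps the cost per pixel constant regardless of window size, which is one reason this variant is popular on mobile hardware.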
- Mountain View CA, US; Volodymyr Ivanchenko - Redwood City CA, US; Yu Zhang - Mountain View CA, US
International Classification:
G06K 9/62 G06T 7/32 G06T 7/73 G06K 9/00
Abstract:
Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for determining an alignment between cross-modal sensor data. In one aspect, a method comprises: obtaining (i) an image that characterizes a visual appearance of an environment, and (ii) a point cloud comprising a collection of data points that characterizes a three-dimensional geometry of the environment; processing each of a plurality of regions of the image using a visual embedding neural network to generate a respective embedding of each of the image regions; processing each of a plurality of regions of the point cloud using a shape embedding neural network to generate a respective embedding of each of the point cloud regions; and identifying a plurality of region pairs using the embeddings of the image regions and the embeddings of the point cloud regions.
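The final step of this abstract, identifying region pairs from the two sets of embeddings, can be sketched as a nearest-neighbor match. This example assumes the embeddings have already been produced by the two networks, and it uses cosine similarity with a fixed cutoff; both the similarity measure and the `min_similarity` value are assumptions, since the abstract does not specify them.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two equal-length vectors (0.0 if either is zero)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def match_regions(image_embeddings, cloud_embeddings, min_similarity=0.9):
    """Pair each image region with its most similar point cloud region.

    Returns (image_index, cloud_index) pairs; image regions whose best
    match falls below `min_similarity` are left unpaired.
    """
    pairs = []
    for i, img_vec in enumerate(image_embeddings):
        best_j, best_sim = None, min_similarity
        for j, pc_vec in enumerate(cloud_embeddings):
            sim = cosine_similarity(img_vec, pc_vec)
            if sim >= best_sim:
                best_j, best_sim = j, sim
        if best_j is not None:
            pairs.append((i, best_j))
    return pairs


image_embeddings = [[1, 0], [0, 1], [1, 1]]   # toy 2-D embeddings
cloud_embeddings = [[0, 2], [3, 0]]
pairs = match_regions(image_embeddings, cloud_embeddings)
```

The resulting pairs could then feed an alignment estimate between the camera and the point cloud sensor; that step is outside this sketch.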
- Reno NV, US; Volodymyr V. Ivanchenko - Mountain View CA, US
International Classification:
G06F 1/16 G06F 1/26 G06F 1/18
Abstract:
Additional components or functionality can be provided for a computing device by coupling the device with one or more device skins. Each device skin can include one or more components that can be utilized by the computing device, such that a user of the computing device can obtain desired functionality by selecting an appropriate skin. The components of the skin can be powered by the computing device, such as through wired or wireless power coupling. The computing device can determine an appropriate control scheme for the skin when the skin is coupled with the device, enabling a user to utilize the various components of the skin as if those components were part of the computing device.
Apparatus And Method For Noise Reduction In Depth Images During Object Segmentation
- Suwon-si, KR; Volodymyr IVANCHENKO - Mountain View CA, US
International Classification:
G06T 5/00 G06T 15/08 G06T 3/60 G06T 7/00
Abstract:
An apparatus and method for reducing noise in a depth image are provided. The method includes capturing an image, generating a depth map of the image, the depth map including depth information corresponding to at least one object included in the image, performing over-segmentation of the depth map to generate segmentation information corresponding to the image, determining at least one segment to be noise according to the over-segmentation, and deleting the segment determined to be noise.
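The segment-then-delete flow in this abstract can be sketched as follows. This is a simplified stand-in: it groups neighboring pixels with similar depth by flood fill and treats very small groups as noise. The abstract does not state the segmentation algorithm or the noise criterion, so `tolerance`, `min_size`, and the size-based rule are assumptions.

```python
from collections import deque


def remove_small_depth_segments(depth, tolerance=5, min_size=3, invalid=0):
    """Return a copy of a depth map with small isolated segments deleted.

    Pixels are grouped with 4-connected neighbors whose depth differs by at
    most `tolerance`; segments with fewer than `min_size` pixels are treated
    as noise and overwritten with `invalid`.
    """
    height = len(depth)
    width = len(depth[0]) if height else 0
    labels = [[-1] * width for _ in range(height)]
    cleaned = [row[:] for row in depth]
    next_label = 0
    for sy in range(height):
        for sx in range(width):
            if labels[sy][sx] != -1:
                continue
            # Breadth-first flood fill collects one segment
            labels[sy][sx] = next_label
            queue = deque([(sy, sx)])
            members = []
            while queue:
                y, x = queue.popleft()
                members.append((y, x))
                for ny, nx in ((y - 1, x), (y + 1, x), (y, x - 1), (y, x + 1)):
                    if (0 <= ny < height and 0 <= nx < width
                            and labels[ny][nx] == -1
                            and abs(depth[ny][nx] - depth[y][x]) <= tolerance):
                        labels[ny][nx] = next_label
                        queue.append((ny, nx))
            if len(members) < min_size:
                for y, x in members:
                    cleaned[y][x] = invalid
            next_label += 1
    return cleaned


noisy = [
    [100, 100, 100],
    [100, 250, 100],
    [100, 100, 100],
]
cleaned = remove_small_depth_segments(noisy)
```

The isolated depth spike in the center forms its own one-pixel segment and is removed, while the surrounding surface is left untouched.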
- Reno NV, US; Volodymyr V. Ivanchenko - Mountain View CA, US
International Classification:
G06F 1/16
US Classification:
361/679.41
Abstract:
Additional components or functionality can be provided for a computing device by coupling the device with one or more device skins. Each device skin can include one or more components that can be utilized by the computing device, such that a user of the computing device can obtain desired functionality by selecting an appropriate skin. The components of the skin can be powered by the computing device, such as through wired or wireless power coupling. The computing device can determine an appropriate control scheme for the skin when the skin is coupled with the device, enabling a user to utilize the various components of the skin as if those components were part of the computing device.
Google
Software Engineer, Accessibility
Google
Software Engineer
Eyefluence Apr 2014 - Jun 2015
Software Engineer
Samsung Sep 2012 - Mar 2014
SW Senior Engineer
Amazon Nov 2010 - Sep 2012
SDE
Education:
University of Rochester 2001 - 2006
Doctorates, Doctor of Philosophy
New Bulgarian University 1998 - 2001
Master of Science, Masters, Cognitive Science
Kharkov State University 1985 - 1990
Bachelors, Bachelor of Science, Biophysics
Skills:
Machine Learning, Computer Vision, Image Processing, Embedded Systems, Algorithms, Video Processing, Software Development, Android, Programming, Signal Processing, Software Engineering, Testing, Simulations, Digital Signal Processors, CUDA, Debugging, Distributed Systems, Java, Embedded Software, GPU, Software Design, Multithreading, Device Drivers