3D gaze control of robot for navigation and object manipulation

Inventors

ZHANG, XiaoliLi, Songpo

Assignees

Colorado School of Mines

Interested in licensing this patent?

MTEC can help explore whether this patent might be available for licensing for your application.

Publication Number

US-10157313-B1

Patent

Publication Date

2018-12-18

Expiration Date


Abstract

Disclosed herein are devices, methods, and systems for controlling a robot or assistive device that allows the robot or device to find and manipulate objects in a real world environment. The disclosed devices, methods, and systems may, in many cases, receive control input through monitoring, tracking, and analyzing the 3D gaze of a user/controller. Using the described 3D eye tracking, a user can directly and naturally look at an object of interest in the real world, while the system monitors, tracks, and records the user's 3D gaze position. This information, in many cases, is then translated into input commands for the robot or assistive.

Core Innovation

The invention relates to creating control output signals from visual information by monitoring one or more eyes of a user gazing at a target object. The system determines and measures movement of the eye and pupil, analyzes the movement to identify intentional movement and unintentional movement, and translates the intentional movement into data describing three dimensional movement of the user's gaze. The intentional movement data are translated into an output signal, thereby creating an output signal.

The disclosed approach estimates each eye’s visual axis from pupil and glint features extracted from a 3D binocular, head-mounted eye-tracking system. It computes a 3D gaze point in space using visual axis intersection or the shortest distance between visual axes, and derives a gaze vector and gaze distance. This produces three dimensional gaze data suitable for forming control output signals.

The disclosure addresses the “Midas touch problem” by distinguishing intentional manipulatory gaze from scanning using fuzzy-logic/neural-network fusion for analyzing intentional versus unintentional eye movements. The resulting 3D gaze data are translated into robot/assistive-device control via rule-based navigation and object-manipulation strategies, including a Mamdani-type fuzzy controller.

Claims Coverage

The independent claim covers a method that converts monitored eye and pupil movement into control output signals by distinguishing intentional versus unintentional movement and translating intentional movement into three-dimensional gaze data and then into an output signal. Dependent claims further narrow how the three-dimensional gaze metrics are computed and how the analysis and use of the output signal are performed, including fuzzy logic and transferring the output signal to a robot or assistive device.

Intentional and unintentional eye movement identification and translation into three dimensional gaze data

Monitoring one or more eyes of a user gazing at a target object; determining and measuring movement of the eye and pupil; analyzing the movement to identify intentional movement and unintentional movement; translating the movement into data describing three dimensional movement of the user's gaze; translating the data from the intentional movement into an output signal; thereby creating an output signal.

Per-eye visual axis determination and three dimensional gaze geometry

Calculating a visual axis for each eye by computing visual axes separately for the user’s left and right eyes, and determining a three dimensional gaze geometry from those visual axes.

Gaze vector derivation from visual axis intersection

Using the intersection to calculate a gaze vector from the visual axes.

Gaze distance computation along the gaze vector

Using the gaze vector to compute a gaze distance along that gaze vector.

Fuzzy logic based analysis for distinguishing intentional versus unintentional movement

Using fuzzy logic for analyzing the movement to identify intentional movement and unintentional movement.

Transferring the output signal to a robot or assistive device

Transferring the output signal to a robot or an assistive device.

Overall claim coverage centers on generating control output signals from eye and pupil movement by explicitly identifying intentional versus unintentional movement, translating intentional movement into three-dimensional gaze data, optionally using fuzzy logic for the intentional/unintentional analysis, and transferring the resulting output signal to a robot or assistive device.

Stated Advantages

Addresses the “Midas touch problem” by distinguishing intentional manipulatory gaze from scanning.

Enables robot/assistive-device control based on three dimensional gaze data.

Provides fuzzy-logic control for robot speed and heading for obstacle avoidance using gaze-related variables and blink frequency.

Includes safety rules for object manipulation actions using “take to/get” object manipulation.

Documented Applications

Robot navigation and obstacle avoidance using rule-based gaze control informed by three-dimensional gaze data and a Mamdani-type fuzzy controller.

Object manipulation with a robot/assistive device using safety rules for “take to/get” object manipulation actions.

JOIN OUR MAILING LIST

Stay Connected with MTEC

Keep up with active and upcoming solicitations, MTEC news and other valuable information.