Leveraging Computer Vision to Establish a Correlation Between Eye-Gaze Estimation and Saccades

Authors

  • Aarav Sharma Archbishop Mitty High School
  • Prof. Isabel Hyo Jung Song

DOI:

https://doi.org/10.47611/jsrhs.v12i3.4476

Keywords:

Computer Vision, Eye Gaze Estimation, Saccades, MTCNN, Multi-task Cascaded Convolutional Neural Network (MTCNN), OpenCV, Head Impulse Test, Real-Time Gaze Estimation in Natural Environments (RT-GENE), RT-GENE, Machine Learning

Abstract

The purpose of this project was to develop a model that can detect pupils on a subject’s face and draw the graph of both pupils to determine if a subject has saccades. Current methods cannot accurately detect contours when the face is presented at an angle. To resolve this issue, subjects recorded a video while performing the Head Impulse Test (HIT). Next, a computer vision library, Opencv, extracted frames from the recorded video and detected the facial key point for the nose. The Multi-task Cascaded Convolutional Neural Network (MTCNN) extracted the face from the frame and generated contours for the eyes using a segmentation library. The largest contours on the mask were divided into two parts and their extreme bounding box points were identified using dilation, erosion, and blur. Our model rendered a video of both contours applied to the patient performing the HIT test. Also, the model generated two graphs, comparing each eye’s gaze with the pose estimation. There were two main outputs: the graph may resemble a y = -x line if a patient does not have saccades or the graph may be more distorted if the patient has saccades due to a sudden shift in eye-gaze movement. This study demonstrates a precise method of eye-gaze estimation and the detection of saccades. Further experiments will help to validate the notion that saccades are connected to neurological disorders. Future experiments may include more data during the training process and to quantitatively determine the accuracy of our model.

Downloads

Download data is not yet available.

References or Bibliography

Jay, Doctor. “Northoak Chiropractic.” Northoak Chiropractic, Northoak Chiropractic, 5 May 2022, https://northoakchiro.com/blog/2022/03/14/saccades-brain-injuries/#:~:text=Saccades%20are%20eye%20movements%20that,of%20gaze%20fixation%20to%20another .

Danaadmin. “The Eyes Are Windows into the Brain.” Dana Foundation, Dana Foundation, 7 Aug. 2019, https://dana.org/article/the-eyes-are-windows-into-the-brain/.

Kaspersky. “What Is Facial Recognition – Definition and Explanation.” Www.kaspersky.com, 9 Feb. 2022, https://www.kaspersky.com/resource-center/definitions/what-is-facial-recognition .

Halmagyi, G. M., et al. “The Video Head Impulse Test.” Frontiers, Frontiers, 1 Jan. 1AD, https://www.frontiersin.org/articles/10.3389/fneur.2017.00258/full .

Kerber, Kevin A. “Vertigo and Dizziness in the Emergency Department.” Emergency Medicine Clinics of North America, U.S. National Library of Medicine, Feb. 2009, https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2676794/ .

Anto, et al. “Facial Landmarks with Dlib, Opencv, and Python.” PyImageSearch, 3 July 2021, https://pyimagesearch.com/2017/04/03/facial-landmarks-dlib-opencv-python/ .

“Measuring Saccade Latency Using Smartphone Cameras.” IEEE Xplore, https://ieeexplore.ieee.org/document/8703178.

JD;, Wong EC;Pasquesi L;Steenerson KK;Sharon. “A Broader View of Video Head Impulse Tests-Reframing Windows.” JAMA Otolaryngology-- Head & Neck Surgery, U.S. National Library of Medicine, https://pubmed.ncbi.nlm.nih.gov/33270083/.

IS;, Halmagyi GM;Curthoys. “A Clinical Sign of Canal Paresis.” Archives of Neurology, U.S. National Library of Medicine, https://pubmed.ncbi.nlm.nih.gov/3390028/ .

Mantokoudis G;Saber Tehrani AS;Kattah JC;Eibenberger K;Guede CI;Zee DS;Newman-Toker DE; “Quantifying the Vestibulo-Ocular Reflex with Video-Oculography: Nature and Frequency of Artifacts.” Audiology & Neuro-Otology, U.S. National Library of Medicine, https://pubmed.ncbi.nlm.nih.gov/25501133/ .

Contributor, TechTarget. “What Is Polynomial Interpolation? - Definition from Whatis.com.” WhatIs.com, TechTarget, 26 Apr. 2013, https://www.techtarget.com/whatis/definition/polynomial-interpolation#:~:text=Polynomial%20interpolation%20is%20a%20method,can%20be%20made%20by%20interpolation .

Janky KL;Patterson JN;Shepard NT;Thomas MLA;Honaker JA; “Effects of Device on Video Head Impulse Test (Vhit) Gain.” Journal of the American Academy of Audiology, U.S. National Library of Medicine, https://pubmed.ncbi.nlm.nih.gov/28972467/#:~:text=There%20was%20not%20an%20effect,gain%20values%20between%20devices%2Falgorithms .

“Dizziness.” Mayo Clinic, Mayo Foundation for Medical Education and Research, 15 Oct. 2020, https://www.mayoclinic.org/diseases-conditions/dizziness/symptoms-causes/syc-20371787#:~:text=Dizziness%20is%20a%20term%20used,reasons%20adults%20visit%20their%20doctors .

Suh MW;Park JH;Kang SI;Lim JH;Park MK;Kwon SK; “Effect of Goggle Slippage on the Video Head Impulse Test Outcome and Its Mechanisms.” Otology & Neurotology : Official Publication of the American Otological Society, American Neurotology Society [and] European Academy of Otology and Neurotology, U.S. National Library of Medicine, https://pubmed.ncbi.nlm.nih.gov/27956722/#:~:text=Results%3A%20The%20most%20common%20slippage,%2C%20and%204)%20deceleration%20bumps .

Agarwal, Vardan. “Real-Time Head Pose Estimation in Python.” Medium, Towards Data Science, 30 July 2020, https://towardsdatascience.com/real-time-head-pose-estimation-in-python-e52db1bc606a .

Palmero, Cristina, et al. “Recurrent CNN for 3D Gaze Estimation Using Appearance and Shape Cues.” ArXiv.org, 17 Sept. 2018, https://arxiv.org/abs/1805.03064.

Published

08-31-2023

How to Cite

Sharma, A., & Hyo Jung Song, . I. (2023). Leveraging Computer Vision to Establish a Correlation Between Eye-Gaze Estimation and Saccades. Journal of Student Research, 12(3). https://doi.org/10.47611/jsrhs.v12i3.4476

Issue

Section

HS Research Projects