Digit recognition by lip-movements and time recursive Neural Networks

From ISLAB/CAISR
Title Digit recognition by lip-movements and time recursive Neural Networks
Summary The project aims to recognize digits by lip movements and neural networks
Keywords
TimeFrame HT2019
References
Prerequisites Good knowledge in Image Analysis and Computer Vision in 3D
Author
Supervisor Josef Bigun, Kevin Hernandez-Diaz, Fernando Alonso-Fernandez
Level Master
Status Open


The project aims to use lip-movements to recognize digits visually only. This is interesting to verification of liveness in biometric identification as well as recognizing password utterences in public (=soundless speech). The work will study AI techniques in combination with Computer Vision techniqes, e.g. convolutional neural networks with short time memory and optical flow.

Liveness verification is an important issue in Biometrics. It corresponds to verifying that the signal , e.g. face-image, fingerprint, audio-speech, on which biometric recognition is based is authentic, coming from a live person, in contrast to a synthetic signal, including (dis)playing a video or speech from memory.

Also, audio utterances of words or digits when prompted for in public, such as trains, and busses, are not suitable to be used as passwords, for a variety of reasons. This includes overhearing by others, but also because speech is difficult to be used as biometric source, beside password information, for identification due to environment noise, e.g. vehicle noise. Accordingly, recognizing digits by lip-movements can be a way to mitigate the issues posed by speech in public, noisy, and crowded places.