

A prototype lip-reading system that recognizes English phonemes, designed to assist hearing-impaired individuals in situations where sound is unavailable. The model is inspired by the LipNet architecture and was trained on a subset of the GRID corpus (~34,000 short videos).
Demo day video
Tech stack
Python
TensorFlow
OpenCV
Streamlit
Dlib
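
Decoding sketch
LipNet, the architecture this prototype is inspired by, is trained with a CTC loss, so its per-frame outputs have to be collapsed into a label sequence. This write-up does not say whether the prototype decodes the same way, so the snippet below is only a minimal sketch under that assumption. It shows greedy (best-path) CTC decoding in plain Python. The label set `PHONEMES`, the blank index and the toy scores are all hypothetical and not taken from the project.

```python
from itertools import groupby

# Hypothetical label set for illustration; index 0 is reserved for the CTC blank.
BLANK = 0
PHONEMES = ["<blank>", "B", "IH", "N"]


def greedy_ctc_decode(frame_scores, blank=BLANK):
    """Collapse per-frame scores into a label sequence (best-path decoding)."""
    # 1. Pick the highest-scoring label index for every video frame.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__) for scores in frame_scores
    ]
    # 2. Merge consecutive repeats of the same label.
    collapsed = [label for label, _ in groupby(best_path)]
    # 3. Drop the blank label.
    return [label for label in collapsed if label != blank]


# Toy per-frame scores for six frames over the four labels above (made up).
frame_scores = [
    [0.10, 0.80, 0.05, 0.05],  # B
    [0.10, 0.70, 0.10, 0.10],  # B again, merged with the previous frame
    [0.90, 0.05, 0.03, 0.02],  # blank
    [0.10, 0.10, 0.70, 0.10],  # IH
    [0.20, 0.10, 0.10, 0.60],  # N
    [0.10, 0.10, 0.10, 0.70],  # N again, merged with the previous frame
]

decoded = [PHONEMES[i] for i in greedy_ctc_decode(frame_scores)]
print(decoded)
```

Greedy decoding is the simplest choice. A beam search, optionally with a language model, usually gives better transcriptions at a higher computational cost.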