BBC: The system, which has been trained on thousands of hours of BBC News programmes, has been developed in collaboration with Google's DeepMind AI division. by Rory Cellan-Jones
'"Watch, Attend and Spell", as the system has been called, can now watch silent speech and get about 50% of the words correct. That may not sound too impressive - but when the researchers supplied the same clips to professional lip-readers, they got only 12% of words right.
'Right now the system has limitations - it can only operate on full sentences of recorded video. "We want to get it to work in real time," says Joon Son Chung. "As it keeps watching TV, it will learn." And he says getting the system to work live is a lesser challenge than improving its accuracy.'