Speaker Inconsistency Detection: Spotting Audio-Visual Inconsistencies (SAVI)





The interactive transcript could not be loaded.


Rating is available when the video has been rented.
This feature is not available right now. Please try again later.
Published on Oct 30, 2018

SRI’s Spotting Audio-Visual Inconsistencies (SAVI) techniques detect tampered videos by identifying discrepancies between the audio and visual tracks. For example, the system can detect when lip synchronization is a little off or if there is an unexplained visual “jerk” in the video. Or it can flag a video as possibly tampered if the visual scene is outdoors, but analysis of the reverberation properties of the audio track indicates the recording was done in a small room.

Approved for Public Release, Distribution Unlimited.

This project is funded by DARPA (contract and funding through AFRL) under DARPA’s Media Forensics (MediFor) Program, Contract #FA8750-16-C-0170.


When autoplay is enabled, a suggested video will automatically play next.

Up next

to add this to Watch Later

Add to

Loading playlists...