Yes, but the timeline is in sync. The video could also have audio. Another experiment that send many cues based on narration is: