WEBVTT Cue text fragment with voice markup mapped to HTML element with @title for annotation. 1 00:00:00.000 --> 00:00:30.500 align:start position:20% Bear is Coming!!!!! Text span with a class and an annotation. 2 00:00:31.000 --> 00:01:00.500 align:start position:20% I said Bear is coming!!!! 3 00:01:01.000 --> 00:02:00.500 align:start position:20% I said Bear is coming now!!!!