Empty cue blocks cannot be parsed #2

bwkimmel · 2024-11-25T02:35:28Z

According to the spec, the payload of a WebVTT cue block consists of one of three options, the first one being WebVTT caption or subtitle cue text, which, in turn, consists of "zero or more WebVTT caption or subtitle cue components, in any order, each optionally separated from the next by a WebVTT line terminator."

Therefore, an empty cue block is valid. In practice, I've encountered several YouTube videos that have such empty cue blocks.

To modify an example from the test cases, the following valid WebVTT file fails to parse:

WEBVTT

00:01.000 --> 00:04.000
- Never drink liquid nitrogen.

00:04.000 --> 00:05.000


00:05.000 --> 00:09.000
- It will perforate your stomach.
- You could die.

Attempting to parse this yields the following error:

ParseError { location: LineCol { line: 7, column: 1, offset: 88 }, expected: ExpectedSet { expected: {"[' ' | '\\t']", "['0' ..= '9']"} } }
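The reported location (line 7, column 1) is the blank line immediately after the second timing line, so the parser appears to require at least one payload line there. For illustration only, here is a minimal line-oriented sketch of accepting a cue whose payload is empty. The `Cue` struct and the `split_cues` helper are hypothetical names, not part of this crate's API, and the sketch ignores cue identifiers, settings, and timestamp validation.

```rust
/// One cue: the raw timing line plus zero or more payload lines.
/// (Hypothetical type for this sketch, not the crate's own.)
#[derive(Debug, PartialEq)]
struct Cue {
    timing: String,
    payload: Vec<String>,
}

/// Hypothetical helper: collects cues line by line. A timing line that is
/// followed directly by an empty line yields a cue with an empty payload
/// instead of an error.
fn split_cues(input: &str) -> Vec<Cue> {
    let mut cues = Vec::new();
    let mut lines = input.lines().peekable();

    while let Some(line) = lines.next() {
        // Anything that is not a timing line (header, blank lines,
        // cue identifiers) is skipped in this sketch.
        if !line.contains("-->") {
            continue;
        }

        // Gather payload lines until an empty line, the next timing line,
        // or end of input. Zero payload lines is allowed.
        let mut payload = Vec::new();
        while let Some(next) = lines.peek() {
            if next.is_empty() || next.contains("-->") {
                break;
            }
            payload.push(next.to_string());
            lines.next();
        }

        cues.push(Cue {
            timing: line.to_string(),
            payload,
        });
    }

    cues
}
```

Run against the file above, this sketch yields three cues, the second of which has an empty payload. Because only a truly empty line ends a payload, whitespace-only lines are kept as payload lines rather than treated as block separators.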


Fixes mochi-neko#2. Note that this also allows empty comments and "below"-style comments that begin with whitespace, neither of which is forbidden by the spec: https://www.w3.org/TR/webvtt1/#introduction-comments

bwkimmel linked a pull request Nov 25, 2024 that will close this issue

Allow empty multiline blocks and whitespace-only cue payload lines. #3

Open
