Why do the transcriptions/translations seem to depend on the input bitrate, and why does it affect tokenization? Is there an optimal type of source? Is there a way to fine-tune the window frame to match the predicted tokens more tightly or loosely? I know there is currently no word-level tokenization implemented. Additionally, I understand that the way words are sung differs from how they are spoken naturally. If I understand correctly, is that what the patience parameter can be used for?
I believe by "tokenization" you mean the phrase boundaries produced by the model. The term more often refers to converting text to/from a sequence of integers, as done in tokenizer.py.
There might be a correlation between audio quality and typical phrase lengths in the subtitles.
Changing the decoding rules for the timestamp tokens like in #139 (comment) can affect how long the segments are in general, but not precisely.
No, the patience parameter implements the algorithm from this paper.
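To illustrate the distinction above: "tokenization" in the usual sense maps text to and from a sequence of integer IDs. The sketch below uses a toy vocabulary and a greedy longest-match encoder; it is only an illustration of the concept, not Whisper's actual BPE tokenizer (see tokenizer.py in the repository for the real implementation).

```python
# Toy illustration of tokenization: text <-> sequence of integer IDs.
# This is NOT Whisper's tokenizer; the real one uses byte-pair encoding.

vocab = {"hello": 0, "world": 1, " ": 2}
inverse = {i: t for t, i in vocab.items()}

def encode(text):
    """Greedy longest-match encoding over the toy vocabulary."""
    ids, i = [], 0
    while i < len(text):
        for token in sorted(vocab, key=len, reverse=True):
            if text.startswith(token, i):
                ids.append(vocab[token])
                i += len(token)
                break
        else:
            raise ValueError(f"no token matches at position {i}")
    return ids

def decode(ids):
    """Map integer IDs back to the original text."""
    return "".join(inverse[i] for i in ids)

ids = encode("hello world")
print(ids)          # [0, 2, 1]
print(decode(ids))  # hello world
```

Phrase/segment boundaries, by contrast, come from the timestamp tokens the model emits during decoding, which is why they are influenced by the decoding rules rather than by the tokenizer.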