-
Notifications
You must be signed in to change notification settings - Fork 165
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Determining time indexes for embeddings? #64
Comments
I had a similar problem and I fixed it by changing the method
|
Hi @lagidigu , Thanks for this suggestion. total duration of each segment in ms, counting number of embeddings that can fit in each segment with 400 ms partition, If I consider 3 embeddings in each segment I should have got 9 d-vector embeddings. But Actually I got 11 d-vector embeddings which does not match with both of the above number. I have experimented like this for higher length audio files as well and finding mismatch of around 2 or 4 embeddings. |
In
dvector_create.py
, an audio file is converted to a sequence of dvectors. However, the time index of each dvector is lost, so if a classification is performed using that dvector, you can't really do much with it since you don't know where in the original file that classification applies to. How would determine the time index for each dvector returned byalign_embeddings()
?The text was updated successfully, but these errors were encountered: