Could not download 'pyannote/segmentation-3.0' model. #15

Open
Malkovitz opened this issue Oct 15, 2023 · 2 comments

Comments

@Malkovitz

Hello, I recently tried to spin up this program again, but speaker diarization now fails.

It worked fine a month ago, and I have not changed anything in the config file.

I tried to fix it by generating a new token and making sure I had access to all three gated models, as shown in the YouTube tutorial, but it still fails.

```
Could not download 'pyannote/segmentation-3.0' model.
It might be because the model is private or gated so make
sure to authenticate. Visit https://hf.co/settings/tokens to
create your access token and retry with:

   >>> Model.from_pretrained('pyannote/segmentation-3.0',
   ...                       use_auth_token=YOUR_AUTH_TOKEN)

If this still does not work, it might be because the model is gated:
visit https://hf.co/pyannote/segmentation-3.0 to accept the user conditions.
Traceback (most recent call last):
  File "c:\Users\arkad\Desktop\ai\audiosplitter_whisper\venv\Scripts\whisperx-script.py", line 33, in <module>
    sys.exit(load_entry_point('whisperx==3.1.1', 'console_scripts', 'whisperx')())
  File "C:\Users\arkad\Desktop\ai\audiosplitter_whisper\venv\lib\site-packages\whisperx\transcribe.py", line 211, in cli
    diarize_model = DiarizationPipeline(use_auth_token=hf_token, device=device)
  File "C:\Users\arkad\Desktop\ai\audiosplitter_whisper\venv\lib\site-packages\whisperx\diarize.py", line 19, in __init__
    self.model = Pipeline.from_pretrained(model_name, use_auth_token=use_auth_token).to(device)
  File "C:\Users\arkad\Desktop\ai\audiosplitter_whisper\venv\lib\site-packages\pyannote\audio\core\pipeline.py", line 136, in from_pretrained
    pipeline = Klass(**params)
  File "C:\Users\arkad\Desktop\ai\audiosplitter_whisper\venv\lib\site-packages\pyannote\audio\pipelines\speaker_diarization.py", line 128, in __init__
    model: Model = get_model(segmentation, use_auth_token=use_auth_token)
  File "C:\Users\arkad\Desktop\ai\audiosplitter_whisper\venv\lib\site-packages\pyannote\audio\pipelines\utils\getter.py", line 89, in get_model
    model.eval()
AttributeError: 'NoneType' object has no attribute 'eval'
Traceback (most recent call last):
  File "c:\Users\arkad\Desktop\ai\audiosplitter_whisper\split_audio.py", line 183, in <module>
    main()
  File "c:\Users\arkad\Desktop\ai\audiosplitter_whisper\split_audio.py", line 180, in main
    process_audio_files(input_folder, settings)
  File "c:\Users\arkad\Desktop\ai\audiosplitter_whisper\split_audio.py", line 148, in process_audio_files
    diarize_audio_with_srt(audio_file_path, srt_file, speaker_segments_dir)
  File "c:\Users\arkad\Desktop\ai\audiosplitter_whisper\split_audio.py", line 77, in diarize_audio_with_srt
    subs = pysrt.open(srt_file)
  File "C:\Users\arkad\Desktop\ai\audiosplitter_whisper\venv\lib\site-packages\pysrt\srtfile.py", line 151, in open
    source_file, encoding = cls._open_unicode_file(path, claimed_encoding=encoding)
  File "C:\Users\arkad\Desktop\ai\audiosplitter_whisper\venv\lib\site-packages\pysrt\srtfile.py", line 292, in _open_unicode_file
    encoding = claimed_encoding or cls._detect_encoding(path)
  File "C:\Users\arkad\Desktop\ai\audiosplitter_whisper\venv\lib\site-packages\pysrt\srtfile.py", line 279, in _detect_encoding
    file_descriptor = open(path, 'rb')
FileNotFoundError: [Errno 2] No such file or directory: 'C:\Users\arkad\Desktop\ai\Pudzian\output\plik.srt'
```
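
The second traceback appears to be a knock-on failure rather than a separate bug: whisperx exited before writing the subtitle file, so `pysrt.open` had nothing to open. Below is a minimal sketch of a guard that would make this clearer. `require_srt` is a hypothetical helper, not something that exists in `split_audio.py`, and the wording of the message is only a suggestion.

```python
from pathlib import Path


def require_srt(srt_file):
    """Return the subtitle path, or fail early with a more helpful message.

    Hypothetical helper: it only checks that the file exists, so that a
    missing .srt is reported as an upstream (whisperx) failure instead of
    surfacing deep inside pysrt.
    """
    path = Path(srt_file)
    if not path.is_file():
        raise FileNotFoundError(
            f"Expected subtitle file {path} was not created; "
            "the transcription/diarization step probably failed earlier. "
            "Check the first error in the log."
        )
    return path
```

Calling something like `require_srt(srt_file)` just before `pysrt.open(srt_file)` would leave the behavior unchanged when the file exists.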

@Malkovitz
Author

Never mind, I just found the issue.

In the YouTube video, we are told to accept the conditions for:
https://huggingface.co/pyannote/segmentation

but the script requires

https://huggingface.co/pyannote/segmentation-3.0

which looks like a newer version of the segmentation model.

I accepted the 3.0 conditions, but now it gets stuck at `>>Performing diarization...`
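
For anyone hitting the same mismatch, one way to see which gated repositories a token can actually read is to probe each one before running the script. This is a minimal sketch, not part of this project. It assumes `huggingface_hub` is installed, that each repository has a `config.yaml` at its root, and that the two repository names from this thread are the ones your installed versions need. `check_access` and `hub_probe` are hypothetical helper names.

```python
from typing import Callable, Dict, Iterable

# Repositories mentioned in this thread. Which ones are actually required
# depends on the installed whisperx / pyannote.audio versions (assumption).
GATED_REPOS = (
    "pyannote/segmentation-3.0",
    "pyannote/speaker-diarization-3.1",
)


def hub_probe(repo_id: str, token: str) -> bool:
    """Return True if `token` can read config.yaml metadata from `repo_id`.

    Only file metadata is requested; nothing is downloaded. Any failure
    (conditions not accepted, bad token, network problem) is reported as
    False, so read the printed reason before drawing conclusions.
    """
    # Imported lazily so the rest of the module works without the package.
    from huggingface_hub import get_hf_file_metadata, hf_hub_url

    try:
        get_hf_file_metadata(hf_hub_url(repo_id, "config.yaml"), token=token)
        return True
    except Exception as exc:
        print(f"{repo_id}: {type(exc).__name__}: {exc}")
        return False


def check_access(
    token: str,
    repos: Iterable[str] = GATED_REPOS,
    probe: Callable[[str, str], bool] = hub_probe,
) -> Dict[str, bool]:
    """Map each repository id to whether the probe succeeded for it."""
    return {repo: probe(repo, token) for repo in repos}


# Example usage (needs network access and a real token):
#   import os
#   print(check_access(os.environ["HF_TOKEN"]))
```

A `False` entry for a repository would suggest visiting its page on Hugging Face and accepting the user conditions with the same account that issued the token.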

@freeload101

Check which model is named in:
C:\whisperX\1\audiosplitter_whisper\venv\Lib\site-packages\whisperx\diarize.py
For me it is https://hf.co/pyannote/speaker-diarization-3.1 ...
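
Since the required model seems to differ between installs, it may help to list the `pyannote/...` names that a given `diarize.py` mentions rather than relying on the tutorial. A small sketch under that assumption follows. `find_pyannote_models` is a hypothetical helper, and the pattern only catches names written literally in the source text.

```python
import re
from typing import List

# Matches Hugging Face repo ids under the "pyannote" namespace, ending on a
# letter or digit so that trailing punctuation is not captured.
PYANNOTE_ID = re.compile(r"pyannote/[A-Za-z0-9._-]*[A-Za-z0-9]")


def find_pyannote_models(source: str) -> List[str]:
    """Return the distinct pyannote repo ids found in `source`, sorted."""
    return sorted(set(PYANNOTE_ID.findall(source)))


# Example usage (assumes whisperx is importable in the active venv):
#   from pathlib import Path
#   import whisperx
#   text = Path(whisperx.__file__).with_name("diarize.py").read_text()
#   print(find_pyannote_models(text))
```

Each name it prints corresponds to a page under https://hf.co/ where the conditions can be reviewed. Models that the pipeline pulls in indirectly, such as the segmentation model in the traceback above, will not show up this way.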
