Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Getting error on generation in Windows #12

Open
elephantpanda opened this issue Mar 10, 2023 · 4 comments
Open

Getting error on generation in Windows #12

elephantpanda opened this issue Mar 10, 2023 · 4 comments

Comments

@elephantpanda
Copy link

elephantpanda commented Mar 10, 2023

I installed bitsandbytes following the guide for windows
including the dll from here.

Everything works find it loads 7B into about 8GB VRAM. Great.

But in generating I get:

  File "example.py", line 103, in main
    results = generator.generate(
  File "C:\Users\Shadow\Documents\LLama\llama-int8-main\llama\generation.py", line 60, in generate
    next_token = torch.multinomial(
RuntimeError: probability tensor contains either `inf`, `nan` or element < 0

Any ideas what went wrong?

@Minami-su
Copy link

same question

@XDeepAzure
Copy link

So am i, did you fix this?

@Minami-su
Copy link

I reported an error in testing on tesla p40, but it ran successfully on rtx a5000. Maybe it is because of the low computing power of the graphics card?

@XDeepAzure
Copy link

thanks!!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants