Hi there,
I hope this message finds you well. I am currently training the model in the third stage as described in your repository, and I have run into an issue related to the dataset configuration.
Specifically, I am using the `sharegpt4v_instruct_gpt4-vision_cap100k_new.json` dataset from [Lin-Chen/ShareGPT4V](https://huggingface.co/datasets/Lin-Chen/ShareGPT4V/tree/main). After a few iterations, training fails in the dataloader (`llava.py`). I suspect the cause is a mismatch between the dataset I downloaded and the version you used in your experiments. From your documentation, it seems you may have modified the dataset to align with the training script.
Would it be possible for you to share the modified version of `sharegpt4v_instruct_gpt4-vision_cap100k_new.json`, or to describe the modifications you made? This would greatly help me resolve the issue and proceed with the training.
Thank you in advance for your time and support! Your work has been invaluable to the community, and I appreciate your efforts in maintaining the repository.
Looking forward to your guidance.
Best regards,
Ali
Hi there, I modified `sharegpt4v_instruct_gpt4-vision_cap100k_new.json` only because a few images (fewer than 10) have incorrect paths in the original JSON annotations. Unfortunately, I do not have access to my version of the ShareGPT4V annotations at the moment. You could try filtering your downloaded ShareGPT4V annotations by removing the entries whose images are missing. Hope this helps!
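For anyone who wants to try the filtering suggested above, here is a minimal sketch. It is not the maintainer's script. It assumes the annotation file is a JSON list of dicts in the LLaVA style, where an optional `image` key holds a path relative to a single image root directory. The key name and the helper names `filter_missing_images` and `filter_annotation_file` are assumptions for illustration, so adjust them if your copy of the annotations is laid out differently.

```python
import json
import os


def filter_missing_images(entries, image_root):
    """Split entries into (kept, dropped) by whether the image file exists.

    Assumption: each entry is a dict whose optional "image" key is a path
    relative to image_root. Entries without that key are kept unchanged.
    """
    kept, dropped = [], []
    for entry in entries:
        image = entry.get("image")
        if image is None or os.path.isfile(os.path.join(image_root, image)):
            kept.append(entry)
        else:
            dropped.append(entry)
    return kept, dropped


def filter_annotation_file(src_json, dst_json, image_root):
    """Write a filtered copy of src_json to dst_json.

    Returns (number of entries kept, number of entries dropped).
    """
    with open(src_json, "r", encoding="utf-8") as f:
        entries = json.load(f)
    kept, dropped = filter_missing_images(entries, image_root)
    with open(dst_json, "w", encoding="utf-8") as f:
        json.dump(kept, f, ensure_ascii=False)
    return len(kept), len(dropped)
```

Calling `filter_annotation_file` with the downloaded JSON, a new output path, and your image root writes a filtered copy and reports how many entries were dropped. If the count is far above the "fewer than 10" mentioned above, the image root is probably wrong rather than the annotations.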