interlaced information between images and text and multiple images #79

Open
zhangqingwu opened this issue May 19, 2024 · 2 comments

Comments

@zhangqingwu

I want to train a model that can understand interleaved image and text input, where a single prompt may contain multiple images.
Can the current code be configured to train on multiple images?
If it needs to be modified, what would be a reasonable way to modify it?

@LAW1223
Collaborator

LAW1223 commented May 21, 2024

The current code version only supports single-image training. The multi-image training and evaluation capabilities will be included in a future release, and the code will be made publicly available at that time.

@RussRobin
Collaborator

RussRobin commented Jul 6, 2024

Hi @zhangqingwu, thank you for your interest in Bunny. We have released Bunny code for multi-image input as part of SpatialBot. Please note that it does not include interlaced training. Also, the SpatialBot code will not be kept up to date with Bunny in the future.

Regards
Russell
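
For readers wondering what interlaced multi-image input involves on the data side, here is a minimal sketch of one common approach: mark each image position in the prompt with a placeholder and split the prompt into ordered text and image segments. This is not Bunny or SpatialBot code. The `<image>` placeholder string and the `split_interleaved` helper are assumptions made up for illustration, and a real training pipeline would still need to tokenize the text chunks and splice image features in at the marked positions.

```python
from typing import List, Union

# Assumed placeholder string marking where an image sits in the prompt (hypothetical).
IMAGE_PLACEHOLDER = "<image>"


def split_interleaved(prompt: str, num_images: int) -> List[Union[str, int]]:
    """Split a prompt into text chunks and image indices, in reading order.

    Text chunks are returned as stripped strings; each image is returned as
    its 0-based index into the list of images supplied with the prompt.
    """
    parts = prompt.split(IMAGE_PLACEHOLDER)
    found = len(parts) - 1  # number of placeholders in the prompt
    if found != num_images:
        raise ValueError(
            f"prompt has {found} placeholders but {num_images} images were given"
        )

    segments: List[Union[str, int]] = []
    for i, text in enumerate(parts):
        if text.strip():
            segments.append(text.strip())  # keep non-empty text chunks only
        if i < found:
            segments.append(i)  # the i-th image follows the i-th text chunk
    return segments


if __name__ == "__main__":
    print(split_interleaved("Compare <image> with <image> and describe.", 2))
```

Checking the placeholder count against the number of images up front catches mismatched samples before they reach the model, which is usually easier to debug than a shape error later in training.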
