-
Notifications
You must be signed in to change notification settings - Fork 64
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
修改或融合视觉模块 #99
Comments
For another vision tower or projector, you can import what you like. Pay attention to multimodal_encoder and multimodal_projector. You need to add the code of class and modify the For combining multiple vision features, you also need to modify the architecture of Bunny (something like vision_tower_list) and Generally, you need to pre-train and fine-tune by yourself. Under some circumstances, you may start from our released weights. |
感谢回复,另外问一下pre_train大概需要怎样的算力资源? |
We always use 8*A100. |
Close the issue for now if there's no further discussions. Feel free to reopen it if there's any other questions. |
请问是否支持修改视觉模块或融合多个主干的视觉表征?
如果进行修改或融合,是否需要重新进行pre_train来获得相应的projector权重?
或是如何对projector进行修改?
The text was updated successfully, but these errors were encountered: