Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

💡 [REQUEST] - <title> 有人知道怎么用VLLM对Qwen2-VL进行推理加速嘛 #461

Open
Fly2flies opened this issue Sep 2, 2024 · 3 comments
Labels
question Further information is requested

Comments

@Fly2flies
Copy link

起始日期 | Start Date

No response

实现PR | Implementation PR

No response

相关Issues | Reference Issues

No response

摘要 | Summary

怎么用VLLM库对Qwen2-VL进行推理加速,有具体的示例嘛?

基本示例 | Basic Example

  • VLLM目前支持一些多模态模型,vllm support models,有人知道怎么适配qwen2-vl来加速推理嘛?

缺陷 | Drawbacks

未解决问题 | Unresolved questions

  • 没有相关的demo展示
@Fly2flies Fly2flies added the question Further information is requested label Sep 2, 2024
@zhangfan-algo
Copy link

+1

@lizhipengpeng
Copy link

+1

@elesun2018
Copy link

请问现在Qwen-VL是否支持vllm加速
如何才能支持VLLM加速,谢谢 sss

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
question Further information is requested
Projects
None yet
Development

No branches or pull requests

4 participants