
[Layers] Add qwenRope support for Qwen1.0 in CB mode #449

Merged (1 commit) on Jun 17, 2024

Conversation

@abenmao (Contributor) commented Jun 14, 2024

No description provided.


#pragma omp parallel for collapse(2)
for (int head = 0; head < heads; ++head) {
for (int seq = 0; seq < totSeqLen; ++seq) {
Contributor

As a next step, considering the first-token phase, should we swap the two loops so that each thread accesses contiguous memory? It may be worth testing that implementation.
OK for the current version.

Contributor Author

Got it~

@changqi1 (Contributor) commented Jun 17, 2024

I think these kernel APIs for the continuous batching version and the non-continuous batching version are the same. As a next step, we could merge the two into one kernel API.
OK for the current version.

@abenmao (Contributor, Author) commented Jun 17, 2024

> I think these kernel APIs for the continuous batching version and the non-continuous batching version are the same. As a next step, we could merge the two into one kernel API. OK for the current version.

Yes, maybe we can remove the older version in the next step.

@abenmao abenmao merged commit 8bd8d68 into intel:main Jun 17, 2024
1 check passed