Suport reward model with tp #491

fubaosu · 2024-08-06T09:57:04Z

Objective ：support reward model.
Made the following changes：

Add "/get_score" request interface in api_server.py.
Add lightllm_get_score(...) function to spport get_score in api_lightllm.py.
Add "use_reward_model" args to depoly reward model in api_server.py.
Add "RewardModelBackend" to spport reward model forward in lightllm/lightllm/server/router/model_infer/mode_backend/continues_batch/impl_for_reward_model.py
Added internlm2_reward model support as an example of reward model.

fubaosu added 7 commits August 6, 2024 11:33

add internlm-reward model suport

9fc0aac

new lightllm_get_score func

b3c01ab

new lightllm_get_score func

8f1ee60

add tp support for reward model

5d4d9d3

make sure the reward model can not use other backends

0678748

pre-commit

cf9b050

reformat code

82c9baa

shihaobai merged commit dcfcd55 into ModelTC:main Aug 7, 2024
1 check passed

Provide feedback