Reward model VLLM API upgrade #331

shtoshni · 2025-01-27T23:07:16Z

The new vLLM API supports reward models via the "pooling" interface. This PR fixes the earlier hacky embedding interface.

Kipok

Did you verify that qwen rm gives expected scores?

tests/gpu-tests/test-local.yaml

tests/gpu-tests/test_reward.py

Co-authored-by: Igor Gitman <igitman@nvidia.com>

shtoshni · 2025-01-28T15:25:25Z

Did you verify that qwen rm gives expected scores?

The number is in the ballpark of the huggingface numbers. Reported is 3.75, we get 3.33.

Shubham Toshniwal and others added 30 commits July 10, 2024 11:42

Nemotron eval map

151799d

Merge branch 'main' of github.com:Kipok/NeMo-Skills into main

83fe2aa

Merge branch 'main' of github.com:Kipok/NeMo-Skills into main

2ee8067

Merge branch 'main' of github.com:Kipok/NeMo-Skills into main

6fe71aa

Merge branch 'main' of github.com:Kipok/NeMo-Skills into main

4f125d0

Merge branch 'main' of github.com:Kipok/NeMo-Skills into main

63dc89c

Merging with main

f132827

Merge branch 'main' of github.com:Kipok/NeMo-Skills into main

271d7c3

Merge branch 'main' of github.com:Kipok/NeMo-Skills into main

6236437

Merge branch 'main' of github.com:Kipok/NeMo-Skills into main

c441f15

Merge branch 'main' of github.com:Kipok/NeMo-Skills into main

6cc0c0f

Merge branch 'main' of github.com:Kipok/NeMo-Skills into main

10b06fa

Merge branch 'main' of github.com:Kipok/NeMo-Skills into main

99d23e2

Merge branch 'main' of github.com:Kipok/NeMo-Skills into main

c77a584

Reward model updates for new VLLM API

6d4a967

Merge branch 'main' of github.com:Kipok/NeMo-Skills into main

475938a

Merge branch 'main' into shtoshni/vllm-upgrade

0d24563

Add logic for ORM vs PRM

9663b5b

Reward model type

2e38080

Fixes

293c384

Merge branch 'main' into shtoshni/vllm-upgrade

4b28a7f

Tests

efe0aff

Fixed test

4aefaf8

Fixed minor error

feac055

Fixing RM api

77678bd

Test change

f7f533d

Testing

c5e9ed0

Testing

70ed6dd

Testing

9ca9dd7

RM testing

90d7c25

shtoshni added 6 commits January 27, 2025 11:39

RM testing

8fbb652

RM testing

64710cf

Reward model test update

71ccce6

Fixing test

c33ce27

Merge branch 'main' into shtoshni/vllm-upgrade

45bf8a8

Removing logging

c7c453f

shtoshni added the run GPU tests label Jan 27, 2025

Kipok reviewed Jan 27, 2025

View reviewed changes

tests/gpu-tests/test-local.yaml Outdated Show resolved Hide resolved

tests/gpu-tests/test-local.yaml Outdated Show resolved Hide resolved

tests/gpu-tests/test_reward.py Outdated Show resolved Hide resolved

shtoshni and others added 4 commits January 28, 2025 10:23

Update tests/gpu-tests/test-local.yaml

6a4ce67

Co-authored-by: Igor Gitman <igitman@nvidia.com>

Update tests/gpu-tests/test-local.yaml

53f0d39

Co-authored-by: Igor Gitman <igitman@nvidia.com>

Update tests/gpu-tests/test_reward.py

cee9618

Co-authored-by: Igor Gitman <igitman@nvidia.com>

Merge branch 'main' into shtoshni/vllm-upgrade

baecdb5

Merge branch 'main' into shtoshni/vllm-upgrade

6413fe3

shtoshni added run GPU tests and removed run GPU tests labels Jan 28, 2025

Adding attention heads to avoid division error

d233bf3

shtoshni added run GPU tests and removed run GPU tests labels Jan 28, 2025

shtoshni merged commit 74715af into main Jan 28, 2025
5 of 7 checks passed

shtoshni deleted the shtoshni/vllm-upgrade branch January 29, 2025 16:11

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reward model VLLM API upgrade #331

Reward model VLLM API upgrade #331

shtoshni commented Jan 27, 2025

Kipok left a comment

shtoshni commented Jan 28, 2025

Reward model VLLM API upgrade #331

Reward model VLLM API upgrade #331

Conversation

shtoshni commented Jan 27, 2025

Kipok left a comment

Choose a reason for hiding this comment

shtoshni commented Jan 28, 2025