Viewer
•
Updated
•
1.23k
•
15.9k
trl-lib/documentation-images
Viewer
•
Updated
•
9
•
67.8k
Viewer
•
Updated
•
103k
•
4.47k
trl-lib/llava-instruct-mix
Viewer
•
Updated
•
228k
•
1.38k
•
2
trl-lib/OpenMathReasoning
Viewer
•
Updated
•
3.2M
•
516
trl-lib/chatbot_arena_completions
Viewer
•
Updated
•
33k
•
638
•
1
Viewer
•
Updated
•
83.1k
•
338
•
3
trl-lib/ultrafeedback-gpt-3.5-turbo-helpfulness
Viewer
•
Updated
•
16.6k
•
135
•
3
trl-lib/ultrafeedback-prompt
Viewer
•
Updated
•
39.8k
•
929
•
9
Viewer
•
Updated
•
179k
•
592
•
2
Viewer
•
Updated
•
130k
•
3.63k
•
30
Viewer
•
Updated
•
41.2k
•
252
•
2
Viewer
•
Updated
•
445k
•
2.18k
•
9
trl-lib/lm-human-preferences-sentiment
Viewer
•
Updated
•
6.26k
•
1.05k
trl-lib/lm-human-preferences-descriptiveness
Viewer
•
Updated
•
6.26k
•
44
•
1
trl-lib/hh-rlhf-helpful-base
Viewer
•
Updated
•
46.2k
•
1.65k
•
3
Viewer
•
Updated
•
51.8k
•
29
trl-lib/Capybara-Preferences
Viewer
•
Updated
•
15.4k
•
43
Viewer
•
Updated
•
16k
•
3.77k
•
14
trl-lib/ultrafeedback_binarized
Viewer
•
Updated
•
63.1k
•
5.87k
•
20
trl-lib/capybara-preferencces-7k
Viewer
•
Updated
•
7.56k
•
39
Viewer
•
Updated
•
15k
•
202
•
9
trl-lib/ultrachat_200k_chatml
Viewer
•
Updated
•
231k
•
60
•
3