Inference Providers
Active filters: RLHF
NousResearch/Hermes-2-Pro-Llama-3-8B
Text Generation
• 8B • Updated • 222k
• • 448
NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO-adapter
Updated • 16
NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO
Text Generation
• 47B • Updated • 8.75k
• 453
NousResearch/Nous-Hermes-2-Mixtral-8x7B-DPO-GGUF
47B • Updated • 1.56k
• 71
NousResearch/Nous-Hermes-2-Mistral-7B-DPO
Text Generation
• 7B • Updated • 1.43k
• 218
NousResearch/Hermes-2-Pro-Mistral-7B-GGUF
7B • Updated • 4.7k
• 247
NousResearch/Hermes-2-Pro-Mistral-7B
Text Generation
• 7B • Updated • 4.91k
• 501
NousResearch/Hermes-2-Theta-Llama-3-8B
Text Generation
• 8B • Updated • 10.4k
• • 204
NousResearch/Hermes-2-Theta-Llama-3-8B-GGUF
8B • Updated • 587
• 91
NousResearch/Hermes-2-Pro-Llama-3-70B
Text Generation
• 71B • Updated • 95
• • 35
OpenAssistant/reward-model-deberta-v3-base
Text Classification
• Updated • 1.16k
• • 13
OpenAssistant/reward-model-electra-large-discriminator
Text Classification
• Updated • 19
• 5
OpenAssistant/reward-model-deberta-v3-large
Text Classification
• Updated • 296
• 26
OpenAssistant/reward-model-deberta-v3-large-v2
Text Classification
• Updated • 48.3k
• • 245
Text Ranking
• 0.4B • Updated • 10
• 3
nicholasKluge/RewardModelPT
Text Classification
• 0.1B • Updated • 28
nicholasKluge/RewardModel
Text Classification
• 0.1B • Updated • 319
• 1
fb700/chatglm-fitness-RLHF
Updated • 268
fb700/Bofan-chatglm-Best-lora
Updated • 5
• 11
kubernetes-bad/Ligma-L2-13b
Updated • 7
• 3
Text Generation
• Updated • 552
• 206
berkeley-nest/Starling-LM-7B-alpha
Text Generation
• 7B • Updated • 1.86k
• 559
berkeley-nest/Starling-RM-7B-alpha
Updated • 78
• 104
LoneStriker/Starling-LM-7B-alpha-3.0bpw-h6-exl2
Text Generation
• Updated • 2
LoneStriker/Starling-LM-7B-alpha-4.0bpw-h6-exl2
Text Generation
• Updated • 5
• 1
LoneStriker/Starling-LM-7B-alpha-5.0bpw-h6-exl2
Text Generation
• Updated • 5
• 2
LoneStriker/Starling-LM-7B-alpha-6.0bpw-h6-exl2
Text Generation
• Updated • 3
• 1
LoneStriker/Starling-LM-7B-alpha-8.0bpw-h8-exl2
Text Generation
• Updated • 5
• 2
TheBloke/Starling-LM-7B-alpha-GGUF
7B • Updated • 762
• 94
TheBloke/Starling-LM-7B-alpha-AWQ
Text Generation
• 7B • Updated • 15
• 9