Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Edit Models filters
Tasks
Libraries
Datasets
Languages
Licenses
Other
1
Inference Providers
Select all
SambaNova
Replicate
fal
Together AI
HF Inference API
Misc
Reset Misc
arxiv:
2305.18290
Inference Endpoints
AutoTrain Compatible
text-generation-inference
4-bit precision
Eval Results
8-bit precision
custom_code
Merge
text-embeddings-inference
Misc with no match
Carbon Emissions
Mixture of Experts
Apply filters
Models
1,018
Full-text search
Edit filters
Sort: Trending
Active filters:
2305.18290
Clear all
nomadrp/tq-aya-binary-20each-ws
Updated
8 days ago
nomadrp/tq-aya-binary-20each-ws-v2
Updated
7 days ago
JayHyeon/Qwen_0.5-VDPO_5e-7-3ep_1vpo_const
Text Generation
•
Updated
6 days ago
•
3
nomadrp/tq-expanse-binary-20each-ws-v1
Updated
6 days ago
nomadrp/tq-expanse-binary-20each-ws-v2
Updated
6 days ago
JayHyeon/Qwen_0.5-VDPO_5e-7-3ep_3vpo_const
Text Generation
•
Updated
6 days ago
•
15
Prakash2608/tiny-chatbot-dpo
Updated
6 days ago
JayHyeon/Qwen_0.5-VDPO_5e-7-3ep_10vpo_const
Text Generation
•
Updated
6 days ago
•
12
jasonhuang3/dpo-llama-3-1-8b-math-ep3
Updated
6 days ago
JayHyeon/Qwen_0.5-VDPO_5e-7-3ep_30vpo_const
Text Generation
•
Updated
4 days ago
•
9
josang1204/Qweb2.5-FT-DPO-CSY
Text Generation
•
Updated
5 days ago
•
3
shivank21/model
Updated
4 days ago
sylvain471/llama-3-1-8b-dpo-math-ep1
Text Generation
•
Updated
3 days ago
alinatl/SmolLM2-FT-DPO
Text Generation
•
Updated
4 days ago
•
8
onekq/outputs
Updated
4 days ago
VictorBratko/SmolLM2-FT-DPO
Text Generation
•
Updated
3 days ago
•
2
RichardErkhov/gauthamk28_-_SmolLM2-SFTuned-DPo-01-gguf
Updated
3 days ago
•
28
RichardErkhov/thatupiso_-_smolK12-gguf
Updated
3 days ago
•
28
RichardErkhov/YeungNLP_-_firefly-qwen1.5-en-7b-dpo-v0.1-unsloth-4bits
Updated
3 days ago
•
2
RichardErkhov/augmxnt_-_shisa-7b-v1-4bits
Updated
3 days ago
•
2
RichardErkhov/YeungNLP_-_firefly-qwen1.5-en-7b-dpo-v0.1-unsloth-8bits
Updated
3 days ago
•
2
RichardErkhov/augmxnt_-_shisa-7b-v1-8bits
Updated
3 days ago
nicoboss/DeepSeek-V2-Lite-Chat-Uncensored-Unbiased-Lora
Updated
1 day ago
nicoboss/DeepSeek-V2-Lite-Chat-Uncensored-Unbiased
Updated
1 day ago
hamedrahimi/User-VLM-10B-Instruct-DPO
Updated
about 16 hours ago
RichardErkhov/RyanYr_-_self-reflect_ministral8Bit_mg_star-dpo-4bits
Updated
about 16 hours ago
RichardErkhov/RyanYr_-_self-reflect_ministral8Bit_mg_star-dpo-8bits
Updated
about 16 hours ago
dhruvrnaik/test-openbiollm
Updated
about 14 hours ago
Previous
1
...
32
33
34
Next