Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
HANI-LAB
/
Med-REFL-Llama-3.1-8B-lora
like
0
Follow
HANI-LAB
1
Text Generation
Safetensors
English
medical
medical-reasoning
lora
dpo
reflection
arxiv:
2504.12334
License:
apache-2.0
Model card
Files
Files and versions
Community
main
Med-REFL-Llama-3.1-8B-lora
Commit History
Update README.md
61545cc
verified
CityU-Zongxian
commited on
5 days ago
Update README.md
505748c
verified
CityU-Zongxian
commited on
5 days ago
Upload 13 files
23fb88c
verified
CityU-Zongxian
commited on
5 days ago
initial commit
19418f9
verified
CityU-Zongxian
commited on
5 days ago