Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
HANI-LAB
/
Med-REFL-Huatuo-o1-8B-lora
like
1
Follow
HANI-LAB
1
Text Generation
Safetensors
English
medical
medical-reasoning
lora
dpo
reflection
arxiv:
2504.12334
License:
apache-2.0
Model card
Files
Files and versions
Community
main
Med-REFL-Huatuo-o1-8B-lora
Commit History
Update README.md
5edbbe0
verified
CityU-Zongxian
commited on
5 days ago
Update README.md
8838108
verified
CityU-Zongxian
commited on
5 days ago
Update README.md
3a86a98
verified
CityU-Zongxian
commited on
5 days ago
Update README.md
2b3ac4c
verified
CityU-Zongxian
commited on
5 days ago
Update README.md
5f91904
verified
CityU-Zongxian
commited on
5 days ago
Upload 13 files
49751cd
verified
CityU-Zongxian
commited on
5 days ago
initial commit
a672d3f
verified
CityU-Zongxian
commited on
5 days ago