Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
lindafei001
/
llama-8b-instruct-medical-dpo-unlearn-10epochs-1e-5-64-128-2048
like
0
Transformers
TensorBoard
Safetensors
Generated from Trainer
trl
dpo
arxiv:
2305.18290
Model card
Files
Files and versions
xet
Metrics
Training metrics
Community
Train
Deploy
Use this model
afa794a
llama-8b-instruct-medical-dpo-unlearn-10epochs-1e-5-64-128-2048
/
runs
28.3 kB
1 contributor
History:
1 commit
lindafei001
Upload trained model checkpoint
afa794a
verified
16 days ago
Sep22_16-41-12_w-yanli-sudolm-88776406705d4aceac34e7a04afadc53-85df665cbcpx44f
Upload trained model checkpoint
16 days ago
Sep22_17-35-27_w-yanli-sudolm-88776406705d4aceac34e7a04afadc53-85df665cbcpx44f
Upload trained model checkpoint
16 days ago