Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

zou00080
/
llama_PPO_neg_informal

Reinforcement Learning
Transformers
PyTorch
llama
text-generation
trl
text-generation-inference
Model card Files Files and versions Community
llama_PPO_neg_informal
Ctrl+K
Ctrl+K
  • 1 contributor
History: 2 commits
zou00080's picture
zou00080
Upload 8 files
8236065 about 2 years ago
  • .gitattributes
    1.48 kB
    initial commit about 2 years ago
  • README.md
    1.15 kB
    Upload 8 files about 2 years ago
  • adapter_config.json
    360 Bytes
    Upload 8 files about 2 years ago
  • adapter_model.bin
    33.6 MB
    LFS
    Upload 8 files about 2 years ago
  • config.json
    472 Bytes
    Upload 8 files about 2 years ago
  • pytorch_model.bin
    17.5 kB
    LFS
    Upload 8 files about 2 years ago
  • special_tokens_map.json
    96 Bytes
    Upload 8 files about 2 years ago
  • tokenizer.model
    500 kB
    LFS
    Upload 8 files about 2 years ago
  • tokenizer_config.json
    229 Bytes
    Upload 8 files about 2 years ago