Text Generation
Transformers
Safetensors
qwen3
conversational
text-generation-inference
nielsr HF Staff commited on
Commit
8749329
Β·
verified Β·
1 Parent(s): 100f15d

Improve model card: Add paper and code badges, update datasets metadata

Browse files

Hi! This PR enhances the model card for `PIPer-8B` by:
- Adding prominent badges for the associated paper ([PIPer: On-Device Environment Setup via Online Reinforcement Learning](https://huggingface.co/papers/2509.25455)) and the main GitHub repository (`https://github.com/JetBrains-Research/PIPer`) for better visibility and quick access.
- Updating the `datasets` metadata to include all three relevant datasets mentioned in the "Available Artifacts" section of the model card.

All other existing metadata (`base_model`, `library_name`, `license`, `pipeline_tag`) and the comprehensive Markdown content (including the "Reproduce the results" section) are kept as they are accurate and well-structured. No sample usage for inference was added as no such snippet was found in the GitHub README that is directly compatible with the model and `transformers` library.

Files changed (1) hide show
  1. README.md +10 -5
README.md CHANGED
@@ -1,12 +1,15 @@
1
  ---
2
- library_name: transformers
3
- datasets:
4
- - JetBrains-Research/envbench-zeroshot-rl
5
  base_model:
6
  - JetBrains-Research/Qwen3-8B-am
7
- pipeline_tag: text-generation
 
 
 
 
8
  license: mit
 
9
  ---
 
10
  <img src="https://github.com/JetBrains-Research/PIPer/blob/main/misc/piper-logo.png?raw=true" alt="PIPer Mascot" style="height: 6em">
11
  <h1>
12
  PIPer: On-Device Environment Setup via Online Reinforcement Learning
@@ -15,6 +18,8 @@ license: mit
15
 
16
  <div align="center">
17
 
 
 
18
  [![Models](https://img.shields.io/badge/πŸ€—%20Hugging%20Face-Models-orange.svg)](https://jb.gg/PIPer)
19
  [![Dataset](https://img.shields.io/badge/πŸ€—%20Hugging%20Face-Dataset-green.svg)](https://huggingface.co/datasets/JetBrains-Research/PIPer-envbench-zeroshot-rl)
20
  [![License](https://img.shields.io/badge/License-MIT-red.svg)](LICENSE)
@@ -96,4 +101,4 @@ uv run piper/hparams_entrypoint.py +experiment=llm-reward --info config
96
 
97
  ## πŸ“„ License
98
 
99
- This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
 
1
  ---
 
 
 
2
  base_model:
3
  - JetBrains-Research/Qwen3-8B-am
4
+ datasets:
5
+ - JetBrains-Research/PIPer-envbench-zeroshot-rl
6
+ - JetBrains-Research/PIPer-SFT-2500-sharegpt
7
+ - JetBrains-Research/PIPer-eval
8
+ library_name: transformers
9
  license: mit
10
+ pipeline_tag: text-generation
11
  ---
12
+
13
  <img src="https://github.com/JetBrains-Research/PIPer/blob/main/misc/piper-logo.png?raw=true" alt="PIPer Mascot" style="height: 6em">
14
  <h1>
15
  PIPer: On-Device Environment Setup via Online Reinforcement Learning
 
18
 
19
  <div align="center">
20
 
21
+ [![Paper](https://img.shields.io/badge/πŸ“–-Paper-b31b1b.svg)](https://huggingface.co/papers/2509.25455)
22
+ [![Code](https://img.shields.io/badge/πŸ’»-Code-blue.svg)](https://github.com/JetBrains-Research/PIPer)
23
  [![Models](https://img.shields.io/badge/πŸ€—%20Hugging%20Face-Models-orange.svg)](https://jb.gg/PIPer)
24
  [![Dataset](https://img.shields.io/badge/πŸ€—%20Hugging%20Face-Dataset-green.svg)](https://huggingface.co/datasets/JetBrains-Research/PIPer-envbench-zeroshot-rl)
25
  [![License](https://img.shields.io/badge/License-MIT-red.svg)](LICENSE)
 
101
 
102
  ## πŸ“„ License
103
 
104
+ This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.