Improve model card: Add metadata, paper link, and code repository
#1 opened by nielsr (HF Staff)
This PR enhances the model card by:
- Adding `pipeline_tag: text-generation` for better discoverability of this LLM for reasoning tasks.
- Including `library_name: transformers` to enable the automated "How to use" widget, as the model uses a `LlamaForCausalLM` architecture and is compatible with the Transformers library.
- Linking to the official paper: Reinforce-Ada: An Adaptive Sampling Framework for Reinforce-Style LLM Training.
- Adding a direct link to the project's GitHub repository for access to the code and further details.
- Expanding the model description with an overview of the Reinforce-Ada framework, derived from the paper's abstract and the GitHub README.
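For reference, the two metadata fields named in the first two items would typically sit in the YAML front matter at the top of the model card's `README.md`. The fragment below is a minimal sketch showing only those two keys; any other keys the card already carries (license, base model, datasets, and so on) are not stated in this PR and are left out.

```yaml
---
# Task tag used by the Hub for filtering and discoverability
pipeline_tag: text-generation
# Library tag that enables the automated "How to use" widget
library_name: transformers
---
```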
These updates aim to make the model card more informative and user-friendly for the community.