Upload folder using huggingface_hub
Browse files- .gitattributes +2 -0
- LICENSE.md +114 -0
- Llama-PLLuM-8B-instruct-ArtexIT-reasoning.Q5_K_M.gguf +3 -0
- Llama-PLLuM-8B-instruct-ArtexIT-reasoning.Q8_0.gguf +3 -0
- NOTICE +1 -0
- README.md +108 -0
- USE POLICY.md +14 -0
.gitattributes
CHANGED
@@ -33,3 +33,5 @@ saved_model/**/* filter=lfs diff=lfs merge=lfs -text
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
|
|
|
|
|
33 |
*.zip filter=lfs diff=lfs merge=lfs -text
|
34 |
*.zst filter=lfs diff=lfs merge=lfs -text
|
35 |
*tfevents* filter=lfs diff=lfs merge=lfs -text
|
36 |
+
Llama-PLLuM-8B-instruct-ArtexIT-reasoning.Q5_K_M.gguf filter=lfs diff=lfs merge=lfs -text
|
37 |
+
Llama-PLLuM-8B-instruct-ArtexIT-reasoning.Q8_0.gguf filter=lfs diff=lfs merge=lfs -text
|
LICENSE.md
ADDED
@@ -0,0 +1,114 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
LLAMA 3.1 COMMUNITY LICENSE AGREEMENT
|
2 |
+
Llama 3.1 Version Release Date: July 23, 2024
|
3 |
+
|
4 |
+
“Agreement” means the terms and conditions for use, reproduction, distribution and modification of the
|
5 |
+
Llama Materials set forth herein.
|
6 |
+
|
7 |
+
“Documentation” means the specifications, manuals and documentation accompanying Llama 3.1
|
8 |
+
distributed by Meta at https://llama.meta.com/doc/overview.
|
9 |
+
|
10 |
+
“Licensee” or “you” means you, or your employer or any other person or entity (if you are entering into
|
11 |
+
this Agreement on such person or entity’s behalf), of the age required under applicable laws, rules or
|
12 |
+
regulations to provide legal consent and that has legal authority to bind your employer or such other
|
13 |
+
person or entity if you are entering in this Agreement on their behalf.
|
14 |
+
|
15 |
+
“Llama 3.1” means the foundational large language models and software and algorithms, including
|
16 |
+
machine-learning model code, trained model weights, inference-enabling code, training-enabling code,
|
17 |
+
fine-tuning enabling code and other elements of the foregoing distributed by Meta at
|
18 |
+
https://llama.meta.com/llama-downloads.
|
19 |
+
|
20 |
+
“Llama Materials” means, collectively, Meta’s proprietary Llama 3.1 and Documentation (and any
|
21 |
+
portion thereof) made available under this Agreement.
|
22 |
+
|
23 |
+
“Meta” or “we” means Meta Platforms Ireland Limited (if you are located in or, if you are an entity, your
|
24 |
+
principal place of business is in the EEA or Switzerland) and Meta Platforms, Inc. (if you are located
|
25 |
+
outside of the EEA or Switzerland).
|
26 |
+
|
27 |
+
By clicking “I Accept” below or by using or distributing any portion or element of the Llama Materials,
|
28 |
+
you agree to be bound by this Agreement.
|
29 |
+
|
30 |
+
1. License Rights and Redistribution.
|
31 |
+
|
32 |
+
a. Grant of Rights. You are granted a non-exclusive, worldwide, non-transferable and royalty-free
|
33 |
+
limited license under Meta’s intellectual property or other rights owned by Meta embodied in the Llama
|
34 |
+
Materials to use, reproduce, distribute, copy, create derivative works of, and make modifications to the
|
35 |
+
Llama Materials.
|
36 |
+
|
37 |
+
b. Redistribution and Use.
|
38 |
+
|
39 |
+
i. If you distribute or make available the Llama Materials (or any derivative works
|
40 |
+
thereof), or a product or service (including another AI model) that contains any of them, you shall (A)
|
41 |
+
provide a copy of this Agreement with any such Llama Materials; and (B) prominently display “Built with
|
42 |
+
Llama” on a related website, user interface, blogpost, about page, or product documentation. If you use
|
43 |
+
the Llama Materials or any outputs or results of the Llama Materials to create, train, fine tune, or
|
44 |
+
otherwise improve an AI model, which is distributed or made available, you shall also include “Llama” at
|
45 |
+
the beginning of any such AI model name.
|
46 |
+
|
47 |
+
ii. If you receive Llama Materials, or any derivative works thereof, from a Licensee as part
|
48 |
+
of an integrated end user product, then Section 2 of this Agreement will not apply to you.
|
49 |
+
|
50 |
+
iii. You must retain in all copies of the Llama Materials that you distribute the following
|
51 |
+
attribution notice within a “Notice” text file distributed as a part of such copies: “Llama 3.1 is
|
52 |
+
licensed under the Llama 3.1 Community License, Copyright © Meta Platforms, Inc. All Rights
|
53 |
+
Reserved.”
|
54 |
+
|
55 |
+
iv. Your use of the Llama Materials must comply with applicable laws and regulations
|
56 |
+
(including trade compliance laws and regulations) and adhere to the Acceptable Use Policy for the Llama
|
57 |
+
Materials (available at https://llama.meta.com/llama3_1/use-policy), which is hereby incorporated by
|
58 |
+
reference into this Agreement.
|
59 |
+
|
60 |
+
2. Additional Commercial Terms. If, on the Llama 3.1 version release date, the monthly active users
|
61 |
+
of the products or services made available by or for Licensee, or Licensee’s affiliates, is greater than 700
|
62 |
+
million monthly active users in the preceding calendar month, you must request a license from Meta,
|
63 |
+
which Meta may grant to you in its sole discretion, and you are not authorized to exercise any of the
|
64 |
+
rights under this Agreement unless or until Meta otherwise expressly grants you such rights.
|
65 |
+
|
66 |
+
3. Disclaimer of Warranty. UNLESS REQUIRED BY APPLICABLE LAW, THE LLAMA MATERIALS AND ANY
|
67 |
+
OUTPUT AND RESULTS THEREFROM ARE PROVIDED ON AN “AS IS” BASIS, WITHOUT WARRANTIES OF
|
68 |
+
ANY KIND, AND META DISCLAIMS ALL WARRANTIES OF ANY KIND, BOTH EXPRESS AND IMPLIED,
|
69 |
+
INCLUDING, WITHOUT LIMITATION, ANY WARRANTIES OF TITLE, NON-INFRINGEMENT,
|
70 |
+
MERCHANTABILITY, OR FITNESS FOR A PARTICULAR PURPOSE. YOU ARE SOLELY RESPONSIBLE FOR
|
71 |
+
DETERMINING THE APPROPRIATENESS OF USING OR REDISTRIBUTING THE LLAMA MATERIALS AND
|
72 |
+
ASSUME ANY RISKS ASSOCIATED WITH YOUR USE OF THE LLAMA MATERIALS AND ANY OUTPUT AND
|
73 |
+
RESULTS.
|
74 |
+
|
75 |
+
4. Limitation of Liability. IN NO EVENT WILL META OR ITS AFFILIATES BE LIABLE UNDER ANY THEORY OF
|
76 |
+
LIABILITY, WHETHER IN CONTRACT, TORT, NEGLIGENCE, PRODUCTS LIABILITY, OR OTHERWISE, ARISING
|
77 |
+
OUT OF THIS AGREEMENT, FOR ANY LOST PROFITS OR ANY INDIRECT, SPECIAL, CONSEQUENTIAL,
|
78 |
+
INCIDENTAL, EXEMPLARY OR PUNITIVE DAMAGES, EVEN IF META OR ITS AFFILIATES HAVE BEEN ADVISED
|
79 |
+
OF THE POSSIBILITY OF ANY OF THE FOREGOING.
|
80 |
+
|
81 |
+
5. Intellectual Property.
|
82 |
+
|
83 |
+
a. No trademark licenses are granted under this Agreement, and in connection with the Llama
|
84 |
+
Materials, neither Meta nor Licensee may use any name or mark owned by or associated with the other
|
85 |
+
or any of its affiliates, except as required for reasonable and customary use in describing and
|
86 |
+
redistributing the Llama Materials or as set forth in this Section 5(a). Meta hereby grants you a license to
|
87 |
+
use “Llama” (the “Mark”) solely as required to comply with the last sentence of Section 1.b.i. You will
|
88 |
+
comply with Meta’s brand guidelines (currently accessible at
|
89 |
+
https://about.meta.com/brand/resources/meta/company-brand/ ). All goodwill arising out of your use
|
90 |
+
of the Mark will inure to the benefit of Meta.
|
91 |
+
|
92 |
+
b. Subject to Meta’s ownership of Llama Materials and derivatives made by or for Meta, with
|
93 |
+
respect to any derivative works and modifications of the Llama Materials that are made by you, as
|
94 |
+
between you and Meta, you are and will be the owner of such derivative works and modifications.
|
95 |
+
|
96 |
+
c. If you institute litigation or other proceedings against Meta or any entity (including a
|
97 |
+
cross-claim or counterclaim in a lawsuit) alleging that the Llama Materials or Llama 3.1 outputs or
|
98 |
+
results, or any portion of any of the foregoing, constitutes infringement of intellectual property or other
|
99 |
+
rights owned or licensable by you, then any licenses granted to you under this Agreement shall
|
100 |
+
terminate as of the date such litigation or claim is filed or instituted. You will indemnify and hold
|
101 |
+
harmless Meta from and against any claim by any third party arising out of or related to your use or
|
102 |
+
distribution of the Llama Materials.
|
103 |
+
|
104 |
+
6. Term and Termination. The term of this Agreement will commence upon your acceptance of this
|
105 |
+
Agreement or access to the Llama Materials and will continue in full force and effect until terminated in
|
106 |
+
accordance with the terms and conditions herein. Meta may terminate this Agreement if you are in
|
107 |
+
breach of any term or condition of this Agreement. Upon termination of this Agreement, you shall delete
|
108 |
+
and cease use of the Llama Materials. Sections 3, 4 and 7 shall survive the termination of this
|
109 |
+
Agreement.
|
110 |
+
|
111 |
+
7. Governing Law and Jurisdiction. This Agreement will be governed and construed under the laws of
|
112 |
+
the State of California without regard to choice of law principles, and the UN Convention on Contracts
|
113 |
+
for the International Sale of Goods does not apply to this Agreement. The courts of California shall have
|
114 |
+
exclusive jurisdiction of any dispute arising out of this Agreement.
|
Llama-PLLuM-8B-instruct-ArtexIT-reasoning.Q5_K_M.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:399b08c30a0db957a3f7fb5711022af1375ea7c41729703d85f0117e945ead0e
|
3 |
+
size 5733001536
|
Llama-PLLuM-8B-instruct-ArtexIT-reasoning.Q8_0.gguf
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:77c6d3b8006cd743e82723448928c7931f99f902497fe03ffcc00066ded6d8ca
|
3 |
+
size 8540790016
|
NOTICE
ADDED
@@ -0,0 +1 @@
|
|
|
|
|
1 |
+
Llama 3.1 is licensed under the Llama 3.1 Community License, Copyright © Meta Platforms, Inc. All Rights Reserved.
|
README.md
ADDED
@@ -0,0 +1,108 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
---
|
2 |
+
language: [pl]
|
3 |
+
license: llama3.1
|
4 |
+
pipeline_tag: text-generation
|
5 |
+
library_name: llama.cpp
|
6 |
+
tags:
|
7 |
+
- gguf
|
8 |
+
- quantized
|
9 |
+
- q8_0
|
10 |
+
- f16
|
11 |
+
base_model: ARTEXIT/Llama-PLLuM-8B-instruct-ArtexIT-reasoning
|
12 |
+
base_model_relation: quantized
|
13 |
+
quantization:
|
14 |
+
- Q8_0
|
15 |
+
- Q5_K_M
|
16 |
+
---
|
17 |
+
|
18 |
+
# Llama-PLLuM-8B-instruct-ArtexIT-reasoning
|
19 |
+
|
20 |
+
**Built with Llama**
|
21 |
+
|
22 |
+
This repository contains a GRPO fine‑tune of [`CYFRAGOVPL/Llama-PLLuM-8B-instruct`] trained on **GSM8K** (MIT).
|
23 |
+
We publish both **Hugging Face (safetensors)** and **GGUF** artifacts (Q8_0, Q5_K_M) for use with `llama.cpp`.
|
24 |
+
|
25 |
+
|
26 |
+
## What is this?
|
27 |
+
- **Base**: Meta Llama 3.1 → PLLuM 8B Instruct (Polish) → GRPO fine‑tune (math / word problems).
|
28 |
+
- **Context**: ~131k (based on GGUF header).
|
29 |
+
- **Message format**: Llama `[INST] ... [/INST]` + explicit reasoning / answer tags (see below).
|
30 |
+
- **Default chat template**: The tokenizer includes a default system instruction enforcing the two‑block format.
|
31 |
+
|
32 |
+
|
33 |
+
## Prompt format
|
34 |
+
|
35 |
+
The model expects Llama chat formatting and supports explicit tags:
|
36 |
+
|
37 |
+
- **Reasoning**: `<think> ... </think>`
|
38 |
+
- **Final answer**: `<answer> ... </answer>`
|
39 |
+
|
40 |
+
**Example**
|
41 |
+
```text
|
42 |
+
[INST] Rozwiąż: 12 * 13 = ? [/INST]
|
43 |
+
<think>12*13 = 156.</think>
|
44 |
+
<answer>156</answer>
|
45 |
+
```
|
46 |
+
|
47 |
+
## Quickstart
|
48 |
+
|
49 |
+
### Transformers (PyTorch)
|
50 |
+
|
51 |
+
```python
|
52 |
+
import torch
|
53 |
+
from transformers import AutoModelForCausalLM, AutoTokenizer
|
54 |
+
|
55 |
+
repo = "ARTEXIT/Llama-PLLuM-8B-instruct-ArtexIT-reasoning"
|
56 |
+
tok = AutoTokenizer.from_pretrained(repo, use_fast=True)
|
57 |
+
model = AutoModelForCausalLM.from_pretrained(repo, torch_dtype="auto", device_map="auto")
|
58 |
+
|
59 |
+
prompt = tok.apply_chat_template(
|
60 |
+
[{"role": "user", "content": "Podaj 3 miasta w Polsce."}],
|
61 |
+
add_generation_prompt=True,
|
62 |
+
tokenize=False,
|
63 |
+
)
|
64 |
+
inputs = tok(prompt, return_tensors="pt").to(model.device)
|
65 |
+
out = model.generate(**inputs, max_new_tokens=64)
|
66 |
+
print(tok.decode(out[0], skip_special_tokens=False))
|
67 |
+
```
|
68 |
+
|
69 |
+
|
70 |
+
## Training (brief)
|
71 |
+
|
72 |
+
- **Method**: GRPO (policy‑gradient reinforcement learning with multiple reward functions).
|
73 |
+
- **Data**: `openai/gsm8k` — License: **MIT**.
|
74 |
+
- **Goal**: consistent two‑block outputs (reasoning + final answer) using the training tags.
|
75 |
+
|
76 |
+
|
77 |
+
## License & Attribution
|
78 |
+
|
79 |
+
This repository contains derivatives of **Llama 3.1** and **PLLuM**:
|
80 |
+
|
81 |
+
- **Llama 3.1 Community License** applies. When redistributing, you must:
|
82 |
+
- include a copy of the license and **prominently display “Built with Llama”**,
|
83 |
+
- include **“Llama” at the beginning of any distributed model’s name** if it was created, trained or fine‑tuned using Llama materials,
|
84 |
+
- keep a **NOTICE** file with the following line:
|
85 |
+
`Llama 3.1 is licensed under the Llama 3.1 Community License, Copyright © Meta Platforms, Inc. All Rights Reserved.`
|
86 |
+
- comply with the **Acceptable Use Policy (AUP)**.
|
87 |
+
- **PLLuM**: please cite the PLLuM work (see **Citation** below).
|
88 |
+
- **Data**: GSM8K is MIT‑licensed; include dataset attribution.
|
89 |
+
|
90 |
+
This repo includes:
|
91 |
+
- `LICENSE` — full text of the **Llama 3.1 Community License**
|
92 |
+
- `USE_POLICY.md` — pointer to the official **Acceptable Use Policy**
|
93 |
+
- `NOTICE` — required Llama attribution line
|
94 |
+
|
95 |
+
> If your (or your affiliates’) products exceeded **700M monthly active users** on the Llama 3.1 release date, you must obtain a separate license from Meta before exercising the rights in the Llama 3.1 license.
|
96 |
+
|
97 |
+
|
98 |
+
## Citation
|
99 |
+
|
100 |
+
If you use PLLuM in research or deployments, please cite:
|
101 |
+
|
102 |
+
```bibtex
|
103 |
+
@unpublished{pllum2025,
|
104 |
+
title={PLLuM: A Family of Polish Large Language Models},
|
105 |
+
author={PLLuM Consortium},
|
106 |
+
year={2025}
|
107 |
+
}
|
108 |
+
```
|
USE POLICY.md
ADDED
@@ -0,0 +1,14 @@
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
+
# Llama 3.1 Acceptable Use Policy (AUP)
|
2 |
+
|
3 |
+
This repository distributes a model derived from Llama 3.1. By accessing or using this model, you agree to the Llama 3.1 Acceptable Use Policy.
|
4 |
+
|
5 |
+
**The most recent, authoritative copy of the AUP is maintained by Meta at:**
|
6 |
+
https://llama.meta.com/llama3_1/use-policy
|
7 |
+
|
8 |
+
For convenience only (non-exhaustive summary), the AUP requires responsible and lawful use and prohibits, among other things, uses that:
|
9 |
+
- Violate laws or regulations;
|
10 |
+
- Exploit, harm, or endanger people (including harassment, discrimination, or incitement to violence);
|
11 |
+
- Infringe privacy or intellectual property rights;
|
12 |
+
- Facilitate creation or distribution of malicious code or high-risk illegal activities.
|
13 |
+
|
14 |
+
If this summary conflicts with the official AUP, **the official AUP controls**. Please read the full AUP at the link above before using the model.
|