Update README.md
Browse files
README.md
CHANGED
@@ -222,7 +222,8 @@ The uploaded version on Hugging Face retains the bf16 format for public use.
|
|
222 |
|
223 |
Designed for integration into professional developer tooling (e.g., intelligent code suggestions in IDEs), AI-powered coding assistants, and research on code understanding and generation, Mellum is also well-suited for educational applications and fine-tuning experiments.
|
224 |
|
225 |
-
This release includes a base model, and
|
|
|
226 |
Keep in mind that base model is not fine-tuned for downstream tasks out-of-the-box, however, it is fully capable of supporting supervised fine-tuning (SFT) and reinforcement learning (RL) for adaptation to specific applications.
|
227 |
|
228 |
# Training Data
|
@@ -266,6 +267,8 @@ Since Mellum has a maximum context window of 8k, we report here both the average
|
|
266 |
| Mellum-4b-sft-python | 33.16% | 36.11% | 57.10% | 42.12% |
|
267 |
| Mellum-4b-base | 25.30% | 38.39% | 50.65% | 38.11% |
|
268 |
|
|
|
|
|
269 |
## HumanEval Infilling
|
270 |
- Type: single-line and multi-line
|
271 |
- Languages: Python
|
|
|
222 |
|
223 |
Designed for integration into professional developer tooling (e.g., intelligent code suggestions in IDEs), AI-powered coding assistants, and research on code understanding and generation, Mellum is also well-suited for educational applications and fine-tuning experiments.
|
224 |
|
225 |
+
This release includes a base model, and Python SFT models as well.
|
226 |
+
Models for other languages will be released soon.
|
227 |
Keep in mind that base model is not fine-tuned for downstream tasks out-of-the-box, however, it is fully capable of supporting supervised fine-tuning (SFT) and reinforcement learning (RL) for adaptation to specific applications.
|
228 |
|
229 |
# Training Data
|
|
|
267 |
| Mellum-4b-sft-python | 33.16% | 36.11% | 57.10% | 42.12% |
|
268 |
| Mellum-4b-base | 25.30% | 38.39% | 50.65% | 38.11% |
|
269 |
|
270 |
+
We are investigating potential issues with SAFIM evaluation.
|
271 |
+
|
272 |
## HumanEval Infilling
|
273 |
- Type: single-line and multi-line
|
274 |
- Languages: Python
|