topshik commited on
Commit
15315f0
·
verified ·
1 Parent(s): 9a97ead

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +4 -1
README.md CHANGED
@@ -222,7 +222,8 @@ The uploaded version on Hugging Face retains the bf16 format for public use.
222
 
223
  Designed for integration into professional developer tooling (e.g., intelligent code suggestions in IDEs), AI-powered coding assistants, and research on code understanding and generation, Mellum is also well-suited for educational applications and fine-tuning experiments.
224
 
225
- This release includes a base model, and some SFT models as well.
 
226
  Keep in mind that base model is not fine-tuned for downstream tasks out-of-the-box, however, it is fully capable of supporting supervised fine-tuning (SFT) and reinforcement learning (RL) for adaptation to specific applications.
227
 
228
  # Training Data
@@ -266,6 +267,8 @@ Since Mellum has a maximum context window of 8k, we report here both the average
266
  | Mellum-4b-sft-python | 33.16% | 36.11% | 57.10% | 42.12% |
267
  | Mellum-4b-base | 25.30% | 38.39% | 50.65% | 38.11% |
268
 
 
 
269
  ## HumanEval Infilling
270
  - Type: single-line and multi-line
271
  - Languages: Python
 
222
 
223
  Designed for integration into professional developer tooling (e.g., intelligent code suggestions in IDEs), AI-powered coding assistants, and research on code understanding and generation, Mellum is also well-suited for educational applications and fine-tuning experiments.
224
 
225
+ This release includes a base model, and Python SFT models as well.
226
+ Models for other languages will be released soon.
227
  Keep in mind that base model is not fine-tuned for downstream tasks out-of-the-box, however, it is fully capable of supporting supervised fine-tuning (SFT) and reinforcement learning (RL) for adaptation to specific applications.
228
 
229
  # Training Data
 
267
  | Mellum-4b-sft-python | 33.16% | 36.11% | 57.10% | 42.12% |
268
  | Mellum-4b-base | 25.30% | 38.39% | 50.65% | 38.11% |
269
 
270
+ We are investigating potential issues with SAFIM evaluation.
271
+
272
  ## HumanEval Infilling
273
  - Type: single-line and multi-line
274
  - Languages: Python