zhengwenzhen committed on
Commit 8d6a0ac · verified · 1 Parent(s): 62bf797

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +2 -5
README.md CHANGED
@@ -23,7 +23,7 @@ This model is part of the [StepLaw-N_119M-D_3.0B](https://huggingface.co/collect
  - **Feed-forward network size (FFN)**: 6416
  - **Attention heads**: 12
  - **Layers**: 7
- - **Parameter count**: 119MM
+ - **Parameter count**: 119M
 
  ### Training Parameters
  - **Learning rate (lr)**: 7.812e-03
@@ -48,7 +48,4 @@ model = AutoModelForCausalLM.from_pretrained(model_name, trust_remote_code=True)
  inputs = tokenizer("A long time ago in a galaxy far, far away", return_tensors="pt")
  outputs = model.generate(**inputs, max_length=100)
  print(tokenizer.decode(outputs[0], skip_special_tokens=True))
- ```## Part of StepLaw Project
-
- StepLaw is an initiative to provide thousands of models for optimal hyperparameter research.
- Visit [StepLaw Project](https://step-law.github.io/) for more information.
+ ```