Update README.md
README.md CHANGED
@@ -63,6 +63,16 @@ If you use this model or the underlying concepts in your research, please cite o
   primaryClass={cs.CL},
   url={https://arxiv.org/abs/2507.04886},
 }
+
+@misc{bochkov2025growingtransformersmodularcomposition,
+      title={Growing Transformers: Modular Composition and Layer-wise Expansion on a Frozen Substrate},
+      author={A. Bochkov},
+      year={2025},
+      eprint={2507.07129},
+      archivePrefix={arXiv},
+      primaryClass={cs.LG},
+      url={https://arxiv.org/abs/2507.07129},
+}
 ```
 
 This work demonstrates that transformer blocks, not token embeddings, carry the semantic burden in LLMs — a step toward modular, fusable, multilingual LMs.