Raincleared commited on
Commit
2dc1626
·
verified ·
1 Parent(s): 3a4eab6

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -0
README.md CHANGED
@@ -12,3 +12,17 @@ This is the original 3B BlockFFN checkpoint used in the paper *BlockFFN: Towards
12
  You can load and use this model simply by using `AutoTokenizer` and `AutoModelForCausalLM`.
13
 
14
  Links: [[Paper](https://arxiv.org/pdf/2507.08771)] [[Codes](https://github.com/thunlp/BlockFFN)]
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
  You can load and use this model simply by using `AutoTokenizer` and `AutoModelForCausalLM`.
13
 
14
  Links: [[Paper](https://arxiv.org/pdf/2507.08771)] [[Codes](https://github.com/thunlp/BlockFFN)]
15
+
16
+ ### Citation
17
+
18
+ If you find our work useful for your research, please kindly cite our paper as follows:
19
+
20
+ ```
21
+ @article{song2025blockffn,
22
+ title={{BlockFFN}: Towards End-Side Acceleration-Friendly Mixture-of-Experts with Chunk-Level Activation Sparsity},
23
+ author={Chenyang Song and Weilin Zhao and Xu Han and Chaojun Xiao and Yingfa Chen and Yuxuan Li and Zhiyuan Liu and Maosong Sun},
24
+ journal={arXiv preprint arXiv:2507.08771},
25
+ year={2025},
26
+ url={https://arxiv.org/pdf/2507.08771},
27
+ }
28
+ ```