Self-Adjust Softmax - EMNLP 2025 Main Conference

The checkpoint for [Self-Adjust Softmax](https://arxiv.org/abs/2502.18277):

Gausson/gpt-neox-125m-deduped-SA • 0.2B • Updated Jul 2
SepLLM - ICML 2025

Related code and checkpoints for the [SepLLM - ICML 2025](https://arxiv.org/abs/2412.12094) paper:

transformers-community/sep_cache • 8B • Updated Aug 4
Gausson/pythia-160m-deduped-n64-SepLLM • 0.2B • Updated Jul 2
Gausson/pythia-160m-deduped-n64h-SepLLM • 0.2B • Updated Jul 2
Gausson/pythia-160m-deduped-n64-RoBiPE-SepLLM • 0.2B • Updated Jul 2