3 1

Zhijian Zhuo

BryceZhuo

AI & ML interests

None yet

Recent Activity

upvoted a collection 7 days ago

Papers - Pre-training

new activity 4 months ago

Open-Foundation-Models/PolyReLU_1B:Add pipeline tag: text-generation

new activity 4 months ago

Open-Foundation-Models/PolyNorm_1B:Add pipeline tag: text-generation

View all activity

Organizations

upvoted a collection 7 days ago

Papers - Pre-training

Collection

11 items • Updated Dec 15, 2024 • 1

New activity in Open-Foundation-Models/PolyReLU_1B 4 months ago

Add pipeline tag: text-generation

#1 opened 4 months ago by

nielsr

New activity in Open-Foundation-Models/PolyNorm_1B 4 months ago

Add pipeline tag: text-generation

#1 opened 4 months ago by

nielsr

authored a paper 4 months ago

Polynomial Composition Activations: Unleashing the Dynamics of Large Language Models

Paper • 2411.03884 • Published Nov 6, 2024 • 29

updated 2 models 4 months ago

Open-Foundation-Models/PolyReLU_1B

Text Generation • Updated Apr 8 • 2

Open-Foundation-Models/PolyNorm_1B

Text Generation • Updated Apr 8 • 2

published 2 models 4 months ago

Open-Foundation-Models/PolyNorm_1B

Text Generation • Updated Apr 8 • 2

Open-Foundation-Models/PolyReLU_1B

Text Generation • Updated Apr 8 • 2

authored 2 papers 5 months ago

HybridNorm: Towards Stable and Efficient Transformer Training via Hybrid Normalization

Paper • 2503.04598 • Published Mar 6 • 20

Scale-Distribution Decoupling: Enabling Stable and Effective Training of Large Language Models

Paper • 2502.15499 • Published Feb 21 • 14

Zhijian Zhuo

AI & ML interests

Recent Activity

Organizations

BryceZhuo's activity

Add pipeline tag: text-generation

Add pipeline tag: text-generation