Bowen Peng
bloc97
AI & ML interests
Machine Learning, Computer Graphics, Language Models
Recent Activity
updated
a model
3 days ago
bloc97/150m-auto-88000
published
a model
3 days ago
bloc97/150m-auto-88000
updated
a model
3 days ago
bloc97/150m-rand-88000
Organizations
bloc97's activity
How did you train this without going OOM in RAM & VRAM?
3
#15 opened about 1 year ago
by
vicplus

VRAM usage for full 128k tokens
7
#5 opened over 1 year ago
by
Hypersniper

sliding_window = 131072? Sliding window attention doesn't work for 128?
1
#4 opened over 1 year ago
by
keyishen
Hardware requirements for the model.
2
#1 opened over 1 year ago
by
Sc0urge