Hugging Face
Dan Goldstein
SmerkyG
14 followers · 5 following
AI & ML interests
None yet
Recent Activity
Upvoted a paper about 3 hours ago: RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale
Commented on a paper about 3 hours ago: RADLADS: Rapid Attention Distillation to Linear Attention Decoders at Scale
Updated a model about 6 hours ago: recursal/QRWKV7-7B-Instruct
Organizations
Papers (3)
arxiv:2503.14456
arxiv:2407.12077
arxiv:2404.05892
Models (18)
SmerkyG/Qwen3Softpick-8B-Base • Updated 3 days ago
SmerkyG/RWKV7-1.5B-World3-128k-250309 • Updated Mar 9 • 1
SmerkyG/rwkv7-0.4B-world • Text Generation • Updated Mar 2 • 5
SmerkyG/RWKV7-2.9B-World3-128k-250225 • Updated Feb 26 • 1
SmerkyG/rwkv7-1.5b-ctxlen-tests • Updated Feb 4
SmerkyG/RWKV7-Goose-0.1B-Pile-HF • Updated Feb 2 • 28
SmerkyG/RWKV7-Goose-0.4B-Pile-HF • Updated Feb 2 • 1
SmerkyG/RWKV7-Goose-1.4B-Pile-HF • Updated Feb 2
SmerkyG/RWKV7-Goose-0.1B-World2.8-HF • Updated Dec 18, 2024 • 10 • 1
SmerkyG/rwkv-6-world-v2.1-3b • Text Generation • Updated Jun 6, 2024 • 14
Datasets (1)
SmerkyG/DCLM-10B-Qwen2-binidx • Updated Mar 26 • 77