Djuunaa
djuna
Recent Activity
liked a model 36 minutes ago: aipgpt/Txt-Polisher-Douyin-Style
liked a model 3 days ago: THUDM/GLM-4-9B-0414
liked a model 13 days ago: Delta-Vector/Hamanasu-Magnum-QwQ-32B
djuna's activity
Different rms_norm_eps
2
#6 opened 5 months ago
by
djuna

How to use the model?
3
#1 opened about 2 months ago
by
Omrisr

I think what you're doing here is really helpful
1
1
#2 opened about 2 months ago
by
sometimesanotion

Tokenizer Details
7
#2 opened 3 months ago
by
qingy2024

Update config.json
#1 opened 3 months ago
by
djuna

Adding Evaluation Results
#1 opened 3 months ago
by
djuna

Error: Unimplemented merge method sce
5
#35 opened 3 months ago
by
xi0v

Upload necessary tokenizer
1
2
#2 opened 3 months ago
by
djuna

feat: Choosable CLI, Custom Output Shard Size, LORA extraction
9
#30 opened 6 months ago
by
djuna

RYS with Qwen2.5
1
1
#5 opened 6 months ago
by
PSM24

Some sample
1
#3 opened 4 months ago
by
djuna

[MODELS] Discussion
34
719
#372 opened about 1 year ago
by
victor

14B model detected as 7B
11
#1049 opened 4 months ago
by
djuna

Adding Evaluation Results
#2 opened 4 months ago
by
djuna

Some sample
1
#1 opened 4 months ago
by
djuna

Adding Evaluation Results
#2 opened 4 months ago
by
djuna

Some sample
1
#1 opened 4 months ago
by
djuna

Adding Evaluation Results
#1 opened 4 months ago
by
leaderboard-pr-bot

Difference between this and the other (100 steps) model?
7
#1 opened 8 months ago
by
lemon07r

Hoping to start learning
1
#6 opened 4 months ago
by
arcusprints
