Elvy

3lvy

3lvy

AI & ML interests

Training Deep Neural Nets

Recent Activity

upvoted an article about 1 month ago

Uncensor any LLM with abliteration

upvoted a paper about 1 month ago

Distilling LLM Agent into Small Models with Retrieval and Code Tools

upvoted a paper about 1 month ago

TransMLA: Multi-head Latent Attention Is All You Need

View all activity

Organizations

None yet

upvoted an article about 1 month ago

Article

Uncensor any LLM with abliteration

•

Jun 13, 2024

• 619

upvoted 2 papers about 1 month ago

Distilling LLM Agent into Small Models with Retrieval and Code Tools

Paper • 2505.17612 • Published May 23 • 78

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 56

upvoted an article about 2 months ago

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others •

May 12

• 468

liked a Space 4 months ago

2.74k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a dataset 10 months ago

SkunkworksAI/reasoning-0.01

Viewer • Updated Sep 14, 2024 • 29.9k • 234 • 279

liked a model 10 months ago

microsoft/Phi-3.5-vision-instruct

Image-Text-to-Text • 4B • Updated Sep 26, 2024 • 1.01M • 692

Elvy

AI & ML interests

Recent Activity

Organizations

3lvy's activity

Uncensor any LLM with abliteration

Vision Language Models (Better, Faster, Stronger)

The Ultra-Scale Playbook