9 11 41

ShadowLilac PRO

shadowlilac

AI & ML interests

Thinking with two Systems

Recent Activity

liked a model 1 day ago

PowerInfer/SmallThinker-21BA3B-Instruct

updated a dataset 5 days ago

shadowlilac/weblate-translations-processed

published a dataset 5 days ago

shadowlilac/weblate-translations-processed

View all activity

Organizations

upvoted an article 12 days ago

Article

Fine-tune Llama 3 with ORPO

•

Apr 22, 2024

• 239

upvoted 2 papers about 2 months ago

Discrete Diffusion in Large Language and Multimodal Models: A Survey

Paper • 2506.13759 • Published Jun 16 • 42

Astra: Toward General-Purpose Mobile Robots via Hierarchical Multimodal Learning

Paper • 2506.06205 • Published Jun 6 • 29

upvoted 3 papers 2 months ago

MotionSight: Boosting Fine-Grained Motion Understanding in Multimodal LLMs

Paper • 2506.01674 • Published Jun 2 • 28

On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes

Paper • 2306.13649 • Published Jun 23, 2023 • 22

GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Paper • 2505.20355 • Published May 26 • 36

upvoted an article 3 months ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

and 14 others •

Dec 19, 2024

• 673

upvoted a paper 4 months ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 300

upvoted a paper 5 months ago

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published Feb 19 • 70

upvoted 2 papers 6 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 164

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 149