Celso F

celsowm

AI & ML interests

None yet

Recent Activity

upvoted an article about 10 hours ago

Transformers backend integration in SGLang

new activity 8 days ago

huggingface/InferenceSupport:tencent/Hunyuan-7B-Instruct

liked a model 12 days ago

CohereLabs/aya-expanse-32b

Organizations

None yet

upvoted an article about 10 hours ago

Transformers backend integration in SGLang

Article • and 4 others • Jun 23 • 51

upvoted a collection 2 months ago

StarVector SVG Datasets (🏆SVG-Bench)

Collection

Datasets for training and evaluating SVG generation models • 11 items • Updated Jan 12 • 20

upvoted a paper 2 months ago

GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning

Paper • 2505.20355 • Published May 26 • 36

upvoted a paper 3 months ago

Qwen3 Technical Report

Paper • 2505.09388 • Published May 14 • 274

upvoted 3 collections 4 months ago

upvoted 2 papers 5 months ago

DAPO: An Open-Source LLM Reinforcement Learning System at Scale

Paper • 2503.14476 • Published Mar 18 • 137

Long Context Tuning for Video Generation

Paper • 2503.10589 • Published Mar 13 • 14

upvoted a paper 9 months ago

ORPO: Monolithic Preference Optimization without Reference Model

Paper • 2403.07691 • Published Mar 12, 2024 • 68

upvoted an article 12 months ago

Welcome FalconMamba: The first strong attention-free 7B model

Article • and 5 others • Aug 12, 2024 • 112

upvoted an article about 1 year ago

Fine-tune Llama 3 with ORPO

Article • Apr 22, 2024 • 239

upvoted a collection over 1 year ago

Gemma release

Collection

Groups the Gemma models released by the Google team. • 40 items • Updated Jul 10 • 340

upvoted a paper about 2 years ago

LongNet: Scaling Transformers to 1,000,000,000 Tokens

Paper • 2307.02486 • Published Jul 5, 2023 • 81
