view article Article Transformers backend integration in SGLang By marcsun13 and 4 others ⢠Jun 23 ⢠51
StarVector SVG Datasets (đSVG-Bench) Collection Datasets for training and evaluating SVG generation models ⢠11 items ⢠Updated Jan 12 ⢠20
GraLoRA: Granular Low-Rank Adaptation for Parameter-Efficient Fine-Tuning Paper ⢠2505.20355 ⢠Published May 26 ⢠36
Portuguese LLM Leaderboard best models â¤ď¸âđĽ Collection A daily uploaded list of models with best evaluations on the PT-LLM leaderboard: ⢠17 items ⢠Updated about 1 hour ago ⢠37
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper ⢠2503.14476 ⢠Published Mar 18 ⢠137
ORPO: Monolithic Preference Optimization without Reference Model Paper ⢠2403.07691 ⢠Published Mar 12, 2024 ⢠68
view article Article Welcome FalconMamba: The first strong attention-free 7B model By JingweiZuo and 5 others ⢠Aug 12, 2024 ⢠112
Gemma release Collection Groups the Gemma models released by the Google team. ⢠40 items ⢠Updated Jul 10 ⢠340
LongNet: Scaling Transformers to 1,000,000,000 Tokens Paper ⢠2307.02486 ⢠Published Jul 5, 2023 ⢠81