kaupane/lichess-stockfish-tactics-llm-reasoning Viewer โข Updated about 17 hours ago โข 5.06k โข 96
kaupane/lichess-stockfish-tactics-llm-reasoning Viewer โข Updated about 17 hours ago โข 5.06k โข 96
Mxode/Fineweb-Edu-Chinese-V2.1-merged-score4_5 Viewer โข Updated 11 days ago โข 17.8M โข 430 โข 3
high-quality Chinese training datasets Collection a suite of high-quality Chinese datasets, used for pretraining, fine-tuning or preference alignment. And the models trained on these datasets. โข 14 items โข Updated 17 days ago โข 14
view post Post 1781 I tested Qwen3 235b and 32b and they are both worse than Qwen2.5 32b. onekq-ai/WebApp1K-models-leaderboardI used non-thinking mode because the thinking mode is too slow ๐ข๐ข๐ข to be usable in any way.Sigh ... See translation 12 replies ยท ๐ 7 7 ๐ 2 2 ๐ค 1 1 + Reply
Unsloth 4-bit Dynamic Quants Collection Unsloths Dynamic 4bit Quants selectively skips quantizing certain parameters; greatly improving accuracy while only using <10% more VRAM than BnB 4bit โข 28 items โข Updated 12 days ago โข 80