O1-Pruner: Length-Harmonizing Fine-Tuning for O1-Like Reasoning Pruning Paper ⢠2501.12570 ⢠Published Jan 22 ⢠28
Reverse Thinking Makes LLMs Stronger Reasoners Paper ⢠2411.19865 ⢠Published Nov 29, 2024 ⢠23
Large Language Models Can Self-Improve in Long-context Reasoning Paper ⢠2411.08147 ⢠Published Nov 12, 2024 ⢠67
OpenCodeReasoning Collection Reasoning data for supervised finetuning of LLMs to advance data distillation for competitive coding ⢠5 items ⢠Updated 1 day ago ⢠5
Llama Nemotron Collection Open, Production-ready Enterprise Models ⢠4 items ⢠Updated 1 day ago ⢠36
A Unified Agentic Framework for Evaluating Conditional Image Generation Paper ⢠2504.07046 ⢠Published 6 days ago ⢠28
SmolVLM: Redefining small and efficient multimodal models Paper ⢠2504.05299 ⢠Published 8 days ago ⢠158
One-Minute Video Generation with Test-Time Training Paper ⢠2504.05298 ⢠Published 8 days ago ⢠93
OpenCodeReasoning: Advancing Data Distillation for Competitive Coding Paper ⢠2504.01943 ⢠Published 13 days ago ⢠12
view article Article Training and Finetuning Reranker Models with Sentence Transformers v4 21 days ago ⢠110
ShieldAgent: Shielding Agents via Verifiable Safety Policy Reasoning Paper ⢠2503.22738 ⢠Published 20 days ago ⢠15
A Survey on the Optimization of Large Language Model-based Agents Paper ⢠2503.12434 ⢠Published about 1 month ago ⢠2