TabularARGN: A Flexible and Efficient Auto-Regressive Framework for Generating High-Fidelity Synthetic Data Paper • 2501.12012 • Published Jan 21 • 9
Article Tiny Agents: a MCP-powered agent in 50 lines of code By julien-c • Apr 25 • 283
Article Cohere on Hugging Face Inference Providers 🔥 By burtenshaw and 6 others • Apr 16 • 126
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 404
Article I Clicked “I Agree”, But What Am I Really Consenting To? By giadap • Mar 26 • 24
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM By ariG23498 and 3 others • Mar 12 • 437
Article Deepseek R1 Robotic Reasoning with Checkers By codyreading and 4 others • Mar 5 • 14
Article Welcome to Inference Providers on the Hub 🔥 By julien-c and 6 others • Jan 28 • 484
Article Open-R1: a fully open reproduction of DeepSeek-R1 By eliebak and 2 others • Jan 28 • 868
Article WWDC 24: Running Mistral 7B with Core ML By FL33TW00D-HF and 3 others • Jul 22, 2024 • 61
The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism Paper • 2407.10457 • Published Jul 15, 2024 • 25
Article Data Is Better Together: A Look Back and Forward By sdiazlor and 2 others • Jun 20, 2024 • 20
Portrait4D-v2: Pseudo Multi-View Data Creates Better 4D Head Synthesizer Paper • 2403.13570 • Published Mar 20, 2024 • 3
Instruction Pre-Training: Language Models are Supervised Multitask Learners Paper • 2406.14491 • Published Jun 20, 2024 • 94
PoSE: Efficient Context Window Extension of LLMs via Positional Skip-wise Training Paper • 2309.10400 • Published Sep 19, 2023 • 26
Evaluating Text-to-Visual Generation with Image-to-Text Generation Paper • 2404.01291 • Published Apr 1, 2024 • 6
Mamba: Linear-Time Sequence Modeling with Selective State Spaces Paper • 2312.00752 • Published Dec 1, 2023 • 143