GENIE: Gaussian Encoding for Neural Radiance Fields Interactive Editing Paper • 2508.02831 • Published 8 days ago • 11
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm Paper • 2507.18553 • Published 19 days ago • 39
LAPO: Internalizing Reasoning Efficiency via Length-Adaptive Policy Optimization Paper • 2507.15758 • Published 22 days ago • 34
A Survey of Context Engineering for Large Language Models Paper • 2507.13334 • Published 26 days ago • 236
view article Article Featherless AI on Hugging Face Inference Providers 🔥 By sbrandeis and 5 others • Jun 12 • 46
Embodied Web Agents: Bridging Physical-Digital Realms for Integrated Agent Intelligence Paper • 2506.15677 • Published Jun 18 • 24
Wait, We Don't Need to "Wait"! Removing Thinking Tokens Improves Reasoning Efficiency Paper • 2506.08343 • Published Jun 10 • 49
Inherently Faithful Attention Maps for Vision Transformers Paper • 2506.08915 • Published Jun 10 • 4
SuperWriter: Reflection-Driven Long-Form Generation with Large Language Models Paper • 2506.04180 • Published Jun 4 • 32
Robot-R1: Reinforcement Learning for Enhanced Embodied Reasoning in Robotics Paper • 2506.00070 • Published May 29 • 28
view article Article No GPU left behind: Unlocking Efficiency with Co-located vLLM in TRL By toslali-ibm and 5 others • Jun 3 • 81
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models Paper • 2505.24864 • Published May 30 • 134
view article Article Blazingly fast whisper transcriptions with Inference Endpoints By mfuntowicz and 5 others • May 13 • 74
Monolith: Real Time Recommendation System With Collisionless Embedding Table Paper • 2209.07663 • Published Sep 16, 2022 • 2
Agentic Reasoning and Tool Integration for LLMs via Reinforcement Learning Paper • 2505.01441 • Published Apr 28 • 39
LLM papers Collection It is a collection of papers that are useful in studying LLM. • 14 items • Updated Apr 3, 2024 • 14