From Uncertainty to Trust: Enhancing Reliability in Vision-Language Models with Uncertainty-Guided Dropout Decoding Paper • 2412.06474 • Published Dec 9, 2024
Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment Paper • 2501.09620 • Published Jan 16
S'MoRE: Structural Mixture of Residual Experts for LLM Fine-tuning Paper • 2504.06426 • Published Apr 8 • 2
CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning Paper • 2503.19900 • Published Mar 25
RecoWorld: Building Simulated Environments for Agentic Recommender Systems Paper • 2509.10397 • Published Sep 12 • 7
StreamMem: Query-Agnostic KV Cache Memory for Streaming Video Understanding Paper • 2508.15717 • Published Aug 21 • 1
Let it Calm: Exploratory Annealed Decoding for Verifiable Reinforcement Learning Paper • 2510.05251 • Published Oct 6 • 7
HumanMM: Global Human Motion Recovery from Multi-shot Videos Paper • 2503.07597 • Published Mar 10 • 2
Quantifying Generalization Complexity for Large Language Models Paper • 2410.01769 • Published Oct 2, 2024 • 13
MJ-Bench: Is Your Multimodal Reward Model Really a Good Judge for Text-to-Image Generation? Paper • 2407.04842 • Published Jul 5, 2024 • 56
HALC: Object Hallucination Reduction via Adaptive Focal-Contrast Decoding Paper • 2403.00425 • Published Mar 1, 2024 • 1
AutoPRM: Automating Procedural Supervision for Multi-Step Reasoning via Controllable Question Decomposition Paper • 2402.11452 • Published Feb 18, 2024 • 1
Safe Reinforcement Learning via Hierarchical Adaptive Chance-Constraint Safeguards Paper • 2310.03379 • Published Oct 5, 2023
Multi-Modality Guidance Network For Missing Modality Inference Paper • 2309.03452 • Published Sep 7, 2023
Evaluating Machine Learning Models with NERO: Non-Equivariance Revealed on Orbits Paper • 2305.19889 • Published May 31, 2023
Breaking the Curse of Quality Saturation with User-Centric Ranking Paper • 2305.15333 • Published May 24, 2023