cabinet - a mangoxb Collection

mangoxb 's Collections

cabinet

read

cabinet

updated 1 day ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published Mar 31 • 272
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Paper • 2504.10479 • Published 19 days ago • 251
What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models

Paper • 2503.24235 • Published Mar 31 • 53
Seedream 3.0 Technical Report

Paper • 2504.11346 • Published 18 days ago • 52
TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published 11 days ago • 100
Rethinking Reflection in Pre-Training

Paper • 2504.04022 • Published 28 days ago • 77
AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Paper • 2504.01014 • Published Apr 1 • 67
C3PO: Critical-Layer, Core-Expert, Collaborative Pathway Optimization for Test-Time Expert Re-Mixing

Paper • 2504.07964 • Published 23 days ago • 61
An Empirical Study of GPT-4o Image Generation Capabilities

Paper • 2504.05979 • Published 25 days ago • 62
The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Paper • 2504.15521 • Published 11 days ago • 62
Inference-Time Scaling for Generalist Reward Modeling

Paper • 2504.02495 • Published about 1 month ago • 54
One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published 26 days ago • 100
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published Apr 2 • 83
TIP-I2V: A Million-Scale Real Text and Image Prompt Dataset for Image-to-Video Generation

Paper • 2411.04709 • Published Nov 5, 2024 • 27
Improving Video Generation with Human Feedback

Paper • 2501.13918 • Published Jan 23 • 50
ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14 • 140
Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20 • 89
Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6 • 94
Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published Jan 17 • 114
Inference-Time Scaling for Diffusion Models beyond Scaling Denoising Steps

Paper • 2501.09732 • Published Jan 16 • 72
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 276
Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published 4 days ago • 74
Sa2VA: Marrying SAM2 with LLaVA for Dense Grounded Understanding of Images and Videos

Paper • 2501.04001 • Published Jan 7 • 46
A Survey of Interactive Generative Video

Paper • 2504.21853 • Published 3 days ago • 36