Daily Papers

by AK and the research community

May 9

Submitted by

foggyforest

Perception, Reason, Think, and Plan: A Survey on Large Multimodal Reasoning Models

·
22 authors

1

Submitted by

scofield7419

On Path to Multimodal Generalist: General-Level and General-Bench

·
32 authors

5

Submitted by

Lp256

Flow-GRPO: Training Flow Matching Models via Online RL

·
9 authors

2

Submitted by

vvibt

Sentient Agent as a Judge: Evaluating Higher-Order Social Cognition in Large Language Models

·
13 authors

3

Submitted by

yuhuixu

Scalable Chain of Thoughts via Elastic Reasoning

·
6 authors

1

Submitted by

akhaliq

Generating Physically Stable and Buildable LEGO Designs from Text

·
6 authors

Submitted by

DavidLeon

FG-CLIP: Fine-Grained Visual and Textual Alignment

·
8 authors

1

Submitted by

WHB139426

StreamBridge: Turning Your Offline Video Large Language Model into a Proactive Streaming Assistant

·
9 authors

1

Submitted by

hzxie

3D Scene Generation: A Survey

·
5 authors

1

Submitted by

arianhosseini

Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers

·
5 authors

Submitted by

yyxsghx

ICon: In-Context Contribution for Automatic Data Selection

·
5 authors

1

Submitted by

shengz

X-Reasoner: Towards Generalizable Reasoning Across Modalities and Domains

·
12 authors

2

Submitted by

yongzx

Crosslingual Reasoning through Test-Time Scaling

·
10 authors

1

Submitted by

Samir55

PlaceIt3D: Language-Guided Object Placement in Real 3D Scenes

·
9 authors

1

Submitted by

RanjanSapkota

Vision-Language-Action Models: Concepts, Progress, Applications and Challenges

·
4 authors

1

Submitted by

pengliu123

LiftFeat: 3D Geometry-Aware Local Feature Matching

·
7 authors

1

Submitted by

xylu

WaterDrum: Watermarking for Data-centric Unlearning Metric

·
9 authors

1

Submitted by

dogtooth

SIMPLEMIX: Frustratingly Simple Mixing of Off- and On-policy Data in Language Model Preference Learning

·
2 authors

Submitted by

soliz1998

Chain-of-Thought Tokens are Computer Program Variables

·
3 authors

1

Submitted by

PALIN2018

BrowseComp-ZH: Benchmarking Web Browsing Ability of Large Language Models in Chinese

·
16 authors

1