Daily Papers

by AK and the research community

Aug 22

Submitted by

ZwwWayne

Intern-S1: A Scientific Multimodal Foundation Model

·
175 authors

Submitted by

xhyandwyy

Mobile-Agent-v3: Foundamental Agents for GUI Automation

·
15 authors

Submitted by

Kevin355

LiveMCP-101: Stress Testing and Diagnosing MCP-enabled Agents on Challenging Queries

·
14 authors

Submitted by

jiaweizhao

Deep Think with Confidence

·
4 authors

Submitted by

taesiri

Waver: Wave Your Way to Lifelike Video Generation

·
10 authors

Submitted by

haoningwu

SceneGen: Single-Image 3D Scene Generation in One Feedforward Pass

·
4 authors

Submitted by

taesiri

A Survey on Large Language Model Benchmarks

·
14 authors

Submitted by

cai-qi

Visual Autoregressive Modeling for Instruction-Guided Image Editing

·
8 authors

Submitted by

taesiri

ATLAS: Decoupling Skeletal and Shape Parameters for Expressive Parametric Human Modeling

·
10 authors

Submitted by

universea

aiXiv: A Next-Generation Open Access Ecosystem for Scientific Discovery Generated by AI Scientists

·
23 authors

Submitted by

taesiri

"Does the cafe entrance look accessible? Where is the door?" Towards Geospatial AI Agents for Visual Inquiries

·
10 authors

Submitted by

taesiri

When and What: Diffusion-Grounded VideoLLM with Entity Aware Segmentation for Long Video Understanding

·
3 authors

Submitted by

amazingj

Fin-PRM: A Domain-Specialized Process Reward Model for Financial Reasoning in Large Language Models

·
7 authors

Submitted by

thewhole

Snap-Snap: Taking Two Images to Reconstruct 3D Human Gaussians in Milliseconds

·
9 authors

Submitted by

AdinaY

INTIMA: A Benchmark for Human-AI Companionship Behavior

·
3 authors

Submitted by

YirongSun

LLaSO: A Foundational Framework for Reproducible Research in Large Language and Speech Model

·
8 authors