new

Get trending papers in your email inbox once a day!

Get trending papers in your email inbox!

Daily Papers

byAK and the research community

Jun 13

Submitted by

YuSun-AI

ReasonMed: A 370K Multi-Agent Generated Dataset for Advancing Medical Reasoning

·
10 authors

3

Submitted by

itaowe

SWE-Factory: Your Automated Factory for Issue Resolution Training Data and Evaluation Benchmarks

·
9 authors

2

Submitted by

Min-Jaewon

Text-Aware Image Restoration with Diffusion Models

·
9 authors

2

Submitted by

stefan-it

Magistral

·
100 authors

Submitted by

YunxinLi

AniMaker: Automated Multi-Agent Animated Storytelling with MCTS-Driven Clip Generation

·
6 authors

4

Submitted by

awojustin

VRBench: A Benchmark for Multi-Step Reasoning in Long Narrative Videos

·
17 authors

2

Submitted by

gallilmaimon

Discrete Audio Tokens: More Than a Survey!

·
21 authors

2

Submitted by

Howe77

Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training

·
4 authors

2

Submitted by

sayakpaul

Fine-Grained Perturbation Guidance via Attention Head Selection

·
10 authors

Submitted by

Owen777

PosterCraft: Rethinking High-Quality Aesthetic Poster Generation in a Unified Framework

·
14 authors

Submitted by

dawn0815

Optimus-3: Towards Generalist Multimodal Minecraft Agents with Scalable Task Experts

·
7 authors

Submitted by

BiaoGong

Ming-Omni: A Unified Multimodal Model for Perception and Generation

·
58 authors

2

Submitted by

upup-ashton-wang

Resa: Transparent Reasoning Models via SAEs

·
7 authors

2

Submitted by

avery00

VideoDeepResearch: Long Video Understanding With Agentic Tool Using

·
5 authors

Submitted by

xhluca

Build the web for agents, not agents for the web

·
4 authors

2

Submitted by

reach-vb

Comment on The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity

·
2 authors

Submitted by

Ningyu

AutoMind: Adaptive Knowledgeable Agent for Automated Data Science

·
9 authors

2

Submitted by

zbrl

CreatiPoster: Towards Editable and Controllable Multi-Layer Graphic Design Generation

·
9 authors

2

Submitted by

Ningyu

ChineseHarm-Bench: A Chinese Harmful Content Detection Benchmark

·
10 authors

2

Submitted by

zhiyang1

LaTtE-Flow: Layerwise Timestep-Expert Flow-based Transformer

·
9 authors

2

Submitted by

billpsomas

Attention, Please! Revisiting Attentive Probing for Masked Image Modeling

·
9 authors

2

Submitted by

Speeeed

Compound AI Systems Optimization: A Survey of Methods, Challenges, and Future Directions

·
6 authors

2

Submitted by

LavenderLA

UniPre3D: Unified Pre-training of 3D Point Cloud Models with Cross-Modal Gaussian Splatting

·
4 authors

3

Submitted by

Wesleythu

VerIF: Verification Engineering for Reinforcement Learning in Instruction Following

·
6 authors

2

Submitted by

codelion

Eliciting Fine-Tuned Transformer Capabilities via Inference-Time Techniques

·
1 authors

2

Submitted by

dxlong2000

What Makes a Good Natural Language Prompt?

·
7 authors

2

Submitted by

benfielding

NoLoCo: No-all-reduce Low Communication Training Method for Large Models

·
5 authors

Submitted by

wanglz14

DreamActor-H1: High-Fidelity Human-Product Demonstration Video Generation via Motion-designed Diffusion Transformers

·
7 authors

Submitted by

msadat97

Token Perturbation Guidance for Diffusion Models

·
4 authors

2

Submitted by

ordavids1

Decomposing MLP Activations into Interpretable Features via Semi-Nonnegative Matrix Factorization

·
3 authors

2

Submitted by

vincolle

TeleMath: A Benchmark for Large Language Models in Telecom Mathematical Problem Solving

·
6 authors

2

Submitted by

kev95

Draft-based Approximate Inference for LLMs

·
6 authors

Submitted by

Acruxos

LLM Unlearning Should Be Form-Independent

·
3 authors

2

Submitted by

pkargupta

TaxoAdapt: Aligning LLM-Based Multidimensional Taxonomy Construction to Evolving Research Corpora

·
6 authors

2

Submitted by

pkargupta

Beyond True or False: Retrieval-Augmented Hierarchical Analysis of Nuanced Claims

·
3 authors

2

Submitted by

xinjjj

EmbodiedGen: Towards a Generative 3D World Engine for Embodied Intelligence

·
8 authors

2

Submitted by

hlzhang109

Discovering Hierarchical Latent Capabilities of Language Models via Causal Representation Learning

·
4 authors

Submitted by

JJ-TMT

Breaking Data Silos: Towards Open and Scalable Mobility Foundation Models via Generative Continual Learning

·
5 authors

2

Submitted by

Franck-Dernoncourt

LaMP-Cap: Personalized Figure Caption Generation With Multimodal Figure Profiles

·
11 authors

Submitted by

yiren98

MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based Attacks

·
4 authors

Submitted by

Nickwzk

StreamSplat: Towards Online Dynamic 3D Reconstruction from Uncalibrated Video Streams

·
5 authors

2