james curry

ainbo

AI & ML interests

None yet

Recent Activity

upvoted a paper 4 days ago

Self-Distilled Agentic Reinforcement Learning

Paper • 2605.15155 • Published 6 days ago • 100

upvoted a paper 5 days ago

MemEye: A Visual-Centric Evaluation Framework for Multimodal Agent Memory

Paper • 2605.15128 • Published 6 days ago • 60

upvoted 2 papers 6 days ago

Qwen-Image-VAE-2.0 Technical Report

Paper • 2605.13565 • Published 7 days ago • 57

AnyFlow: Any-Step Video Diffusion Model with On-Policy Flow Map Distillation

Paper • 2605.13724 • Published 7 days ago • 95

upvoted 2 papers 7 days ago

CausalCine: Real-Time Autoregressive Generation for Multi-Shot Video Narratives

Paper • 2605.12496 • Published 8 days ago • 28

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

Paper • 2605.12500 • Published 8 days ago • 181

upvoted a paper 9 days ago

HumanNet: Scaling Human-centric Video Learning to One Million Hours

Paper • 2605.06747 • Published 13 days ago • 51

upvoted a paper 12 days ago

Continuous Latent Diffusion Language Model

Paper • 2605.06548 • Published 13 days ago • 77

upvoted a paper 18 days ago

Visual Generation in the New Era: An Evolution from Atomic Mapping to Agentic World Modeling

Paper • 2604.28185 • Published 20 days ago • 90

upvoted a paper 20 days ago

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

Paper • 2604.26752 • Published 21 days ago • 106

upvoted a paper 21 days ago

World-R1: Reinforcing 3D Constraints for Text-to-Video Generation

Paper • 2604.24764 • Published 23 days ago • 118

upvoted a paper 22 days ago

ClawMark: A Living-World Benchmark for Multi-Turn, Multi-Day, Multimodal Coworker Agents

Paper • 2604.23781 • Published 24 days ago • 33

upvoted a paper 29 days ago

MultiWorld: Scalable Multi-Agent Multi-View Video World Models

Paper • 2604.18564 • Published about 1 month ago • 45

upvoted an article 30 days ago

NEO-unify: Building Native Multimodal Unified Models End to End

Article • sensenova • Mar 5 • 162

upvoted a paper 30 days ago

Elucidating the SNR-t Bias of Diffusion Probabilistic Models

Paper • 2604.16044 • Published Apr 17 • 74

upvoted 2 papers about 1 month ago

ClawGUI: A Unified Framework for Training, Evaluating, and Deploying GUI Agents

Paper • 2604.11784 • Published Apr 13 • 143

OmniShow: Unifying Multimodal Conditions for Human-Object Interaction Video Generation

Paper • 2604.11804 • Published Apr 13 • 72

upvoted 2 papers about 2 months ago

AI Can Learn Scientific Taste

Paper • 2603.14473 • Published Mar 15 • 426

Gen-Searcher: Reinforcing Agentic Search for Image Generation

Paper • 2603.28767 • Published Mar 30 • 58

upvoted a paper 2 months ago

Lost in Stories: Consistency Bugs in Long Story Generation by LLMs

Paper • 2603.05890 • Published Mar 6 • 93
