Shuo-data-private

community

Activity Feed

AI & ML interests

None defined yet.

Recent Activity

vztu

submitted a paper to Daily Papers 2 days ago

PISCO: Precise Video Instance Insertion with Sparse Control

Paper • 2602.08277 • Published 6 days ago • 10

vztu

submitted a paper to Daily Papers 4 days ago

Agent Banana: High-Fidelity Image Editing with Agentic Thinking and Tooling

Paper • 2602.09084 • Published 5 days ago • 26

vztu

authored a paper about 1 month ago

Digital Twin AI: Opportunities and Challenges from Large Language Models to World Models

Paper • 2601.01321 • Published Jan 4 • 19

shuoxing

authored a paper 4 months ago

LLMs Can Get "Brain Rot"!

Paper • 2510.13928 • Published Oct 15, 2025 • 23

vztu

authored 5 papers 7 months ago

MMHU: A Massive-Scale Multimodal Benchmark for Human Behavior Understanding

Paper • 2507.12463 • Published Jul 16, 2025 • 27

SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems

Paper • 2506.07564 • Published Jun 9, 2025 • 6

mRAG: Elucidating the Design Space of Multi-modal Retrieval-Augmented Generation

Paper • 2505.24073 • Published May 29, 2025

GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution

Paper • 2505.00687 • Published May 1, 2025

4KAgent: Agentic Any Image to 4K Super-Resolution

Paper • 2507.07105 • Published Jul 9, 2025 • 106

shuoxing

authored 2 papers 8 months ago

mRAG: Elucidating the Design Space of Multi-modal Retrieval-Augmented Generation

Paper • 2505.24073 • Published May 29, 2025

Demystifying the Visual Quality Paradox in Multimodal Large Language Models

Paper • 2506.15645 • Published Jun 18, 2025 • 4

vztu

authored a paper 8 months ago

Demystifying the Visual Quality Paradox in Multimodal Large Language Models

Paper • 2506.15645 • Published Jun 18, 2025 • 4

shuoxing

authored a paper 8 months ago

SAFEFLOW: A Principled Protocol for Trustworthy and Transactional Autonomous Agent Systems

Paper • 2506.07564 • Published Jun 9, 2025 • 6

che111

authored a paper 9 months ago

SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning

Paper • 2506.01713 • Published Jun 2, 2025 • 48

vztu

authored 6 papers 9 months ago

AIS 2024 Challenge on Video Quality Assessment of User-Generated Content: Methods and Results

Paper • 2404.16205 • Published Apr 24, 2024

Team members 11