Zhe Cao

MichaelCaoo

AI & ML interests

None yet

Recent Activity

authored a paper 26 days ago

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

Paper • 2512.21094 • Published Dec 24, 2025 • 25

New activity in NJU-LINK/T2AV-Compass 30 days ago

Update README.md

#5 opened 30 days ago by MichaelCaoo

Upload 0000.parquet

#4 opened 30 days ago by MichaelCaoo

upvoted a paper about 1 month ago

T2AV-Compass: Towards Unified Evaluation for Text-to-Audio-Video Generation

Paper • 2512.21094 • Published Dec 24, 2025 • 25

New activity in NJU-LINK/T2AV-Compass about 1 month ago

Upload prompts_with_checklist.json

#1 opened about 1 month ago by MichaelCaoo

upvoted a paper about 1 month ago

ViDiC: Video Difference Captioning

Paper • 2512.03405 • Published Dec 3, 2025 • 28

upvoted 3 papers about 2 months ago

How Far Are We from Genuinely Useful Deep Research Agents?

Paper • 2512.01948 • Published Dec 1, 2025 • 56

From Code Foundation Models to Agents and Applications: A Practical Guide to Code Intelligence

Paper • 2511.18538 • Published Nov 23, 2025 • 294

Video Generation Models Are Good Latent Reward Models

Paper • 2511.21541 • Published Nov 26, 2025 • 45

authored a paper about 2 months ago

OmniVideoBench: Towards Audio-Visual Understanding Evaluation for Omni MLLMs

Paper • 2510.10689 • Published Oct 12, 2025 • 47

updated a model 2 months ago

MichaelCaoo/RoboTwin_DP3_ckpt

Updated Nov 12, 2025

upvoted a paper 2 months ago

MVU-Eval: Towards Multi-Video Understanding Evaluation for Multimodal LLMs

Paper • 2511.07250 • Published Nov 10, 2025 • 18

upvoted a paper 3 months ago

UniAVGen: Unified Audio and Video Generation with Asymmetric Cross-Modal Interactions

Paper • 2511.03334 • Published Nov 5, 2025 • 53

published a model 3 months ago

MichaelCaoo/RoboTwin_DP3_ckpt

Updated Nov 12, 2025

upvoted 6 papers 3 months ago

EditScore: Unlocking Online RL for Image Editing via High-Fidelity Reward Modeling

Paper • 2509.23909 • Published Sep 28, 2025 • 33

Robot Learning: A Tutorial

Paper • 2510.12403 • Published Oct 14, 2025 • 121

A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning

Paper • 2510.15444 • Published Oct 17, 2025 • 148

IF-VidCap: Can Video Caption Models Follow Instructions?

Paper • 2510.18726 • Published Oct 21, 2025 • 26

MT-Video-Bench: A Holistic Video Understanding Benchmark for Evaluating Multimodal LLMs in Multi-Turn Dialogues

Paper • 2510.17722 • Published Oct 20, 2025 • 20

AI for Service: Proactive Assistance with AI Glasses

Paper • 2510.14359 • Published Oct 16, 2025 • 75
