Hao Fei

scofield7419

9 17 10

http://haofei.vip/

AI & ML interests

Multimodal Learning, Large Language Model, Vision and Language, Natural Language Processing, Structural Modeling

Recent Activity

published a dataset 4 days ago

mental-world-model/menti-bench

updated a dataset 4 days ago

mental-world-model/menti-bench

authored a paper 3 months ago

CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models

View all activity

Organizations

commented a paper 9 months ago

UniVA: Universal Video Agent towards Open-Source Next-Generation Video Generalist

Paper • 2511.08521 • Published Nov 11, 2025 • 39 •

New activity in huggingface/HuggingDiscussions 9 months ago

[FEEDBACK] Daily Papers

🔥❤️ 21

207

#32 opened about 2 years ago by

kramp

commented 4 papers about 1 year ago

commented 8 papers over 1 year ago

JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

Paper • 2503.23377 • Published Mar 30, 2025 • 57 •

JavisDiT: Joint Audio-Video Diffusion Transformer with Hierarchical Spatio-Temporal Prior Synchronization

Paper • 2503.23377 • Published Mar 30, 2025 • 57 •

Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Paper • 2412.19806 • Published Oct 8, 2024 • 2 •

Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Paper • 2412.19806 • Published Oct 8, 2024 • 2 •

Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Paper • 2412.19806 • Published Oct 8, 2024 • 2 •

Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Paper • 2412.19806 • Published Oct 8, 2024 • 2 •

Vitron: A Unified Pixel-level Vision LLM for Understanding, Generating, Segmenting, Editing

Paper • 2412.19806 • Published Oct 8, 2024 • 2 •

RetrieveGPT: Merging Prompts and Mathematical Models for Enhanced Code-Mixed Information Retrieval

Paper • 2411.04752 • Published Nov 7, 2024 • 16 •

commented a paper almost 3 years ago

NExT-GPT: Any-to-Any Multimodal LLM

Paper • 2309.05519 • Published Sep 11, 2023 • 79 •

Hao Fei

AI & ML interests

Recent Activity

Organizations

scofield7419's activity

[FEEDBACK] Daily Papers