Test organization

non-profit

AI & ML interests

None defined yet.

Recent Activity

KaituoFeng authored a paper 14 days ago

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

KaituoFeng authored a paper 15 days ago

SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward

KaituoFeng updated a dataset about 2 months ago

Testorganize/Evaluation-fkt

View all activity

Testorganize's activity

KaituoFeng

authored a paper 14 days ago

MME-Reasoning: A Comprehensive Benchmark for Logical Reasoning in MLLMs

Paper • 2505.21327 • Published 15 days ago • 82

KaituoFeng

authored a paper 15 days ago

SophiaVL-R1: Reinforcing MLLMs Reasoning with Thinking Reward

Paper • 2505.17018 • Published 20 days ago • 15

KaituoFeng

updated a dataset about 2 months ago

Testorganize/Evaluation-fkt

Viewer • Updated Apr 19 • 10.3k • 80

kxgong

authored a paper 3 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 79

KaituoFeng

authored a paper 3 months ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 79

BreakLee

authored 2 papers 3 months ago

VidEgoThink: Assessing Egocentric Video Understanding Capabilities for Embodied AI

Paper • 2410.11623 • Published Oct 15, 2024 • 49

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published Mar 27 • 79

KaituoFeng

published a dataset 3 months ago

Testorganize/Evaluation-fkt

Viewer • Updated Apr 19 • 10.3k • 80

KaituoFeng

updated a dataset 3 months ago

Testorganize/Video-fkt

Viewer • Updated Mar 10 • 61.2k • 24

Xidong

authored a paper 5 months ago

HuatuoGPT-o1, Towards Medical Complex Reasoning with LLMs

Paper • 2412.18925 • Published Dec 25, 2024 • 105

kxgong

authored a paper 6 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

KaituoFeng

authored a paper 6 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

BreakLee

authored a paper 6 months ago

AV-Odyssey Bench: Can Your Multimodal LLMs Really Understand Audio-Visual Information?

Paper • 2412.02611 • Published Dec 3, 2024 • 24

Xidong

authored a paper 8 months ago

Roadmap towards Superhuman Speech Understanding using Large Language Models

Paper • 2410.13268 • Published Oct 17, 2024 • 35

Xidong

authored 6 papers 9 months ago

Huatuo-26M, a Large-scale Chinese Medical QA Dataset

Paper • 2305.01526 • Published May 2, 2023

HuatuoGPT-II, One-stage Training for Medical Adaption of LLMs

Paper • 2311.09774 • Published Nov 16, 2023 • 1

CMB: A Comprehensive Medical Benchmark in Chinese

Paper • 2308.08833 • Published Aug 17, 2023 • 1

Apollo: Lightweight Multilingual Medical LLMs towards Democratizing Medical AI to 6B People

Paper • 2403.03640 • Published Mar 6, 2024 • 2

LLMs for Doctors: Leveraging Medical LLMs to Assist Doctors, Not Replace Them

Paper • 2406.18034 • Published Jun 26, 2024

LongLLaVA: Scaling Multi-modal LLMs to 1000 Images Efficiently via Hybrid Architecture

Paper • 2409.02889 • Published Sep 4, 2024 • 55