1 6 1

zez

Everything-is-Ok

everythingez

AI & ML interests

None yet

Recent Activity

published a bucket about 1 month ago

NextGenWhu/Annotation-storage

updated a Space about 1 month ago

NextGenWhu/Annotation

updated a Space about 1 month ago

NextGenWhu/DITING-leaderboard

View all activity

Organizations

published a bucket about 1 month ago

NextGenWhu/Annotation-storage

1.39 MB

updated 2 Spaces about 1 month ago

Annotation

🟧

Label data with an open‑source annotation tool

DITING Leaderboard

📊

Explore model performance with interactive radar charts

upvoted a paper 5 months ago

MentraSuite: Post-Training Large Language Models for Mental Health Reasoning and Assessment

Paper • 2512.09636 • Published Dec 10, 2025 • 26

upvoted 2 papers 7 months ago

DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation

Paper • 2510.09116 • Published Oct 10, 2025 • 97

Spatial Forcing: Implicit Spatial Representation Alignment for Vision-language-action Model

Paper • 2510.12276 • Published Oct 14, 2025 • 149

liked a Space 7 months ago

DITING Leaderboard

📊

Explore model performance with interactive radar charts

published a Space 7 months ago

DITING Leaderboard

📊

Explore model performance with interactive radar charts

commented a paper 7 months ago

DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation

Paper • 2510.09116 • Published Oct 10, 2025 • 97 •

authored a paper 7 months ago

DITING: A Multi-Agent Evaluation Framework for Benchmarking Web Novel Translation

Paper • 2510.09116 • Published Oct 10, 2025 • 97

updated a dataset 7 months ago

NextGenWhu/DITING

Preview • Updated Oct 14, 2025 • 17 • 1

published a dataset 7 months ago

NextGenWhu/DITING

Preview • Updated Oct 14, 2025 • 17 • 1

upvoted a paper 9 months ago

From Scores to Skills: A Cognitive Diagnosis Framework for Evaluating Financial Large Language Models

Paper • 2508.13491 • Published Aug 19, 2025 • 59

upvoted a paper 11 months ago

MultiFinBen: A Multilingual, Multimodal, and Difficulty-Aware Benchmark for Financial LLM Evaluation

Paper • 2506.14028 • Published Jun 16, 2025 • 94

upvoted a paper about 1 year ago

FinAudio: A Benchmark for Audio Large Language Models in Financial Applications

Paper • 2503.20990 • Published Mar 26, 2025 • 19

zez

AI & ML interests

Recent Activity

Organizations

Everything-is-Ok's activity

NextGenWhu/Annotation-storage

Annotation

DITING Leaderboard

DITING Leaderboard

DITING Leaderboard