2 8 12

Yifan Song

Solaris99

AI & ML interests

None yet

Recent Activity

upvoted a paper 12 days ago

MiMo-VL Technical Report

liked a model 18 days ago

XiaomiMiMo/MiMo-VL-7B-SFT

liked a model 18 days ago

XiaomiMiMo/MiMo-VL-7B-RL

View all activity

Organizations

Solaris99's activity

upvoted a paper 12 days ago

MiMo-VL Technical Report

Paper • 2506.03569 • Published 13 days ago • 71

liked 2 models 18 days ago

XiaomiMiMo/MiMo-VL-7B-SFT

Image-Text-to-Text • Updated 10 days ago • 9.18k • 41

XiaomiMiMo/MiMo-VL-7B-RL

Image-Text-to-Text • Updated 10 days ago • 13.3k • 143

authored 4 papers about 1 month ago

VisualWebBench: How Far Have Multimodal LLMs Evolved in Web Page Understanding and Grounding?

Paper • 2404.05955 • Published Apr 9, 2024

The Good, The Bad, and The Greedy: Evaluation of LLMs Should Not Ignore Non-Determinism

Paper • 2407.10457 • Published Jul 15, 2024 • 25

AgentBank: Towards Generalized LLM Agents via Fine-Tuning on 50000+ Interaction Trajectories

Paper • 2410.07706 • Published Oct 10, 2024

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 79

upvoted 2 papers about 1 month ago

PiTe: Pixel-Temporal Alignment for Large Video-Language Model

Paper • 2409.07239 • Published Sep 11, 2024 • 15

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 79

commented a paper about 1 month ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 79 •

upvoted a paper 3 months ago

A Comprehensive Survey on Long Context Language Modeling

Paper • 2503.17407 • Published Mar 20 • 49

upvoted a collection 3 months ago

🧠 Reasoning datasets

Collection

Datasets with reasoning traces for math and code released by the community • 24 items • Updated 29 days ago • 151

upvoted a paper 3 months ago

MPO: Boosting LLM Agents with Meta Plan Optimization

Paper • 2503.02682 • Published Mar 4 • 27

liked a Space 4 months ago

2.69k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

published a dataset 4 months ago

Solaris99/rw_data

Preview • Updated Jan 18 • 4

updated 2 datasets 5 months ago

Solaris99/rw_data

Preview • Updated Jan 18 • 4

Solaris99/f82be

Updated Jan 12 • 5

upvoted 2 papers 8 months ago

MixEval-X: Any-to-Any Evaluations from Real-World Data Mixtures

Paper • 2410.13754 • Published Oct 17, 2024 • 76

Harnessing Webpage UIs for Text-Rich Visual Understanding

Paper • 2410.13824 • Published Oct 17, 2024 • 32

updated a dataset 8 months ago

Solaris99/AgentBank

Viewer • Updated Oct 10, 2024 • 53.2k • 574 • 12