-
2.5 Years in Class: A Multimodal Textbook for Vision-Language Pretraining
Paper • 2501.00958 • Published • 95 -
ProgCo: Program Helps Self-Correction of Large Language Models
Paper • 2501.01264 • Published • 25 -
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction
Paper • 2501.01957 • Published • 40 -
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning
Paper • 2501.03226 • Published • 35
sergicalsix
sergicalsix
AI & ML interests
None yet
Recent Activity
updated
a collection
about 18 hours ago
2025 LLM Papers on Hugging Face with Japanese Memos
upvoted
a
paper
about 18 hours ago
MiniMax-01: Scaling Foundation Models with Lightning Attention
updated
a collection
about 18 hours ago
2025 LLM Papers on Hugging Face with Japanese Memos
Organizations
Collections
1
spaces
1
models
None public yet