Mc Sp's picture

14 3

Mc Sp

mcsp

·

AI & ML interests

None yet

Recent Activity

upvoted a paper about 19 hours ago

Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers

upvoted a paper 2 days ago

Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving

upvoted a paper 4 days ago

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

View all activity

Organizations

mcsp's activity

upvoted a paper about 19 hours ago

Putting the Value Back in RL: Better Test-Time Scaling by Unifying LLM Reasoners With Verifiers

Paper • 2505.04842 • Published 3 days ago • 12

upvoted a paper 2 days ago

Beyond Theorem Proving: Formulation, Framework and Benchmark for Formal Problem-Solving

Paper • 2505.04528 • Published 4 days ago • 10

upvoted a paper 4 days ago

Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning

Paper • 2505.03318 • Published 5 days ago • 83

upvoted an article 5 days ago

Article

Vision Language Models Explained

Apr 11, 2024

• 328

upvoted 6 papers 7 days ago

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

Paper • 2504.15279 • Published 20 days ago • 73

Step1X-Edit: A Practical Framework for General Image Editing

Paper • 2504.17761 • Published 17 days ago • 88

Optimizing LLMs for Italian: Reducing Token Fertility and Enhancing Efficiency Through Vocabulary Adaptation

Paper • 2504.17025 • Published 18 days ago • 16

Skywork R1V2: Multimodal Hybrid Reinforcement Learning for Reasoning

Paper • 2504.16656 • Published 18 days ago • 55

CipherBank: Exploring the Boundary of LLM Reasoning Capabilities through Cryptography Challenges

Paper • 2504.19093 • Published 14 days ago • 16

In-Context Edit: Enabling Instructional Image Editing with In-Context Generation in Large Scale Diffusion Transformer

Paper • 2504.20690 • Published 12 days ago • 18

upvoted a paper 11 days ago

Reinforcement Learning for Reasoning in Large Language Models with One Training Example

Paper • 2504.20571 • Published 12 days ago • 90

upvoted a collection 12 days ago

Qwen3

37 items • Updated 2 days ago • 558

upvoted a paper 18 days ago

The Bitter Lesson Learned from 2,000+ Multilingual Benchmarks

Paper • 2504.15521 • Published 19 days ago • 63

upvoted a collection about 2 months ago

DeTikZify

Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ • 12 items • Updated Mar 19 • 26