Manuel Romero PRO

mrm8488

https://mrm8488.github.io

AI & ML interests

#AI Research and Democratization. NLP/NLG 🤗

Recent Activity

liked a Space 5 days ago

AIEnergyScore/Leaderboard

liked a Space 5 days ago

AIEnergyScore/submission_portal

upvoted a paper about 10 hours ago

SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning

Paper • 2506.24119 • Published 5 days ago • 39

upvoted a collection about 1 month ago

Holo1

10 items • Updated Jun 3 • 1

upvoted 2 articles about 1 month ago

Train 400x faster Static Embedding Models with Sentence Transformers

Article • Jan 15 • 195

nanoVLM: The simplest repository to train your VLM in pure PyTorch

Article • 7 authors • May 21 • 181

upvoted a paper about 2 months ago

MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining

Paper • 2505.07608 • Published May 12 • 80

upvoted a paper 2 months ago

The Leaderboard Illusion

Paper • 2504.20879 • Published Apr 29 • 70

upvoted an article 2 months ago

I trained a Language Model to schedule events with GRPO!

Article • Apr 29 • 80

upvoted a paper 2 months ago

TTRL: Test-Time Reinforcement Learning

Paper • 2504.16084 • Published Apr 22 • 117

upvoted an article 2 months ago

Tiny Agents: a MCP-powered agent in 50 lines of code

Article • Apr 25 • 284

upvoted a paper 3 months ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 154

upvoted a collection 3 months ago

Orpheus Multilingual Research Release

Beta Release of multilingual models. • 12 items • Updated Apr 10 • 90

upvoted an article 3 months ago

You could have designed state of the art positional encoding

Article • Nov 25, 2024 • 307

upvoted 2 collections 4 months ago

Scaling Laws 📏

4 items • Updated Oct 15, 2024 • 3

🤖 Agents

21 items • Updated Dec 31, 2024 • 160

upvoted a paper 4 months ago

START: Self-taught Reasoner with Tools

Paper • 2503.04625 • Published Mar 6 • 114

upvoted 2 collections 4 months ago

🧠 Reasoning datasets

Datasets with reasoning traces for math and code released by the community • 24 items • Updated May 19 • 154

👩‍💻 OlympicCoder

Reasoning datasets and models for competitive coding • 4 items • Updated May 13 • 19

upvoted an article 4 months ago

Open R1: Update #3

Article • 10 authors • Mar 11 • 293

upvoted a collection 4 months ago

olmOCR

olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 4 items • Updated 17 days ago • 117

upvoted an article 4 months ago

FastRTC: The Real-Time Communication Library for Python

Article • 2 authors • Feb 25 • 169