1 25 5

Maksym Riabov

MRiabov

https://github.com/MRiabov

MRiabov

AI & ML interests

LLMs and CV

Recent Activity

upvoted a paper 4 days ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

upvoted a paper 5 days ago

AutoPR: Let's Automate Your Academic Promotion!

liked a model 6 days ago

deepseek-ai/DeepSeek-V3.2-Exp

View all activity

Organizations

None yet

upvoted a paper 4 days ago

QeRL: Beyond Efficiency -- Quantization-enhanced Reinforcement Learning for LLMs

Paper • 2510.11696 • Published 5 days ago • 154

upvoted a paper 5 days ago

AutoPR: Let's Automate Your Academic Promotion!

Paper • 2510.09558 • Published 8 days ago • 48

upvoted 3 papers 9 days ago

upvoted a paper 10 days ago

Fast-dLLM v2: Efficient Block-Diffusion LLM

Paper • 2509.26328 • Published 18 days ago • 47

upvoted a paper 15 days ago

VLA-R1: Enhancing Reasoning in Vision-Language-Action Models

Paper • 2510.01623 • Published 16 days ago • 7

upvoted 2 papers 17 days ago

dParallel: Learnable Parallel Decoding for dLLMs

Paper • 2509.26488 • Published 18 days ago • 19

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published 18 days ago • 483

upvoted a paper 18 days ago

Democratizing AI scientists using ToolUniverse

Paper • 2509.23426 • Published 21 days ago • 38

upvoted 2 papers 19 days ago

Real-Time Object Detection Meets DINOv3

Paper • 2509.20787 • Published 23 days ago • 10

MinerU2.5: A Decoupled Vision-Language Model for Efficient High-Resolution Document Parsing

Paper • 2509.22186 • Published 22 days ago • 118

upvoted 2 papers 20 days ago

SWE-QA: Can Language Models Answer Repository-level Code Questions?

Paper • 2509.14635 • Published about 1 month ago • 36

Reinforcement Learning on Pre-Training Data

Paper • 2509.19249 • Published 25 days ago • 66

upvoted a paper 21 days ago

TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them

Paper • 2509.21117 • Published 23 days ago • 29

upvoted 4 papers about 1 month ago

The Illusion of Diminishing Returns: Measuring Long Horizon Execution in LLMs

Paper • 2509.09677 • Published Sep 11 • 33

Parallel-R1: Towards Parallel Thinking via Reinforcement Learning

Paper • 2509.07980 • Published Sep 9 • 98

Reinforced Visual Perception with Tools

Paper • 2509.01656 • Published Sep 1 • 31

Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models

Paper • 2509.06949 • Published Sep 8 • 56

upvoted an article about 1 month ago

Article

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face

•

Feb 11

• 76

Maksym Riabov

AI & ML interests

Recent Activity

Organizations

MRiabov's activity

Fine-Tuning Your First Large Language Model (LLM) with PyTorch and Hugging Face