Pavlo Molchanov's picture

Pavlo Molchanov PRO

pmolchanov

·

https://www.pmolchanov.com

AI & ML interests

Efficiency in Multi-Modal LLMs

Recent Activity

upvoted a paper 1 day ago

Scaling RL to Long Videos

upvoted a collection 24 days ago

upvoted an article 26 days ago

Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B

View all activity

Organizations

upvoted a paper 1 day ago

Scaling RL to Long Videos

Paper • 2507.07966 • Published 3 days ago • 110

upvoted a collection 24 days ago

Nemotron-H

Mamba-Transformer hybrid models • 10 items • Updated 2 days ago • 29

upvoted an article 26 days ago

Article

Supercharge Edge AI with High Accuracy Reasoning Using Llama Nemotron Nano 4B

By

and 3 others •

Jun 10

• 6

upvoted a paper about 2 months ago

AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning

Paper • 2505.16400 • Published May 22 • 33

upvoted a collection about 2 months ago

Llama Nemotron

Open, Production-ready Enterprise Models • 8 items • Updated 2 days ago • 60

upvoted 5 papers 3 months ago

CLIMB: CLustering-based Iterative Data Mixture Bootstrapping for Language Model Pre-training

Paper • 2504.13161 • Published Apr 17 • 92

Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning

Paper • 2504.11409 • Published Apr 15 • 10

Efficient Reasoning Models: A Survey

Paper • 2504.10903 • Published Apr 15 • 18

Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models

Paper • 2504.03624 • Published Apr 4 • 13

One-Minute Video Generation with Test-Time Training

Paper • 2504.05298 • Published Apr 7 • 106

upvoted a paper 4 months ago

Scaling Vision Pre-Training to 4K Resolution

Paper • 2503.19903 • Published Mar 25 • 42

upvoted 2 articles 5 months ago

Article

Bamba: Inference-Efficient Hybrid Mamba2 Model

By

and 28 others •

Dec 18, 2024

• 57

Article

Finally, a Replacement for BERT: Introducing ModernBERT

By

and 14 others •

Dec 19, 2024

• 664

upvoted a paper 5 months ago

SANA 1.5: Efficient Scaling of Training-Time and Inference-Time Compute in Linear Diffusion Transformer

Paper • 2501.18427 • Published Jan 30 • 20

upvoted a collection 6 months ago

Cosmos

The collection of Cosmos models • 31 items • Updated 2 days ago • 292

upvoted 2 papers 7 months ago

NVILA: Efficient Frontier Visual Language Models

Paper • 2412.04468 • Published Dec 5, 2024 • 60

Cautious Optimizers: Improving Training with One Line of Code

Paper • 2411.16085 • Published Nov 25, 2024 • 21

upvoted a collection 8 months ago

Hymba

A series of Hybrid Small Language Models. • 3 items • Updated 2 days ago • 30

upvoted 2 papers 8 months ago

Star Attention: Efficient LLM Inference over Long Sequences

Paper • 2411.17116 • Published Nov 26, 2024 • 55

Hymba: A Hybrid-head Architecture for Small Language Models

Paper • 2411.13676 • Published Nov 20, 2024 • 46