Nagori's picture

Nagori

MohammedNaeem

·

Naeem_1144

AI & ML interests

None yet

Recent Activity

upvoted a collection 1 day ago

OpenCodeReasoning-2

upvoted a collection 2 days ago

liked a model 2 days ago

anakin87/qwen-scheduler-7b-grpo

View all activity

Organizations

None yet

MohammedNaeem's activity

upvoted a collection 1 day ago

OpenCodeReasoning-2

Reasoning data for supervised finetuning of LLMs to advance code generation and critique • 4 items • Updated 1 day ago • 5

upvoted a collection 2 days ago

NextCoder

NextCoder family of code-editing LMs developed with Selective Knowledge Transfer and its training data. • 5 items • Updated about 2 hours ago • 38

upvoted a collection 5 days ago

DeepSeek-Prover

DeepSeek-Prover-Series • 10 items • Updated 5 days ago • 47

upvoted an article 25 days ago

Article

Finally, a Replacement for BERT: Introducing ModernBERT

Dec 19, 2024

• 622

upvoted a collection 26 days ago

Kimi-VL-A3B

Moonshot's efficient MoE VLMs, exceptional on agent, long-context, and thinking • 6 items • Updated 23 days ago • 66

upvoted a collection 30 days ago

Llama 4

Llama 4 release • 13 items • Updated 6 days ago • 475

upvoted an article about 1 month ago

Article

The NLP Course is becoming the LLM Course!

Apr 3

• 89

upvoted a collection 3 months ago

NeMo Curator - Classifier Models

Classifier models that can be used in NeMo Curator for labelling/filtering datasets. • 11 items • Updated 12 days ago • 16

upvoted an article 3 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.24k

upvoted a collection 3 months ago

Qwen2.5-VL

Vision-language model series based on Qwen2.5 • 11 items • Updated 7 days ago • 460

upvoted a collection 4 months ago

DeepSeek-V3

4 items • Updated Mar 25 • 246

upvoted a paper 4 months ago

Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning

Paper • 2412.15797 • Published Dec 20, 2024 • 18

upvoted 3 collections 4 months ago

QVQ

QVQ: Qwen models for visual reasoning • 7 items • Updated 7 days ago • 49

TimesFM Release

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting. • 4 items • Updated Apr 3 • 15

Eagle 2

Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. • 9 items • Updated 12 days ago • 36

upvoted a collection 7 months ago

Llama-3.1-Nemotron-70B

SOTA models on Arena Hard and RewardBench as of 1 Oct 2024. • 6 items • Updated 12 days ago • 155