dfuhoiysOHSVFh82934gfjklb

huba-buba

AI & ML interests

None yet

Organizations

None yet

huba-buba's activity

liked a model 1 day ago

ai-sage/GigaChat-20B-A3B-instruct

Updated Dec 16, 2024 • 1.31k • 28

liked a dataset 1 day ago

HuggingFaceH4/numina-deepseek-r1-qwen-7b

Viewer • Updated 5 days ago • 40 • 100 • 10

liked a model 2 days ago

mistralai/Mistral-Nemo-Base-2407

Text Generation • Updated Nov 6, 2024 • 1.31M • 285

upvoted a paper 2 days ago

DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models

Paper • 2402.03300 • Published Feb 5, 2024 • 84

liked a model 3 days ago

deepseek-ai/Janus-Pro-7B

Any-to-Any • Updated 3 days ago • 79.3k • 2.07k

upvoted 2 papers 6 days ago

SRMT: Shared Memory for Multi-agent Lifelong Pathfinding

Paper • 2501.13200 • Published 8 days ago • 60

Sigma: Differential Rescaling of Query, Key and Value for Efficient Language Models

Paper • 2501.13629 • Published 7 days ago • 40

upvoted a paper 7 days ago

DynaSaur: Large Language Agents Beyond Predefined Actions

Paper • 2411.01747 • Published Nov 4, 2024 • 25

upvoted a paper 10 days ago

Evolving Deeper LLM Thinking

Paper • 2501.09891 • Published 14 days ago • 100

liked a model 12 days ago

Vikhrmodels/Vikhr-Gemma-2B-instruct-GGUF

Text Generation • Updated Aug 23, 2024 • 1.35k • 14

upvoted a paper 13 days ago

Towards Large Reasoning Models: A Survey of Reinforced Reasoning with Large Language Models

Paper • 2501.09686 • Published 14 days ago • 36

upvoted 2 papers 14 days ago

BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning

Paper • 2501.03226 • Published 24 days ago • 37

REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models

Paper • 2501.03262 • Published 27 days ago • 89

upvoted 2 articles 15 days ago

License to Call: Introducing Transformers Agents 2.0

Article • May 13, 2024 • 127

Introducing smolagents: simple agents that write actions in code.

Article • about 1 month ago • 536

liked a model 16 days ago

mistralai/Mistral-Nemo-Instruct-2407

Text Generation • Updated Nov 6, 2024 • 1.46M • 1.43k

upvoted a paper 16 days ago

O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning

Paper • 2501.06458 • Published 19 days ago • 29

liked a Space 16 days ago

MatchAnything

Running on Zero • 136

liked 2 datasets 16 days ago

Egor-AI/Dataset_of_Russian_thinking

Viewer • Updated 18 days ago • 147k • 181 • 11

HuggingFaceFW/fineweb

Viewer • Updated 27 days ago • 48.6B • 438k • 1.83k