Unsloth Dynamic 2.0 Quants Collection New 2.0 version of our Dynamic GGUF + Quants. Dynamic 2.0 achieves superior accuracy & SOTA quantization performance. • 46 items • Updated 1 day ago • 195
Article Groq on Hugging Face Inference Providers 🔥 By sbrandeis and 4 others • Jun 16 • 42
QwenLong-L1: Towards Long-Context Large Reasoning Models with Reinforcement Learning Paper • 2505.17667 • Published May 23 • 89
Paper2Code: Automating Code Generation from Scientific Papers in Machine Learning Paper • 2504.17192 • Published Apr 24 • 114
TaoAvatar: Real-Time Lifelike Full-Body Talking Avatars for Augmented Reality via 3D Gaussian Splatting Paper • 2503.17032 • Published Mar 21 • 27
Article Open-source DeepResearch – Freeing our search agents By m-ric and 4 others • Feb 4 • 1.28k
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning Paper • 2501.06458 • Published Jan 11 • 32
Granite 3.1 Language Models Collection A series of language models with 128K context length, trained by IBM and released under the Apache 2.0 license. • 9 items • Updated May 2 • 63
Article 🐺🐦‍⬛ LLM Comparison/Test: 25 SOTA LLMs (including QwQ) through 59 MMLU-Pro CS benchmark runs By wolfram • Dec 4, 2024 • 79
BloombergGPT: A Large Language Model for Finance Paper • 2303.17564 • Published Mar 30, 2023 • 26
🍓 Ichigo v0.4 Collection An experimental model family designed to train LLMs to understand sound natively. • 3 items • Updated Apr 22 • 8
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 85
Llama 3.2 3B & 1B GGUF Quants Collection Llama.cpp-compatible quants for the Llama 3.2 3B and 1B Instruct models. • 4 items • Updated Sep 26, 2024 • 46
Moshi v0.1 Release Collection MLX, Candle & PyTorch model checkpoints released as part of the Moshi release from Kyutai. Run inference via: https://github.com/kyutai-labs/moshi • 15 items • Updated Apr 18 • 236
Jamba 1.5 Collection The AI21 Jamba family of models comprises state-of-the-art, hybrid SSM-Transformer instruction-following foundation models. • 2 items • Updated Mar 6 • 87