RuAR

RachidAR · RachidARx

AI & ML interests

1.58 bit LLM

Recent Activity

liked a model 12 days ago

ResembleAI/chatterbox

liked a model 12 days ago

deepseek-ai/DeepSeek-R1-0528-Qwen3-8B

liked a model 14 days ago

deepseek-ai/DeepSeek-R1-0528

RachidAR's activity

upvoted a collection 21 days ago

Falcon-H1

Falcon-H1 Family of Hybrid-Head Language Models, including 0.5B, 1.5B, 1.5B-Deep, 3B, 7B, and 34B (pretrained and instruction-tuned). • 37 items • Updated 21 days ago • 39

upvoted a collection 25 days ago

Granite 4.0 Language Models

2 items • Updated May 2 • 13

upvoted a collection 26 days ago

Falcon Edge series

A series of powerful, universal and fine-tunable small Language Models • 7 items • Updated 21 days ago • 22

upvoted a paper about 1 month ago

Absolute Zero: Reinforced Self-play Reasoning with Zero Data

Paper • 2505.03335 • Published May 6 • 170

upvoted an article about 1 month ago

Comparing sub 50GB Llama 4 Scout quants (KLD/Top P)

Article • Apr 9 • 40

upvoted a collection about 1 month ago

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 65 items • Updated 12 days ago • 150

upvoted an article about 1 month ago

Uncensor any LLM with abliteration

Article • Jun 13, 2024 • 608

upvoted a collection about 1 month ago

Qwen3

40 items • Updated 21 days ago • 748

upvoted 5 collections about 2 months ago

blt

4 items • Updated Apr 17 • 23

Skywork-OR1

Skywork Open Reasoner 1 • 11 items • Updated 13 days ago • 29

BitNet

🔥BitNet family of large language models (1-bit LLMs). • 7 items • Updated May 1 • 42

Granite Experiments

Experimental projects under consideration for the Granite family. • 17 items • Updated 8 days ago • 12

GLM-4-0414

GLM-4-0414 series model • 8 items • Updated Apr 15 • 125

upvoted 2 collections 2 months ago

Granite 3.3 Language Models

Our latest language models licensed under Apache 2.0 license. • 4 items • Updated May 2 • 34

Cogito v1 Preview

5 items • Updated Apr 8 • 111

upvoted a paper 3 months ago

ParetoQ: Scaling Laws in Extremely Low-bit LLM Quantization

Paper • 2502.02631 • Published Feb 4 • 4

upvoted a collection 3 months ago

Gemma 3 Release

24 items • Updated 12 days ago • 384

upvoted 3 papers 4 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 160

Titans: Learning to Memorize at Test Time

Paper • 2501.00663 • Published Dec 31, 2024 • 24

BitNet: Scaling 1-bit Transformers for Large Language Models

Paper • 2310.11453 • Published Oct 17, 2023 • 103