Eugene Klimov

Slach

AI & ML interests

None yet

Recent Activity

new activity about 8 hours ago

noctrex/Qwen3-Next-80B-A3B-Instruct-MXFP4_MOE-GGUF:is this model still can use 1M context window or not?

updated a collection 2 days ago

usefull opensource models

Organizations

None yet

upvoted a collection 24 days ago

GigaAM

Foundational Model for Speech Recognition Tasks • 1 item • Updated Nov 26, 2025 • 2

upvoted a paper 24 days ago

Wikontic: Constructing Wikidata-Aligned, Ontology-Aware Knowledge Graphs with Large Language Models

Paper • 2512.00590 • Published Nov 29, 2025 • 47

upvoted a paper 27 days ago

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

Paper • 2512.10430 • Published Dec 11, 2025 • 113

upvoted an article about 1 month ago

Article

CPU Optimized Embeddings with 🤗 Optimum Intel and fastRAG

Mar 15, 2024 • 14

upvoted a collection about 1 month ago

Cerebras REAP

Sparse MoE models compressed using REAP (Router-weighted Expert Activation Pruning) method • 22 items • Updated 6 days ago • 84

upvoted an article about 1 month ago

Article

We Got Claude to Fine-Tune an Open Source LLM

Dec 4, 2025 • 576

upvoted a collection about 2 months ago

Ministral 3

Mistral Ministral 3: new multimodal models in Base, Instruct, and Reasoning variants, available in 3B, 8B, and 14B sizes. • 36 items • Updated 25 days ago • 27

upvoted a collection 4 months ago

usefull opensource models

98 items • Updated 2 days ago • 1

upvoted a collection 5 months ago

VibeVoice

Frontier Text-to-Speech Models https://microsoft.github.io/VibeVoice/ • 8 items • Updated Dec 4, 2025 • 186

upvoted a paper 5 months ago

Deep Think with Confidence

Paper • 2508.15260 • Published Aug 21, 2025 • 90

upvoted 2 papers 6 months ago

∇NABLA: Neighborhood Adaptive Block-Level Attention

Paper • 2507.13546 • Published Jul 17, 2025 • 124

T-LoRA: Single Image Diffusion Model Customization Without Overfitting

Paper • 2507.05964 • Published Jul 8, 2025 • 119

upvoted a collection 8 months ago

Qwen3

84 items • Updated 18 days ago • 1.57k

upvoted an article 9 months ago

Article

CircleGuardBench: New Standard for Evaluating AI Moderation Models

May 7, 2025 • 59

upvoted 2 collections 9 months ago

Qwen3

Qwen's new Qwen3 models. In Unsloth Dynamic 2.0, GGUF, 4-bit and 16-bit Safetensor formats. Includes 128K Context Length variants. • 79 items • Updated 25 days ago • 252

Qwen 2.5 Coder

Complete collection of Code-specific model series for Qwen2.5 in bnb 4bit, 16bit and GGUF formats. • 35 items • Updated 25 days ago • 36

upvoted 2 papers 10 months ago

When Less is Enough: Adaptive Token Reduction for Efficient Image Representation

Paper • 2503.16660 • Published Mar 20, 2025 • 72

Feature-Level Insights into Artificial Text Detection with Sparse Autoencoders

Paper • 2503.03601 • Published Mar 5, 2025 • 232

upvoted 2 papers 11 months ago

GHOST 2.0: generative high-fidelity one shot transfer of heads

Paper • 2502.18417 • Published Feb 25, 2025 • 67

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20, 2025 • 174