Post 1571
Check out my collection of pre-made GGUF LoRA adapters! These let you use both the normal and the abliterated versions of popular models like Llama, Qwen, etc., without doubling VRAM usage.
ngxson/gguf_lora_collection
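A rough sketch of why an adapter avoids doubling memory: a LoRA adapter stores two small low-rank matrices per adapted weight instead of a second full copy of that weight. The layer shape and rank below are hypothetical values chosen for illustration; they are not taken from the adapters in this collection.

```python
def full_param_count(d_out: int, d_in: int) -> int:
    """Parameters in one full weight matrix of shape (d_out x d_in)."""
    return d_out * d_in


def lora_param_count(d_out: int, d_in: int, rank: int) -> int:
    """Parameters in one LoRA pair: B is (d_out x rank), A is (rank x d_in)."""
    return rank * (d_out + d_in)


# Hypothetical layer shape and adapter rank (assumptions, for illustration only).
d_out, d_in, rank = 4096, 4096, 32

full = full_param_count(d_out, d_in)
lora = lora_param_count(d_out, d_in, rank)

print(f"full copy of the weight: {full} parameters")
print(f"LoRA adapter pair:       {lora} parameters")
print(f"adapter / full ratio:    {lora / full:.4f}")
```

Under these assumed numbers, the adapter adds a small fraction of the parameters that a second full copy of the same matrix would need, which is the saving the post describes.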
Post 2284
I made this small tool that can be useful for debugging Ollama chat templates: ngxson/ollama_template_test
CC @bartowski, you may need this ;-)
Extracted LoRA (mergekit)
PEFT-compatible LoRA adapters produced by mergekit-extract-lora
📁 Extracted LoRA - GGUF version (Running • 2): Redirection to ggml-org collection
ngxson/LoRA-Qwen2.5-3B-Instruct-abliterated • Updated 8 days ago
ngxson/LoRA-Qwen2.5-7B-Instruct-abliterated-v3 • Updated 11 days ago
ngxson/LoRA-Qwen2.5-14B-Instruct-abliterated-v2 • Updated 11 days ago • 1
MiniThinky: extra small reasoning models
My first attempt at making reasoning models
🧠 Llama 3.2 Reasoning WebGPU (Running • 88): Small and powerful reasoning LLM that runs in your browser
ngxson/MiniThinky-v2-1B-Llama-3.2 • Text Generation • Updated 8 days ago • 5.05k • 29
ngxson/MiniThinky-v2-1B-Llama-3.2-Q8_0-GGUF • Updated 10 days ago • 251 • 5
ngxson/MiniThinky-1B-Llama-3.2 • Text Generation • Updated 9 days ago • 226 • 4