Cotype Nano GGUF 🤖

Cotype-Nano-GGUF is a lightweight LLM that can run even on mobile devices.

Inference

```python
from llama_cpp import Llama

# Download the 8-bit GGUF weights from the Hugging Face Hub and load them
llm = Llama.from_pretrained(
    repo_id="MTSAIR/Cotype-Nano-GGUF",
    filename="cotype_nano_8bit.gguf",
)

# Run a chat completion and print the model's reply
response = llm.create_chat_completion(
    messages=[
        {
            "role": "user",
            "content": "What is the capital of France?"
        }
    ]
)
print(response["choices"][0]["message"]["content"])
```
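Because the model targets resource-constrained devices, you may prefer to stream tokens as they are generated rather than wait for the full reply. Below is a minimal sketch using llama-cpp-python's streaming interface; the sampling parameters (`max_tokens`, `temperature`) are illustrative values, not settings recommended by the model authors.

```python
from llama_cpp import Llama

llm = Llama.from_pretrained(
    repo_id="MTSAIR/Cotype-Nano-GGUF",
    filename="cotype_nano_8bit.gguf",
)

# stream=True yields OpenAI-style completion chunks as tokens are produced
stream = llm.create_chat_completion(
    messages=[{"role": "user", "content": "What is the capital of France?"}],
    max_tokens=128,   # illustrative cap; tune for your device
    temperature=0.7,  # illustrative sampling temperature
    stream=True,
)

for chunk in stream:
    delta = chunk["choices"][0]["delta"]
    if "content" in delta:
        print(delta["content"], end="", flush=True)
print()
```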
Model details

- Format: GGUF
- Model size: 1.54B params
- Architecture: qwen2
- Quantizations: 8-bit, 16-bit

