shivash/hybrid-transformer-276m-v2

Tags: Text Generation, PyTorch, English, hybrid_transformer_v2, transformer, causal-lm, gqa, grouped-query-attention, memory-efficient, llama, qwen, hybrid, v2, fixed-architecture, custom_code

Gated model

This repository is publicly accessible, but you must agree to share your contact information and accept the conditions before you can access its files and content. Until then, you can list the files but not access them.

Preview of files found in this repository
  • .gitattributes
    1.52 kB
    initial commit 21 days ago
  • README.md
    2.89 kB
    Upload README.md with huggingface_hub 21 days ago
  • config.json
    965 Bytes
    Upload config.json with huggingface_hub 21 days ago
  • configuration_hybrid_transformer_v2.py
    1.66 kB
    Upload configuration_hybrid_transformer_v2.py with huggingface_hub 21 days ago
  • merges.txt
    334 Bytes
    Upload merges.txt with huggingface_hub 21 days ago
  • modeling_hybrid_transformer_v2.py
    2.66 kB
    Upload modeling_hybrid_transformer_v2.py with huggingface_hub 21 days ago
  • pytorch_model.bin
    1.34 GB
    Upload pytorch_model.bin with huggingface_hub 21 days ago
    Detected Pickle imports (3): "collections.OrderedDict", "torch._utils._rebuild_tensor_v2", "torch.FloatStorage"
  • tokenizer.json
    998 kB
    Upload tokenizer.json with huggingface_hub 21 days ago
  • tokenizer_config.json
    1.49 kB
    Upload tokenizer_config.json with huggingface_hub 21 days ago
  • vocab.json
    868 kB
    Upload vocab.json with huggingface_hub 21 days ago
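
The pickle imports reported for pytorch_model.bin in the listing above are the module-level names a pickle stream asks Python to import when it is loaded. As a rough illustration of how such a list can be derived without executing the pickle, the sketch below walks the opcodes with the standard-library pickletools module. The helper name list_pickle_imports is hypothetical, the demo pickles a small OrderedDict rather than the real checkpoint, and only the older GLOBAL opcode is handled (protocol 4 and later use STACK_GLOBAL, which this sketch ignores). A real checkpoint's pickle stream would also have to be extracted first; that step is assumed and omitted here.

```python
import collections
import pickle
import pickletools


def list_pickle_imports(data: bytes) -> list:
    """Return the sorted 'module.name' targets of GLOBAL opcodes in a pickle.

    Hypothetical helper for illustration only. The stream is scanned with
    pickletools.genops, so nothing in the pickle is imported or executed.
    STACK_GLOBAL (protocol 4+) is deliberately not handled.
    """
    imports = set()
    for opcode, arg, _pos in pickletools.genops(data):
        if opcode.name == "GLOBAL":
            # pickletools reports the argument as "module name".
            module, name = arg.split(" ", 1)
            imports.add(f"{module}.{name}")
    return sorted(imports)


if __name__ == "__main__":
    # Demo payload: protocol 2 emits GLOBAL opcodes; fix_imports=False
    # keeps the Python 3 module names unchanged.
    payload = pickle.dumps(
        collections.OrderedDict(a=1), protocol=2, fix_imports=False
    )
    print(list_pickle_imports(payload))
```

Scanning opcodes rather than calling pickle.load is the point of the design: the import list can be reviewed before deciding whether the file is safe to load.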