Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
26
Follow
AWS Inferentia and Trainium
142
License:
apache-2.0
Model card
Files
Files and versions
xet
Community
586
53085a9
optimum-neuron-cache
/
neuronxcc-2.16.372.0+4a9b2326
/
0_REGISTRY
/
0.1.0
/
inference
/
llama
30.5 kB
4 contributors
History:
27 commits
jburtoft
Synchronizing local compiler cache.
488c9e1
verified
5 months ago
TinyLlama
Synchronizing local compiler cache.
6 months ago
deepseek-ai
Synchronizing local compiler cache.
8 months ago
meta-llama
Synchronizing local compiler cache.
5 months ago
princeton-nlp
Synchronizing local compiler cache.
8 months ago
unsloth
Synchronizing local compiler cache.
8 months ago