aws-neuron/optimum-neuron-cache
AWS Inferentia and Trainium
License: apache-2.0
e3ea9d7
optimum-neuron-cache/neuronxcc-2.13.66.0+6dfecc895/0_REGISTRY/0.0.23.dev0/inference/llama
3 contributors
History: 7 commits
Latest commit: dacorvo (HF Staff), "Synchronizing local compiler cache.", 6ae4846, verified, 11 months ago
Name            Last commit message                   Last updated
HuggingFaceTB   Synchronizing local compiler cache.   11 months ago
NousResearch    Synchronizing local compiler cache.   12 months ago
dacorvo         Synchronizing local compiler cache.   12 months ago
meta-Llama      Synchronizing local compiler cache.   12 months ago
meta-llama      Synchronizing local compiler cache.   12 months ago
princeton-nlp   Synchronizing local compiler cache.   11 months ago