Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
18
Follow
AWS Inferentia and Trainium
112
License:
apache-2.0
Model card
Files
Files and versions
Community
456
c2ebed7
optimum-neuron-cache
/
neuronxcc-2.15.128.0+56dc5a86
/
0_REGISTRY
/
0.0.25
/
inference
/
llama
/
meta-llama
/
Meta-Llama-3-70B
Commit History
Synchronizing local compiler cache.
3000e63
verified
dacorvo
HF Staff
commited on
Oct 1, 2024
Synchronizing local compiler cache.
a783063
verified
dacorvo
HF Staff
commited on
Oct 1, 2024