Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
18
Follow
AWS Inferentia and Trainium
112
License:
apache-2.0
Model card
Files
Files and versions
Community
456
3000e63
optimum-neuron-cache
/
neuronxcc-2.15.128.0+56dc5a86
/
0_REGISTRY
/
0.0.25
/
inference
/
llama
/
meta-llama
/
Llama-3.1-70B-Instruct
Commit History
Synchronizing local compiler cache.
afe4c16
verified
dacorvo
HF Staff
commited on
Oct 1, 2024
Synchronizing local compiler cache.
80c6862
verified
dacorvo
HF Staff
commited on
Oct 1, 2024