aws-neuron/optimum-neuron-cache
License: apache-2.0
f4bea70
optimum-neuron-cache/neuronxcc-2.15.128.0+56dc5a86/0_REGISTRY/0.0.27.dev0/inference/llama/meta-llama/Llama-3.1-70B-Instruct
Commit History
Synchronizing local compiler cache.
444f034 (verified), dacorvo (HF Staff), committed on Nov 19, 2024
Synchronizing local compiler cache.
421a895 (verified), dacorvo (HF Staff), committed on Nov 19, 2024