Hugging Face
Models
Datasets
Spaces
Community
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
18
Follow
AWS Inferentia and Trainium
111
License:
apache-2.0
Model card
Files
Files and versions
Community
456
dd8f183
optimum-neuron-cache
/
neuronxcc-2.15.128.0+56dc5a86
/
0_REGISTRY
/
0.0.25.dev0
/
inference
/
llama
/
meta-llama
/
Llama-3.1-70B-Instruct
Commit History
Synchronizing local compiler cache.
7b0e0b8
verified
dacorvo
HF Staff
commited on
Sep 28, 2024
Synchronizing local compiler cache.
6fc5857
verified
dacorvo
HF Staff
commited on
Sep 28, 2024