Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
aws-neuron
/
optimum-neuron-cache
like
18
Follow
AWS Inferentia and Trainium
101
License:
apache-2.0
Model card
Files
Files and versions
Community
425
[Cache Request] meta-llama/Llama-3.3-70B-Instruct
#378
by
probablyicco
- opened
Mar 19
Discussion
probablyicco
Mar 19
Please add the following model to the neuron cache
See translation
Edit
Preview
Upload images, audio, and videos by dragging in the text input, pasting, or
clicking here
.
Tap or paste here to upload images
Your need to confirm your account before you can post a new comment.
Comment
·
Sign up
or
log in
to comment