optimum-neuron-cache / inference-cache-config

Commit History

Update inference-cache-config/qwen.json
8d1681e
verified

dacorvo HF Staff commited on

Create qwen.json
a166114
verified

dacorvo HF Staff commited on

Add batch size 4 configurations for LLama 1B and 3B models
3b6312a
verified

dacorvo HF Staff commited on

Rename inference-cache-config/pixart_sigma_xl_512x512.json to inference-cache-config/pixart-sigma-xl-512x512.json
1d662ce
verified

Jingya HF Staff commited on

Create pixart_sigma_xl_512x512.json
13f78a6
verified

Jingya HF Staff commited on

Rename inference-cache-config/pixart-xl-2-512x512.json to inference-cache-config/pixart-alpha-xl-512x512.json
cb11624
verified

Jingya HF Staff commited on

Rename inference-cache-config/pixArt-XL-2-512x512.json to inference-cache-config/pixart-xl-2-512x512.json
c7f992d
verified

Jingya HF Staff commited on

Create pixArt-XL-2-512x512.json
600ade6
verified

Jingya HF Staff commited on

Create sdxl-turbo.json
591ea52
verified

Jingya HF Staff commited on

Create stable-diffusion-xl-refiner-1.0.json
aa72a1a
verified

Jingya HF Staff commited on

Create stable-diffusion-xl-base-1.0.json
1554744
verified

Jingya HF Staff commited on

Create stable-diffusion-2-1.json
e4e1333
verified

Jingya HF Staff commited on

Rename inference-cache-config/diffusion.json to inference-cache-config/stable-diffusion-v1-5.json
4a034bb
verified

Jingya HF Staff commited on

add pixart and remove deprecated
e5f06c7
verified

Jingya HF Staff commited on

Added TinyLlama as requested by Jim burtoft
d9640f4
verified

dacorvo HF Staff commited on

Add phi4 cached configurations
c564534
verified

dacorvo HF Staff commited on

Add DeepSeek distilled versions of LLama 8B
509e6bf
verified

dacorvo HF Staff commited on

Add DeepSeek distilled model
4d1e615
verified

dacorvo HF Staff commited on

Update inference-cache-config/qwen2.5-large.json
84982b8
verified

dacorvo HF Staff commited on

Add DeepSeek distilled models
f4f3dcd
verified

dacorvo HF Staff commited on

Update inference-cache-config/mistral.json
01e1fe9
verified

dacorvo HF Staff commited on

Update inference-cache-config/mistral.json
7191bac
verified

dacorvo HF Staff commited on

Update inference-cache-config/mistral.json
6b536dc
verified

dacorvo HF Staff commited on

Add configuration for granite models
687da09
verified

dacorvo HF Staff commited on

Rename inference-cache-config/qwen-2.5-large.json to inference-cache-config/qwen2.5-large.json
2aa52ac
verified

dacorvo HF Staff commited on

Create qwen-2.5-large.json
a8df9db
verified

dacorvo HF Staff commited on

Rename inference-cache-config/qwen2.5 to inference-cache-config/qwen2.5.json
b9f1fde
verified

dacorvo HF Staff commited on

Add qwen2.5 config for models up to 14B params
4e25bb0
verified

dacorvo HF Staff commited on

Remove obsolete mistral variants
e60c569
verified

dacorvo HF Staff commited on

Remove obsolete llama variants
eee32f0
verified

dacorvo HF Staff commited on

Rename inference-cache-config/Llama3.1-70b.json to inference-cache-config/llama3.1-70b.json
563ba38
verified

dacorvo HF Staff commited on

Update inference-cache-config/Llama3.1-70b.json
7b0370b
verified

dacorvo HF Staff commited on

Update inference-cache-config/mistral.json
8ea3b57
verified

dacorvo HF Staff commited on

Update inference-cache-config/llama.json
d05f579
verified

dacorvo HF Staff commited on

Rename inference-cache-config/Llama3.1-70B.json to inference-cache-config/Llama3.1-70b.json
a92cfe3
verified

dacorvo HF Staff commited on

Update inference-cache-config/mixtral.json
7342c16
verified

dacorvo HF Staff commited on

Rename inference-cache-config/Llama-3.1-70B.json to inference-cache-config/Llama3.1-70B.json
b41e94c
verified

dacorvo HF Staff commited on

Create Llama-3.1-70B.json
b1279f9
verified

dacorvo HF Staff commited on

Delete inference-cache-config/llama3-8b.json
5b0b2de
verified

dacorvo HF Staff commited on

Update inference-cache-config/llama.json
0548cd2
verified

dacorvo HF Staff commited on

Delete inference-cache-config/llama2-7b-13b.json
219c5fd
verified

dacorvo HF Staff commited on

Update inference-cache-config/llama.json
afb9fe6
verified

dacorvo HF Staff commited on

Rename inference-cache-config/llama-3.1-8B.json to inference-cache-config/llama.json
14844a0
verified

dacorvo HF Staff commited on

Update inference-cache-config/mistral.json
6c4c814
verified

dacorvo HF Staff commited on

Create llama-3.1-8B.json
320841a
verified

dacorvo HF Staff commited on

Update inference-cache-config/llama3-8b.json
de9e259
verified

dacorvo HF Staff commited on

Update inference-cache-config/llama3-70b.json
5694f75
verified

dacorvo HF Staff commited on

Update inference-cache-config/stable-diffusion.json
5272eb2
verified

Jingya HF Staff commited on

Temporarily remove SD 1.5 from Runway
a74d412
verified

Jingya HF Staff commited on

Update inference-cache-config/llama-variants.json
e7179a3
verified

dacorvo HF Staff commited on