Shijie Geng
makitanikaze
AI & ML interests
None yet
Recent Activity
upvoted a paper about 1 month ago
From Hours to Minutes: Lossless Acceleration of Ultra Long Sequence Generation up to 100K Tokens
upvoted a paper about 1 month ago
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models
Organizations
None yet
makitanikaze's activity
ImportError: cannot import name '_flash_supports_window_size' from 'transformers.modeling_flash_attention_utils'
1
#2 opened about 1 month ago by XOGorKi
