min
qiyang-attn
AI & ML interests
GNNs, LLMs, Generative Models, Multimodal Models, Recommendation Models
Recent Activity
upvoted a paper about 1 month ago
Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts
authored a paper about 1 month ago
Frac-Connections: Fractional Extension of Hyper-Connections
authored a paper 3 months ago
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling
Organizations
None yet
qiyang-attn's activity
No public activity