MaskLLM: Learnable Semi-structured Sparsity for Large Language Models (NeurIPS'24 Spotlight)
Gongfan Fang
Vinnnf
AI & ML interests: None yet
Recent Activity
Upvoted a paper 3 days ago: Nemotron-H: A Family of Accurate and Efficient Hybrid Mamba-Transformer Models
Upvoted a paper 3 days ago: Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning
Authored a paper 4 days ago: LLM-Pruner: On the Structural Pruning of Large Language Models
Collections: 1
Models: 3
Datasets: None public yet