- DoRA: Weight-Decomposed Low-Rank Adaptation
  Paper • 2402.09353 • Published • 28
- MobileLLM: Optimizing Sub-billion Parameter Language Models for On-Device Use Cases
  Paper • 2402.14905 • Published • 133
- Resonance RoPE: Improving Context Length Generalization of Large Language Models
  Paper • 2403.00071 • Published • 25
- AtP*: An efficient and scalable method for localizing LLM behaviour to components
  Paper • 2403.00745 • Published • 14
Aharneish Abburu
Aharneish