-
Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning
Paper • 2311.11077 • Published • 28 -
Tensor Product Attention Is All You Need
Paper • 2501.06425 • Published • 89 -
LoRA: Low-Rank Adaptation of Large Language Models
Paper • 2106.09685 • Published • 38 -
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
Paper • 2403.03853 • Published • 65
Eugene Oskin
eoskin
AI & ML interests
None yet
Recent Activity
upvoted
a
paper
about 1 month ago
RoFormer: Enhanced Transformer with Rotary Position Embedding
liked
a model
about 1 month ago
budecosystem/boomer-634m
updated
a collection
about 1 month ago
impactful-papers
Organizations
None yet
Collections
1
models
None public yet
datasets
None public yet