-
Adapters: A Unified Library for Parameter-Efficient and Modular Transfer Learning
Paper • 2311.11077 • Published • 28 -
Tensor Product Attention Is All You Need
Paper • 2501.06425 • Published • 89 -
LoRA: Low-Rank Adaptation of Large Language Models
Paper • 2106.09685 • Published • 44 -
ShortGPT: Layers in Large Language Models are More Redundant Than You Expect
Paper • 2403.03853 • Published • 66
Eugene Oskin
eoskin
AI & ML interests
None yet
Recent Activity
liked
a model
about 12 hours ago
unsloth/gpt-oss-20b-GGUF
liked
a model
about 12 hours ago
openai/gpt-oss-120b
upvoted
an
article
1 day ago
Finally, a Replacement for BERT: Introducing ModernBERT
Organizations
None yet