imamnurby's Collections
Long Sequences for LLM
Attention in LLM
LLM for Codes
LLM Benchmark
Graph Neural Network
MoE
LLM Security
General Purpose LLM
Continual Training
Pretraining
Model Merging
Chain of Thought
Instruction Tuning
Code Benchmark

Long Sequences for LLM

updated Nov 28, 2023
  • YaRN: Efficient Context Window Extension of Large Language Models

    Paper • 2309.00071 • Published Aug 31, 2023 • 73