-
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Paper • 2309.12307 • Published • 89 -
LMDX: Language Model-based Document Information Extraction and Localization
Paper • 2309.10952 • Published • 66 -
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper • 2310.09263 • Published • 41 -
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 104
kuan li
minlik
AI & ML interests
None yet
Organizations
None yet
LLM
-
LongLoRA: Efficient Fine-tuning of Long-Context Large Language Models
Paper • 2309.12307 • Published • 89 -
LMDX: Language Model-based Document Information Extraction and Localization
Paper • 2309.10952 • Published • 66 -
Table-GPT: Table-tuned GPT for Diverse Table Tasks
Paper • 2310.09263 • Published • 41 -
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 104
IE
Information Extraction
models
15
minlik/chinese-alpaca-plus-33b-merged
Text Generation
•
33B
•
Updated
•
16
minlik/chinese-llama-13b-merged
Text Generation
•
13B
•
Updated
•
18
•
6
minlik/chinese-alpaca-pro-33b-merged
Text Generation
•
33B
•
Updated
•
16
•
4
minlik/chinese-alpaca-13b-merged
Text Generation
•
13B
•
Updated
•
17
•
16
minlik/Qwen2.5-Vl-3B-Instruct-GRPO-deepmath-ocr-7k
4B
•
Updated
•
5
minlik/Qwen2.5-VL-3B-Instruct-GRPO-deepmath-ocr-1k
4B
•
Updated
•
7
minlik/chinese-llama-plus-7b-merged
Text Generation
•
7B
•
Updated
•
16
•
8
minlik/chinese-alpaca-7b-merged
Text Generation
•
7B
•
Updated
•
19
•
10
minlik/chinese-alpaca-33b-merged
Text Generation
•
33B
•
Updated
•
1.82k
•
9
minlik/docllm-yi-6b
Text Generation
•
7B
•
Updated
•
9
•
1