Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
33
14
1
Yuxian Gu
t1101675
Follow
buaa42wxy's profile picture
instruction-pretrain's profile picture
SinclairWang's profile picture
7 followers
·
8 following
https://t1101675.github.io/
gu_yuxian
t1101675
AI & ML interests
Efficient methods for language models
Recent Activity
upvoted
a
paper
about 21 hours ago
BitNet b1.58 2B4T Technical Report
updated
a model
8 days ago
MiniLLM/MiniLLM-gpt2-340M
new
activity
25 days ago
MiniLLM/MiniLLM-gpt2-340M:
Adding `safetensors` variant of this model
View all activity
Organizations
t1101675
's activity
All
Models
Datasets
Spaces
Papers
Collections
Community
Posts
Upvotes
Likes
Articles
upvoted
a
paper
about 21 hours ago
BitNet b1.58 2B4T Technical Report
Paper
•
2504.12285
•
Published
2 days ago
•
47
updated
a model
8 days ago
MiniLLM/MiniLLM-gpt2-340M
Text Generation
•
Updated
8 days ago
•
52
•
2
New activity in
MiniLLM/MiniLLM-gpt2-340M
25 days ago
Adding `safetensors` variant of this model
#1 opened about 1 month ago by
SFconvertbot
New activity in
MiniLLM/SFT-gpt2-120M
25 days ago
Adding `safetensors` variant of this model
#1 opened about 1 month ago by
SFconvertbot
New activity in
MiniLLM/SFT-gpt2-760M
25 days ago
Adding `safetensors` variant of this model
#1 opened about 1 month ago by
SFconvertbot
New activity in
Data-Selection/PDS-470M
25 days ago
Adding `safetensors` variant of this model
#1 opened 3 months ago by
SFconvertbot
New activity in
Data-Selection/PDS-160M
25 days ago
Adding `safetensors` variant of this model
#1 opened 3 months ago by
SFconvertbot
Add link to paper
#2 opened 25 days ago by
nielsr
New activity in
Data-Selection/PDS-470M
25 days ago
Clarify Model Description and Add Project Page Link
#2 opened 25 days ago by
nielsr
New activity in
Data-Selection/PDS-1B
25 days ago
Add link to code repository
#2 opened 25 days ago by
nielsr
New activity in
Data-Selection/PDS-1.7B
25 days ago
Add link to Github and improve description
#2 opened 25 days ago by
nielsr
New activity in
Data-Selection/BSL-1.7B
25 days ago
Add link to code
#2 opened 25 days ago by
nielsr
New activity in
MiniLLM/MiniPLM-Qwen-500M
25 days ago
Improve model card: add paper abstract and link to paper
#1 opened 25 days ago by
nielsr
New activity in
MiniLLM/MiniPLM-llama3.1-212M
25 days ago
Add library name and link to code repository
#1 opened 25 days ago by
nielsr
New activity in
MiniLLM/MiniPLM-Mamba-130M
25 days ago
Improve MiniPLM-Mamba-130M model card
#1 opened 25 days ago by
nielsr
New activity in
MiniLLM/MiniPLM-Qwen-1.2B
25 days ago
Add link to code
#1 opened 25 days ago by
nielsr
New activity in
MiniLLM/Ref-Pretrain-Qwen-104M
25 days ago
Add link to code
#1 opened 25 days ago by
nielsr
New activity in
MiniLLM/Pretrain-Qwen-1.2B
25 days ago
Add link to code
#1 opened 25 days ago by
nielsr
New activity in
MiniLLM/Pretrain-Qwen-500M
25 days ago
No changes needed
#1 opened 25 days ago by
nielsr
New activity in
MiniLLM/Pretrain-Qwen-200M
25 days ago
Add link to code
#1 opened 25 days ago by
nielsr
Load more