Hugging Face
Alan Tseng
agentlans
20 followers · 17 following
AI & ML interests
Small data, boring AI
Recent Activity
Updated a dataset about 15 hours ago: agentlans/reddit-critical-thinking
Published a dataset about 16 hours ago: agentlans/reddit-critical-thinking
Replied to onekq's post 2 days ago:
This is bitter lesson 2.0: https://storage.googleapis.com/deepmind-media/Era-of-Experience%20/The%20Era%20of%20Experience%20Paper.pdf If this reads too lofty to you, consider some low-hanging fruit. Experiences here are reward signals we send to LLMs, e.g. human scores in RLHF, verification in AlphaProof, or test results for code generation. RFT (reinforced finetuning) will become mainstream and, IMO, make LLMs behave more like agents.
Organizations
None yet
agentlans's activity
Liked 2 datasets 2 months ago
Nafnlaus/ShrimpMoss_Chinese_Censorship_Abliteration
Preview • Updated Jan 24 • 93 • 5
promptfoo/CCP-sensitive-prompts
Viewer • Updated Jan 28 • 1.36k • 169 • 44
Liked a dataset 3 months ago
OpenLeecher/lmsys_chat_1m_clean
Viewer • Updated Dec 31, 2024 • 273k • 942 • 74
Liked 2 models 4 months ago
microsoft/deberta-v3-xsmall
Fill-Mask • Updated Sep 26, 2022 • 112k • 43
google/flan-t5-small
Text2Text Generation • Updated Oct 10, 2023 • 651k • 338
Liked a Space 4 months ago
PawMatchAI 🐾
Running on Zero • 98
Smart Dog Breed Detection, Comparison, and Matching Tool
Liked a model 6 months ago
ZeroXClem/Llama3.1-DarkStorm-Aspire-8B
Text Generation • Updated Oct 24, 2024 • 3