Alan Tseng

agentlans

AI & ML interests

Small data, boring AI

Recent Activity

updated a dataset about 15 hours ago

agentlans/reddit-critical-thinking

published a dataset about 16 hours ago

agentlans/reddit-critical-thinking

replied to onekq's post 2 days ago

This is bitter lesson 2.0: https://storage.googleapis.com/deepmind-media/Era-of-Experience%20/The%20Era%20of%20Experience%20Paper.pdf If this reads as too lofty to you, consider some low-hanging fruit. Experiences here are the reward signals we send to LLMs, e.g. human scores in RLHF, verification in AlphaProof, or test results for code generation. RFT (reinforced finetuning) will become mainstream and, IMO, make LLMs behave more like agents.

Organizations

None yet

agentlans's activity

liked 2 datasets 2 months ago

Nafnlaus/ShrimpMoss_Chinese_Censorship_Abliteration

Preview • Updated Jan 24 • 93 • 5

promptfoo/CCP-sensitive-prompts

Viewer • Updated Jan 28 • 1.36k • 169 • 44

liked a dataset 3 months ago

OpenLeecher/lmsys_chat_1m_clean

Viewer • Updated Dec 31, 2024 • 273k • 942 • 74

liked 2 models 4 months ago

microsoft/deberta-v3-xsmall

Fill-Mask • Updated Sep 26, 2022 • 112k • 43

google/flan-t5-small

Text2Text Generation • Updated Oct 10, 2023 • 651k • 338

liked a Space 4 months ago

PawMatchAI

Smart Dog Breed Detection, Comparison, and Matching Tool

liked a model 6 months ago

ZeroXClem/Llama3.1-DarkStorm-Aspire-8B

Text Generation • Updated Oct 24, 2024 • 3