Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up

Eliciting-Contexts-LASR

community
Activity Feed

AI & ML interests

None defined yet.

Leo Richter's profile picture themachinefan's profile picture AC's profile picture Edward Stevinson's profile picture

Collections 1

Sandbagging Models
Various versions of model organisms that perform sandbagging
  • Eliciting-Contexts/sandbagging-auditing

    Updated Feb 20
Sandbagging Models
Various versions of model organisms that perform sandbagging
  • Eliciting-Contexts/sandbagging-auditing

    Updated Feb 20

models 5

Eliciting-Contexts/sandbagging-monitor-2

Updated Apr 28

Eliciting-Contexts/sandbagging-password-lovely-blooming-flower

Updated Apr 28

Eliciting-Contexts/sandbagging-password-blooming-flower

Updated Apr 28

Eliciting-Contexts/sandbagging-password-flower

Updated Apr 28

Eliciting-Contexts/sandbagging-auditing

Updated Feb 20

datasets 6

Eliciting-Contexts/backdoors-benchmark-dataset-v2

Viewer • Updated 24 days ago • 48 • 84

Eliciting-Contexts/backdoors-benchmark-dataset

Viewer • Updated May 5 • 15 • 5

Eliciting-Contexts/simple_stories_new

Viewer • Updated May 3 • 67 • 4

Eliciting-Contexts/discover

Viewer • Updated May 3 • 18 • 7

Eliciting-Contexts/jailbreaking

Viewer • Updated May 3 • 20 • 3

Eliciting-Contexts/applications-benchmark-dataset

Viewer • Updated Apr 29 • 4 • 3
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs