Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
tianzhechu 's Collections
BookQA-Series
SFTvsRL Models & Data

SFTvsRL Models & Data

updated Mar 13

This collection contains 4 initial checkpoints for https://github.com/LeslieTrue/SFTvsRL and necessary data for V-IRL training.

Upvote
9

  • tianzhechu/GP-VL-Init

    11B • Updated Feb 8 • 3

  • tianzhechu/GP-L-Init

    11B • Updated Feb 8 • 7

  • tianzhechu/VIRL-L-Init

    11B • Updated Feb 8 • 3 • 1

  • tianzhechu/VIRL-VL-Init

    11B • Updated Feb 8 • 5

  • tianzhechu/SFTvsRL_Data

    Viewer • Updated Feb 8 • 5.97k • 13 • 4

  • tianzhechu/GP-L-RL20

    11B • Updated Mar 12 • 2
Upvote
9
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs