Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
lamm-mit 's Collections
VibeGen
Graph-Aware Isomorphic Attention in Transformers
PRefLexOR
LAMM MIT papers
Cephalo
SciAgents
Leaf-inspired Image Generation
Bioinspired LLMs

PRefLexOR

updated Jan 22

PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking

Upvote
3

  • PRefLexOR: Preference-based Recursive Language Modeling for Exploratory Optimization of Reasoning and Agentic Thinking

    Paper • 2410.12375 • Published Oct 16, 2024 • 4

    Note Paper on arXiv


  • In-situ graph reasoning and knowledge expansion using Graph-PReFLexOR

    Paper • 2501.08120 • Published Jan 14 • 5

  • lamm-mit/PRefLexOR_ORPO_DPO_EXO_10242024

    Text Generation • 4B • Updated Oct 25, 2024 • 7

    Note Model produces thinking tokens before answering


  • lamm-mit/PRefLexOR_ORPO_DPO_EXO_REFLECT_10222024

    Text Generation • 4B • Updated Oct 25, 2024 • 3 • 3

    Note Model produces both thinking and reflection tokens before answering


  • lamm-mit/meta-llama-Meta-Llama-3.2-3B-Instruct-Reasoning-Tokenizer

    Updated Oct 19, 2024

    Note Llama tokenizer with special tokens added (thinking, reflection, scratchpad, response)


  • lamm-mit/Graph-Preflexor_01062025

    Text Generation • 4B • Updated Jan 17 • 21 • 10

    Note Model with in-situ graph and abstract pattern reasoning

Upvote
3
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets Spaces Pricing Docs