Heaplax/ARMAP-RM-LoRA
Reinforcement Learning
Transformers
arxiv: 2502.12130
License: apache-2.0
4f2c47a
ARMAP-RM-LoRA/RM-alfworld/checkpoint-460/adapter_model
2 contributors
History: 1 commit
Heaplax: Upload folder using huggingface_hub (29c609c, verified, 3 months ago)
lora_default | Upload folder using huggingface_hub | 3 months ago
README.md | Safe | 88 Bytes | Upload folder using huggingface_hub | 3 months ago