PEFT
Safetensors

Model Card for StarCoder2-LPO

This is the LPO adapter for StarCoder2, trained with Localized Preference Optimization (LPO) on DiSCo for the paper "Teaching an Old LLM Secure Coding: Localized Preference Optimization on Distilled Preferences" (https://arxiv.org/abs/2506.00419). To use it for downstream tasks, first merge the StarCoder2-SFT adapter into the base model "bigcode/starcoder2-7b", then merge this adapter into the resulting model.
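The two-step merge described above can be sketched with the PEFT and Transformers libraries. This is a minimal sketch, not the authors' published loading script: the SFT adapter repository ID "StonyBrookNLP/StarCoder2-SFT" is an assumption inferred from this card's naming, and the helper name load_lpo_model is hypothetical. Default dtype and device settings are used, so adjust them for your hardware.

```python
BASE_ID = "bigcode/starcoder2-7b"
# Assumption: the SFT adapter lives under the same organization as this card.
SFT_ADAPTER_ID = "StonyBrookNLP/StarCoder2-SFT"
LPO_ADAPTER_ID = "StonyBrookNLP/StarCoder2-LPO"


def load_lpo_model(base_id=BASE_ID, sft_id=SFT_ADAPTER_ID, lpo_id=LPO_ADAPTER_ID):
    """Hypothetical helper: merge the SFT adapter, then the LPO adapter."""
    # Imported here so the constants above can be used without the heavy libraries.
    from peft import PeftModel
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(base_id)
    base = AutoModelForCausalLM.from_pretrained(base_id)

    # Step 1: attach the SFT adapter and fold its weights into the base model.
    sft_merged = PeftModel.from_pretrained(base, sft_id).merge_and_unload()

    # Step 2: attach this LPO adapter to the SFT-merged model and fold it in too.
    lpo_merged = PeftModel.from_pretrained(sft_merged, lpo_id).merge_and_unload()

    return lpo_merged, tokenizer
```

Calling load_lpo_model() downloads the base weights and both adapters, so it needs network access and enough memory for a 7B-parameter model. The returned model is a plain Transformers model with both adapters merged in.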
