Text Generation
Transformers
Safetensors
cohere2
conversational

What if use LLM without default system prompt?

#13
by MRU4913 - opened

What if we use an LLM without the default system prompt? Will there be any degradation?@alexrs

Cohere Labs org

@MRU4913 the model was trained with a system prompt, therefore some degradation is expected if no system prompt is given. However, this might depend on the task you are trying to solve, so feel free to experiment and see what works best for your use case!

The default system prompt eats up extra tokens (even though prefilling is cheap and fast). Is there any point in training with this prompt? I feel like it stops the model from being used in more diverse ways. @alexrs . Is this prompt added during the RF stage or the SFT stage?

Cohere Labs org

@MRU4913 We support different system prompts to achieve different behavior (i.e interactive and non-interactive, different levels of safety, RAG and tools, etc) -- https://docs.cohere.com/docs/command-a-hf

It does use extra tokens in the input, but the model is trained to support a context length of 256K, so there should still be enough room there for most use cases!

feel like it stops the model from being used in more diverse ways

You can definitely experiment with no system prompt and custom system prompts if your use case can benefit from it (and you're encouraged to share results if you do so!)

MRU4913 changed discussion status to closed
Your need to confirm your account before you can post a new comment.

Sign up or log in to comment