OpenChat 7B is a library of open-source language models, fine-tuned with C-RLFT (Conditioned Reinforcement Learning Fine-Tuning) - a strategy inspired by offline reinforcement learning. It has been trained on mixed-quality data without preference labels.