Distribution Fine Tuning: A post-training step to make models write better
Distribution Fine Tuning (DFT) is a post-training technique that refines language model outputs by adjusting the probability distribution of token predictions, rather than modifying the model's weights. This approach enhances writing quality, coherence, and style consistency without requiring full retraining, making it a lightweight and efficient method for improving model performance after initial training.