Direct Preference Optimization