>>106156921
>- Removed c2 Samples
>- Llama3.1 was more disappointing, in the Instruct Tune? It felt overbaked, at least. Likely due to the DPO being done after their SFT Stage.
>- Tuning on L3.1 base did not give good results