Anonymous
8/6/2025, 1:52:47 AM
No.106156941
>>106156921
>- Removed c2 Samples
>- Llama3.1 was more disappointing, in the Instruct Tune? It felt overbaked, at least. Likely due to the DPO being done after their SFT Stage.
>- Tuning on L3.1 base did not give good results