Some questions for training.

#3
by desimfj - opened

I am currently using TRL's SFT to train Qwen3-30b-a3b and encountered the following issue: https://github.com/huggingface/trl/issues/4204. Could you please share your SFT configuration? Thank you very much!

Sign up or log in to comment