Some questions about training
#3, opened by desimfj
I am currently using TRL's SFT to train Qwen3-30B-A3B and have run into the following issue: https://github.com/huggingface/trl/issues/4204. Could you please share the SFT configuration you used? Thank you very much!