feat(data_processing): Implement token length control with semantic preservation 922ed80 YanBoChen commited on Jul 28