feat(data-processing): implement data processing pipeline with embeddings 68cfce0 YanBoChen committed on Jul 27