Add new embeddings and update data processing scripts; remove MiniEncoder 05a2a0c = commited on Jun 3