
document_translator

Project to translate files (for now .docx) using BSC's models while keeping the formatting and style of the original file.

Requirements

Python 3.12

fast_align

Clone https://github.com/clab/fast_align, build it with the compilation commands given in that project's README, and place the resulting fast_align and atools binaries (.exe on Windows) in this project's root.
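A quick way to confirm that both binaries ended up in the right place is a small check like the one below. It is only a sketch: the helper name missing_binaries and its windows parameter are hypothetical and not part of this project, and it assumes the binaries keep their default names (fast_align and atools, with .exe appended on Windows).

```python
import sys
from pathlib import Path

# Binary names as given in this README; ".exe" is appended on Windows.
REQUIRED_BINARIES = ("fast_align", "atools")


def missing_binaries(project_root, windows=None):
    """Return the expected binary paths that are not present in project_root.

    Hypothetical helper for illustration only. `windows` overrides platform
    detection, which is mainly useful for testing.
    """
    if windows is None:
        windows = sys.platform.startswith("win")
    suffix = ".exe" if windows else ""
    root = Path(project_root)
    expected = [root / (name + suffix) for name in REQUIRED_BINARIES]
    return [path for path in expected if not path.is_file()]
```

Calling missing_binaries(".") from the project's root returns an empty list when both files are in place, and otherwise lists the paths that still need to be copied over.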

fast_align fine-tuning files

The four files (ca-en.params, ca-en.err, en-ca.params and en-ca.err) come from https://huggingface.co/projecte-aina/aina-translator-ca-en/tree/main. Downloading them could be automated in the future. For now, place them manually in config_folder (defined in main.py).
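One possible way to automate that download is sketched below using only the standard library. It is not part of this project. It assumes the four files sit at the top level of the repository's main branch and that Hugging Face's usual resolve/main download URLs apply. The function names and the fetch parameter are hypothetical, and the folder passed in should be whatever config_folder is set to in main.py.

```python
import urllib.request
from pathlib import Path

REPO_ID = "projecte-aina/aina-translator-ca-en"
# The four fast_align fine-tuning files named in this README.
FILES = ("ca-en.params", "ca-en.err", "en-ca.params", "en-ca.err")


def file_url(filename):
    # Assumption: files live at the root of the repo's main branch and are
    # served through the standard "resolve/main" download path.
    return f"https://huggingface.co/{REPO_ID}/resolve/main/{filename}"


def download_missing(config_folder, fetch=urllib.request.urlretrieve):
    """Download any of the four files not yet in config_folder.

    Returns the list of file names that were fetched. `fetch` takes
    (url, destination_path) and can be swapped out for testing.
    """
    folder = Path(config_folder)
    folder.mkdir(parents=True, exist_ok=True)
    fetched = []
    for name in FILES:
        target = folder / name
        if not target.exists():
            fetch(file_url(name), str(target))
            fetched.append(name)
    return fetched
```

Files that are already present are skipped, so running the function again after a successful download fetches nothing and returns an empty list.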

Python requirements

pip install -r requirements.txt