# document_translator
Translates files (currently only .docx) using BSC's models while preserving the formatting and style of the original file.
## Requirements
### Python 3.12
### fast_align
Clone https://github.com/clab/fast_align and build it by running the compilation commands given in that project's README. Then place the resulting fast_align and atools binaries (fast_align.exe and atools.exe on Windows) in this project's root directory.
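
A quick way to confirm the binaries ended up in the right place is a small check like the one below. This is only a sketch: the helper names `aligner_binaries` and `missing_binaries` are hypothetical and not part of this project, and it assumes that the only platform difference is the `.exe` suffix on Windows.

```python
import sys
from pathlib import Path


def aligner_binaries(root, platform=sys.platform):
    """Return the expected paths of fast_align and atools inside `root`.

    Hypothetical helper: assumes the binaries only differ by the .exe
    suffix on Windows (where sys.platform is "win32").
    """
    suffix = ".exe" if platform.startswith("win") else ""
    return [Path(root) / f"{tool}{suffix}" for tool in ("fast_align", "atools")]


def missing_binaries(root, platform=sys.platform):
    """Return the names of the expected binaries that are not present in `root`."""
    return [p.name for p in aligner_binaries(root, platform) if not p.is_file()]
```

Calling `missing_binaries(".")` from the project root returns an empty list once both binaries are in place, and otherwise lists the ones that still need to be copied.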
### fast_align fine-tuning files
The four files (ca-en.params, ca-en.err, en-ca.params and en-ca.err) come from https://huggingface.co/projecte-aina/aina-translator-ca-en/tree/main. Downloading them could be automated in the future. For now, place them manually in config_folder (defined in main.py).
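
One possible way to automate the download is sketched below, using only the standard library. It is not implemented in this project. It assumes the four files sit at the top level of the `main` branch of the model repository and can be fetched through Hugging Face's `resolve` URLs. The names `file_url`, `missing_files` and `download_missing` are hypothetical, and the target folder must match whatever config_folder is set to in main.py.

```python
from pathlib import Path
from urllib.request import urlretrieve

# Repository and file names taken from the section above.
REPO_ID = "projecte-aina/aina-translator-ca-en"
FILES = ["ca-en.params", "ca-en.err", "en-ca.params", "en-ca.err"]


def file_url(filename, repo_id=REPO_ID, revision="main"):
    """Build the direct-download URL for a file in a Hugging Face repository.

    Assumption: the file lives at the top level of the given revision.
    """
    return f"https://huggingface.co/{repo_id}/resolve/{revision}/{filename}"


def missing_files(config_folder, filenames=FILES):
    """Return the fine-tuning files that are not yet in `config_folder`."""
    folder = Path(config_folder)
    return [name for name in filenames if not (folder / name).is_file()]


def download_missing(config_folder):
    """Download every missing fine-tuning file into `config_folder`."""
    folder = Path(config_folder)
    folder.mkdir(parents=True, exist_ok=True)
    for name in missing_files(folder):
        # Requires network access; files already present are left untouched.
        urlretrieve(file_url(name), str(folder / name))
```

Calling `download_missing(config_folder)` once before the first translation would then fetch only the files that are not already present.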
### Python requirements
pip install -r requirements.txt