Scattertext on Japanese novels
Regex- and Python-based preprocessing demo
Counting words in English and Japanese
Compares original text with LLM-preprocessed (or otherwise preprocessed) text
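
As a rough illustration of the word-counting item above, here is a minimal sketch using only the Python standard library. It is not the code from the demos. The names `EN_WORD`, `JA_RUN`, `count_english`, and `count_japanese` are hypothetical, and the Japanese side is a stated simplification: Japanese text has no spaces between words, so this sketch counts runs of a single script (kanji, hiragana, or katakana) rather than true words. A morphological analyzer would normally be needed for real word segmentation.

```python
import re
from collections import Counter

# English: runs of ASCII letters, optionally with one internal apostrophe ("don't").
EN_WORD = re.compile(r"[A-Za-z]+(?:'[A-Za-z]+)?")

# Japanese (assumption: script runs stand in for words).
# Kanji U+4E00-U+9FFF, hiragana U+3040-U+309F, katakana U+30A0-U+30FF.
JA_RUN = re.compile(r"[\u4e00-\u9fff]+|[\u3040-\u309f]+|[\u30a0-\u30ff]+")


def count_english(text):
    """Count lowercase English word tokens found by EN_WORD."""
    return Counter(word.lower() for word in EN_WORD.findall(text))


def count_japanese(text):
    """Count same-script character runs found by JA_RUN (a crude proxy for words)."""
    return Counter(JA_RUN.findall(text))


if __name__ == "__main__":
    print(count_english("The cat saw the dog.").most_common(3))
    print(count_japanese("吾輩は猫である。吾輩は猫だ。").most_common(3))
```

The same two functions can be run on an original text and on its preprocessed version, and the resulting `Counter` objects subtracted to see which tokens the preprocessing added or removed.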