Khmer mT5 Summarization Model (Duplicated Text)

This repository contains a fine-tuned mT5-small model for Khmer text summarization. It is trained specifically to collapse duplicated or redundant content into concise, coherent summaries.
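A minimal usage sketch with the Hugging Face `transformers` library follows. The model ID is the one this repository is published under. Everything else is an assumption, not something stated in this card: the decoding settings, the 512-token input limit, and the empty task prefix (some mT5 fine-tunes expect one, such as "summarize: "). The helper names `generation_kwargs` and `summarize` are hypothetical.

```python
MODEL_ID = "songhieng/khmer-mt5-summarization-duplicated"


def generation_kwargs(max_new_tokens=128, num_beams=4):
    """Decoding settings for generate(); illustrative values, not the author's."""
    return {
        "max_new_tokens": max_new_tokens,
        "num_beams": num_beams,
        # Discourage repeated n-grams in the generated summary.
        "no_repeat_ngram_size": 3,
        "early_stopping": True,
    }


def summarize(text, model_id=MODEL_ID, prefix="", max_input_tokens=512):
    """Summarize one Khmer document. `prefix` is an assumed, optional task prefix."""
    # Imported lazily so the helper above can be used without transformers installed.
    from transformers import AutoModelForSeq2SeqLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    model = AutoModelForSeq2SeqLM.from_pretrained(model_id)

    # Truncate long inputs to the assumed maximum length.
    inputs = tokenizer(
        prefix + text,
        return_tensors="pt",
        truncation=True,
        max_length=max_input_tokens,
    )
    output_ids = model.generate(**inputs, **generation_kwargs())
    return tokenizer.decode(output_ids[0], skip_special_tokens=True)
```

Calling `summarize(article_text)` downloads the weights on first use and requires `torch` and `sentencepiece` alongside `transformers`. For batch work, load the tokenizer and model once and reuse them rather than reloading on every call.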


Model Details

  • Base model: google/mt5-small
  • Fine-tuned for: Khmer summarization with duplicate-text removal
  • Training dataset: kimleang123/khmer-text-dataset-duplicated
  • Task: Sequence-to-Sequence (text2text-generation)
  • Evaluation: ROUGE-1/2/L on held-out Khmer articles containing repeated passages
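For readers unfamiliar with the metrics named above, here is a small pure-Python sketch of ROUGE-N and ROUGE-L F1 scores. It is not the evaluation script used for this model, which the card does not describe. The functions take pre-tokenized lists because Khmer is written without spaces between words, and how the text was segmented for evaluation is not stated here, so tokenization is left to the caller.

```python
from collections import Counter


def rouge_n(reference, candidate, n=1):
    """ROUGE-N F1 over token lists, using clipped n-gram overlap."""
    ref = Counter(tuple(reference[i:i + n]) for i in range(len(reference) - n + 1))
    cand = Counter(tuple(candidate[i:i + n]) for i in range(len(candidate) - n + 1))
    # Counter intersection keeps the minimum count of each shared n-gram.
    overlap = sum((ref & cand).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)


def rouge_l(reference, candidate):
    """ROUGE-L F1 based on the longest common subsequence (LCS)."""
    m, n = len(reference), len(candidate)
    # Dynamic-programming table: table[i][j] is the LCS length of
    # reference[:i] and candidate[:j].
    table = [[0] * (n + 1) for _ in range(m + 1)]
    for i in range(m):
        for j in range(n):
            if reference[i] == candidate[j]:
                table[i + 1][j + 1] = table[i][j] + 1
            else:
                table[i + 1][j + 1] = max(table[i][j + 1], table[i + 1][j])
    lcs = table[m][n]
    if lcs == 0:
        return 0.0
    precision, recall = lcs / n, lcs / m
    return 2 * precision * recall / (precision + recall)
```

With a reference of `["a", "b", "c", "d"]` and a candidate of `["a", "b", "d"]`, ROUGE-1 and ROUGE-L both come to about 0.857 and ROUGE-2 to 0.4. Scores depend heavily on the tokenization, so numbers computed with different segmenters are not comparable.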

Weights: Safetensors format, 300M parameters, F32 tensor type