Update README.md
Browse files
README.md
CHANGED
@@ -41,7 +41,7 @@ Read more details about Sailor2 at https://sailorllm.github.io/blog/sailor2.
|
|
41 |
- [sea-commoncrawl](https://huggingface.co/datasets/sailor2/sea-commoncrawl): Cleaned and deduplicated commoncrawl
|
42 |
- [sea-internet](https://huggingface.co/datasets/sailor2/sea-internet): Cleaned multilingual data from Internet Archive
|
43 |
- [sea-pdf-text](https://huggingface.co/datasets/sailor2/sea-pdf-text): Cleaned pdf data
|
44 |
-
- [sea-
|
45 |
- [sea-commoncrawl-high-quality](https://huggingface.co/datasets/sailor2/sea-commoncrawl-high-quality): extra cleaned and deduplicated commoncrawl
|
46 |
|
47 |
</details>
|
|
|
41 |
- [sea-commoncrawl](https://huggingface.co/datasets/sailor2/sea-commoncrawl): Cleaned and deduplicated commoncrawl
|
42 |
- [sea-internet](https://huggingface.co/datasets/sailor2/sea-internet): Cleaned multilingual data from Internet Archive
|
43 |
- [sea-pdf-text](https://huggingface.co/datasets/sailor2/sea-pdf-text): Cleaned pdf data
|
44 |
+
- [sea-synthetic](https://huggingface.co/datasets/sailor2/sea-synthetic): Translation dataset from Cosmopedia across multiple languages
|
45 |
- [sea-commoncrawl-high-quality](https://huggingface.co/datasets/sailor2/sea-commoncrawl-high-quality): extra cleaned and deduplicated commoncrawl
|
46 |
|
47 |
</details>
|