Training datasets for the ChEmbed family of text embedding models.
Datasets (74)

BASF-AI/PlantCAD2_virtual_hackathon (9 rows, 30 downloads)
BASF-AI/dolma-pes2o-chemistry (361k rows, 56 downloads, 1 like)
BASF-AI/ChemRxiv-Papers (30.4k rows, 25 downloads, 1 like)
BASF-AI/ChemRxiv-Paragraphs (209k rows, 12 downloads, 2 likes)
BASF-AI/ChemRxiv-Train-CC-BY (139k rows, 21 downloads)
BASF-AI/dolma-chem-only-query-generated (1.17M rows, 56 downloads)
BASF-AI/ChemRxivRetrieval (79.5k rows, 17 downloads, 1 like)
BASF-AI/ChemRxiv-Train-CC-BY-v2 (138k rows, 44 downloads, 2 likes)
BASF-AI/PubChem-Raw (2.5M rows, 19 downloads)
BASF-AI/PubChem-v4 (393k rows, 35 downloads, 1 like)