Spaces:
Running
Running
Update README.md
Browse files
README.md
CHANGED
@@ -1,10 +1,52 @@
|
|
1 |
---
|
2 |
title: README
|
3 |
-
emoji:
|
4 |
colorFrom: green
|
5 |
-
colorTo:
|
6 |
sdk: static
|
7 |
pinned: false
|
|
|
|
|
|
|
8 |
---
|
9 |
|
10 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
1 |
---
|
2 |
title: README
|
3 |
+
emoji: 🏢
|
4 |
colorFrom: green
|
5 |
+
colorTo: indigo
|
6 |
sdk: static
|
7 |
pinned: false
|
8 |
+
thumbnail: >-
|
9 |
+
https://cdn-uploads.huggingface.co/production/uploads/678e8ad782840dd75bbfcb59/gQGweVRWbaH4trQtH4lSL.png
|
10 |
+
short_description: Medical datasets for healthcare model training.
|
11 |
---
|
12 |
|
13 |
+
# **Medical Datasets**
|
14 |
+
|
15 |
+
This Medical dataset is crafted as a versatile resource for enthusiasts of data science, machine learning, and data analysis. It replicates the characteristics of real-world healthcare data, offering users a platform to practice, refine, and showcase their data manipulation and analytical skills within the healthcare domain.
|
16 |
+
|
17 |
+
## **Potential Uses**
|
18 |
+
- Building and testing predictive models specific to healthcare.
|
19 |
+
- Practicing techniques for data cleaning, transformation, and analysis.
|
20 |
+
- Designing visualizations to uncover insights into healthcare trends.
|
21 |
+
- Learning and teaching data science and machine learning concepts in a healthcare setting.
|
22 |
+
|
23 |
+
## **Acknowledgments**
|
24 |
+
- This dataset is entirely synthetic, created with a focus on respecting healthcare data privacy and security. It contains no real patient information and complies with privacy regulations.
|
25 |
+
- The goal is to support advancements in data science and healthcare analytics while inspiring innovative ideas.
|
26 |
+
|
27 |
+
## **Dataset Contents**
|
28 |
+
|
29 |
+
### **Evaluation Medical Instruction Datasets**
|
30 |
+
Contains datasets used for evaluating medical instruction models:
|
31 |
+
- `evaluation-medical-instruction-dataset.json`
|
32 |
+
- `medmcqa-train-instruction-dataset.json`
|
33 |
+
- `medial-train-instruction-dataset.json`
|
34 |
+
- `pubmedqa-train-instruction-train.json`
|
35 |
+
|
36 |
+
### **General Medical Instruction Datasets**
|
37 |
+
Contains general medical instruction datasets:
|
38 |
+
- `general-medical-instruction-dataset.json`
|
39 |
+
- `GenMedGPT-5k.json`
|
40 |
+
- `HealthCareMagic-100k.json`
|
41 |
+
- `medical_meadow_wikidoc_medical_flashcards.json`
|
42 |
+
- `medical_meadow_wikidoc_patient_info.json`
|
43 |
+
- `medicationqa.json`
|
44 |
+
|
45 |
+
### **Medical Preference Data**
|
46 |
+
- `medical-preference-data.json`: Contains data related to medical preferences.
|
47 |
+
|
48 |
+
### **Medical Pretraining Datasets**
|
49 |
+
Contains datasets used for pretraining medical models.
|
50 |
+
|
51 |
+
## **Usage**
|
52 |
+
These datasets can be used to train and evaluate models for medical question answering and other related tasks. Each dataset is provided in **JSON** and **TXT** formats and contains various medical instructions, inputs, and outputs.
|