Update docs.md
docs.md
@@ -33,6 +33,10 @@
 <li><strong>1M+ clinical samples</strong></li>
 </ul>
 
+<div style="text-align: center;">
+<img src="https://cdn-uploads.huggingface.co/production/uploads/67a040fb6934f9aa1c866f99/2fh-jETNSL9iXJXTT-fdN.png" style="width: 50%;" alt="BRIDGE benchmark graphic">
+</div>
+
 <h2>Key Features</h2>
 <p>Our benchmark spans a wide range of document types and clinical tasks, including classification, event extraction, and generation. It further supports three inference strategies: <strong>zero-shot</strong>, <strong>few-shot</strong>, and <strong>chain-of-thought (CoT)</strong> prompting. We evaluated <strong>52 LLMs</strong>, including general-purpose, open-source, proprietary, and medical-domain models.</p>
 <ul>
@@ -48,6 +52,7 @@
 </li>
 </ul>
 
+
 <h2>BRIDGE Leaderboard</h2>
 <p>To support ongoing evaluation, we introduce our <strong>BRIDGE Leaderboard</strong>, which provides:</p>
 <ul>