Spaces:
Running
Running
Update app.py
Browse files
app.py
CHANGED
@@ -102,7 +102,23 @@ This method introduces sampling control along three key dimensions:
|
|
102 |
Among these, the novel **reasoning depth `H`** plays a critical role: by sampling outputs at different depths of partially completed reasoning chains, the model creates multiple sets of "fragmented thoughts + solutions," which are then jointly evaluated to select the most trustworthy outcome.
|
103 |
""")
|
104 |
gr.Image("figs/frac-frame.png", label="Framework", show_label=False, elem_id="my-img")
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
105 |
gr.Image("figs/single.png", label="Framework", show_label=False, elem_id="my-img")
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
106 |
gr.Image("figs/combine.png", label="Framework", show_label=False, elem_id="my-img")
|
107 |
gr.Markdown(
|
108 |
"""
|
|
|
102 |
Among these, the novel **reasoning depth `H`** plays a critical role: by sampling outputs at different depths of partially completed reasoning chains, the model creates multiple sets of "fragmented thoughts + solutions," which are then jointly evaluated to select the most trustworthy outcome.
|
103 |
""")
|
104 |
gr.Image("figs/frac-frame.png", label="Framework", show_label=False, elem_id="my-img")
|
105 |
+
gr.Markdown(
|
106 |
+
"""
|
107 |
+
### 🔍 Scaling Analysis of *n*, *m*, and *H* in DeepSeek-R1 Models
|
108 |
+
|
109 |
+
A detailed test-time scaling analysis on the DeepSeek-R1 series reveals the individual impact of the three sampling dimensions: `n` (number of reasoning paths), `m` (number of answers per path), and `H` (depth-wise reasoning samples).
|
110 |
+
|
111 |
+
Across multiple reasoning benchmarks, the results show that increasing **`H` — sampling across reasoning depths — yields the highest cost-effectiveness**. That is, sampling more intermediate answers along the depth of a single reasoning path leads to **greater accuracy improvements with fewer additional tokens**, compared to simply increasing the number of paths (`n`) or answers (`m`).
|
112 |
+
""")
|
113 |
gr.Image("figs/single.png", label="Framework", show_label=False, elem_id="my-img")
|
114 |
+
gr.Markdown(
|
115 |
+
"""
|
116 |
+
### 🔄 Joint Sampling of *n*, *m*, and *H* for Enhanced Accuracy
|
117 |
+
|
118 |
+
In practical scenarios, the sampling dimensions `n`, `m`, and `H` can be **jointly optimized** rather than tuned in isolation. By **dynamically allocating the sampling budget across these dimensions**, the model can significantly enhance its reasoning accuracy.
|
119 |
+
|
120 |
+
This joint sampling strategy leverages the complementary strengths of each dimension—diversity (`n`), redundancy (`m`), and depth-awareness (`H`)—to achieve robust performance under a fixed token budget.
|
121 |
+
""")
|
122 |
gr.Image("figs/combine.png", label="Framework", show_label=False, elem_id="my-img")
|
123 |
gr.Markdown(
|
124 |
"""
|