yuhuixu commited on
Commit
30fb361
·
verified ·
1 Parent(s): 49e8662

Update app.py

Browse files
Files changed (1) hide show
  1. app.py +16 -0
app.py CHANGED
@@ -102,7 +102,23 @@ This method introduces sampling control along three key dimensions:
102
  Among these, the novel **reasoning depth `H`** plays a critical role: by sampling outputs at different depths of partially completed reasoning chains, the model creates multiple sets of "fragmented thoughts + solutions," which are then jointly evaluated to select the most trustworthy outcome.
103
  """)
104
  gr.Image("figs/frac-frame.png", label="Framework", show_label=False, elem_id="my-img")
 
 
 
 
 
 
 
 
105
  gr.Image("figs/single.png", label="Framework", show_label=False, elem_id="my-img")
 
 
 
 
 
 
 
 
106
  gr.Image("figs/combine.png", label="Framework", show_label=False, elem_id="my-img")
107
  gr.Markdown(
108
  """
 
102
  Among these, the novel **reasoning depth `H`** plays a critical role: by sampling outputs at different depths of partially completed reasoning chains, the model creates multiple sets of "fragmented thoughts + solutions," which are then jointly evaluated to select the most trustworthy outcome.
103
  """)
104
  gr.Image("figs/frac-frame.png", label="Framework", show_label=False, elem_id="my-img")
105
+ gr.Markdown(
106
+ """
107
+ ### 🔍 Scaling Analysis of *n*, *m*, and *H* in DeepSeek-R1 Models
108
+
109
+ A detailed test-time scaling analysis on the DeepSeek-R1 series reveals the individual impact of the three sampling dimensions: `n` (number of reasoning paths), `m` (number of answers per path), and `H` (depth-wise reasoning samples).
110
+
111
+ Across multiple reasoning benchmarks, the results show that increasing **`H` — sampling across reasoning depths — yields the highest cost-effectiveness**. That is, sampling more intermediate answers along the depth of a single reasoning path leads to **greater accuracy improvements with fewer additional tokens**, compared to simply increasing the number of paths (`n`) or answers (`m`).
112
+ """)
113
  gr.Image("figs/single.png", label="Framework", show_label=False, elem_id="my-img")
114
+ gr.Markdown(
115
+ """
116
+ ### 🔄 Joint Sampling of *n*, *m*, and *H* for Enhanced Accuracy
117
+
118
+ In practical scenarios, the sampling dimensions `n`, `m`, and `H` can be **jointly optimized** rather than tuned in isolation. By **dynamically allocating the sampling budget across these dimensions**, the model can significantly enhance its reasoning accuracy.
119
+
120
+ This joint sampling strategy leverages the complementary strengths of each dimension—diversity (`n`), redundancy (`m`), and depth-awareness (`H`)—to achieve robust performance under a fixed token budget.
121
+ """)
122
  gr.Image("figs/combine.png", label="Framework", show_label=False, elem_id="my-img")
123
  gr.Markdown(
124
  """