yuhuixu commited on
Commit
49e8662
·
verified ·
1 Parent(s): b863e37

Update app.py

Browse files
Files changed (1) hide show
  1. app.py +11 -9
app.py CHANGED
@@ -89,15 +89,17 @@ training.
89
  """
90
 
91
  ## Introduction
92
- We propose **Elastic Reasoning**, a novel framework for scalable chain of thoughts
93
- that explicitly separates reasoning into two phases—`thinking and solution`—with
94
- independently allocated budgets. At test time, Elastic Reasoning prioritize that
95
- completeness of solution segments, significantly improving reliability under tight
96
- resource constraints. To train models that are robust to truncated thinking, we
97
- introduce a lightweight `budget-constrained rollout` strategy, integrated into GRPO,
98
- which teaches the model to reason adaptively when the thinking process is cut
99
- short and generalizes effectively to unseen budget constraints without additional
100
- training.
 
 
101
  """)
102
  gr.Image("figs/frac-frame.png", label="Framework", show_label=False, elem_id="my-img")
103
  gr.Image("figs/single.png", label="Framework", show_label=False, elem_id="my-img")
 
89
  """
90
 
91
  ## Introduction
92
+ Building upon the same core insight as **Elastic Reasoning**—that correct answers can often be derived without waiting for a full chain-of-thought (CoT)—**Fractured Sampling** shifts focus to the **sampling strategy** of reasoning.
93
+
94
+ Instead of relying on complete, uninterrupted reasoning sequences, Fractured Sampling **breaks the CoT along the temporal dimension**, exploring whether it's possible to "get the right answer without thinking all the way through."
95
+
96
+ This method introduces sampling control along three key dimensions:
97
+
98
+ - **Solution Diversity (m) sampling multiple final outputs from a single reasoning trace.
99
+ - **Trajectory Diversity (n) sampling multiple independent reasoning traces with different seeds (vanilla CoT sampling).
100
+ - **Reasoning Depth Diversity (H) — sampling at different intermediate stages of a single reasoning trace.
101
+
102
+ Among these, the novel **reasoning depth `H`** plays a critical role: by sampling outputs at different depths of partially completed reasoning chains, the model creates multiple sets of "fragmented thoughts + solutions," which are then jointly evaluated to select the most trustworthy outcome.
103
  """)
104
  gr.Image("figs/frac-frame.png", label="Framework", show_label=False, elem_id="my-img")
105
  gr.Image("figs/single.png", label="Framework", show_label=False, elem_id="my-img")