yuhuixu commited on
Commit
873b4f8
Β·
verified Β·
1 Parent(s): 6fe96dd

Update app.py

Browse files
Files changed (1) hide show
  1. app.py +4 -3
app.py CHANGED
@@ -42,14 +42,15 @@ training.
42
  <img src="figs/framework.png" width="80%" />
43
  </p>
44
  """)
45
- gr.Markdown(
46
  """
47
  **Main Takeaways**
48
  1. βœ‚οΈ Thinking + Solution are explicitly separated with independent budgets β€” boosting reliability under tight compute constraints.
49
  2. 🧠 Budget-Constrained Rollout: We train models to handle truncated reasoning using GRPO.
50
  3. πŸ“ˆ Flexible scalability: Robust performance across diverse inference budgets on reasoning benchmarks like AIME and LiveCodeBench.
51
  4. βš™οΈ Better performance with fewer tokens: Our trained model generates outputs that are 30% shorter while maintaining (or even improving) accuracy.
52
-
 
53
  <p align="center">
54
  <img src="figs/aime.png" width="46%" />
55
  <img src="figs/livecode.png" width="48%" />
@@ -59,7 +60,7 @@ gr.Markdown(
59
  <img src="figs/codetable.png" width="90%" />
60
  </p>
61
  """)
62
- gr.Markdown(
63
  """
64
  ## Citation
65
 
 
42
  <img src="figs/framework.png" width="80%" />
43
  </p>
44
  """)
45
+ gr.Markdown(
46
  """
47
  **Main Takeaways**
48
  1. βœ‚οΈ Thinking + Solution are explicitly separated with independent budgets β€” boosting reliability under tight compute constraints.
49
  2. 🧠 Budget-Constrained Rollout: We train models to handle truncated reasoning using GRPO.
50
  3. πŸ“ˆ Flexible scalability: Robust performance across diverse inference budgets on reasoning benchmarks like AIME and LiveCodeBench.
51
  4. βš™οΈ Better performance with fewer tokens: Our trained model generates outputs that are 30% shorter while maintaining (or even improving) accuracy.
52
+ """)
53
+ gr.HTML("""
54
  <p align="center">
55
  <img src="figs/aime.png" width="46%" />
56
  <img src="figs/livecode.png" width="48%" />
 
60
  <img src="figs/codetable.png" width="90%" />
61
  </p>
62
  """)
63
+ gr.Markdown(
64
  """
65
  ## Citation
66