wujun commited on
Commit
18d999c
·
1 Parent(s): 8e64547

update link of gg_init in README.md

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -12,7 +12,7 @@ short_description: The code of gg prior.
12
  # Introduction
13
  - Reference: It Takes a Good Model to Train a Good Model: Generalized Gaussian Priors for Optimized LLMs
14
  - Authors: Jun Wu, Yirong Xiong, Jiangtao Wen, Yuxing Han
15
- - Paper Link: [https://services.arxiv.org/html/submission/6499264/view](https://services.arxiv.org/html/submission/6499264/view)
16
 
17
  This repository provides a complete implementation of the methods described in the corresponding paper. Specifically, we implement the Generalized Gaussian Initialization, DeepShape, and the RF8 floating-point format as proposed in the paper. Furthermore, we adapt and reproduce the BackSlash training algorithm, and incorporate it seamlessly into our framework based on generalized Gaussian priors.
18
 
 
12
  # Introduction
13
  - Reference: It Takes a Good Model to Train a Good Model: Generalized Gaussian Priors for Optimized LLMs
14
  - Authors: Jun Wu, Yirong Xiong, Jiangtao Wen, Yuxing Han
15
+ - Paper Link: [https://arxiv.org/abs/2506.00486](https://arxiv.org/abs/2506.00486)
16
 
17
  This repository provides a complete implementation of the methods described in the corresponding paper. Specifically, we implement the Generalized Gaussian Initialization, DeepShape, and the RF8 floating-point format as proposed in the paper. Furthermore, we adapt and reproduce the BackSlash training algorithm, and incorporate it seamlessly into our framework based on generalized Gaussian priors.
18