Spaces:

shifeng3711
/

gg_prior

Running

wujun commited on 13 days ago

Commit

18d999c

1 Parent(s): 8e64547

update link of gg_init in README.md

Files changed (1) hide show

README.md CHANGED Viewed

@@ -12,7 +12,7 @@ short_description: The code of gg prior.
 # Introduction
 - Reference: It Takes a Good Model to Train a Good Model: Generalized Gaussian Priors for Optimized LLMs
 - Authors: Jun Wu, Yirong Xiong, Jiangtao Wen, Yuxing Han
-- Paper Link: [https://services.arxiv.org/html/submission/6499264/view](https://services.arxiv.org/html/submission/6499264/view)
 This repository provides a complete implementation of the methods described in the corresponding paper. Specifically, we implement the Generalized Gaussian Initialization, DeepShape, and the RF8 floating-point format as proposed in the paper. Furthermore, we adapt and reproduce the BackSlash training algorithm, and incorporate it seamlessly into our framework based on generalized Gaussian priors.

 # Introduction
 - Reference: It Takes a Good Model to Train a Good Model: Generalized Gaussian Priors for Optimized LLMs
 - Authors: Jun Wu, Yirong Xiong, Jiangtao Wen, Yuxing Han
+- Paper Link: [https://arxiv.org/abs/2506.00486](https://arxiv.org/abs/2506.00486)
 This repository provides a complete implementation of the methods described in the corresponding paper. Specifically, we implement the Generalized Gaussian Initialization, DeepShape, and the RF8 floating-point format as proposed in the paper. Furthermore, we adapt and reproduce the BackSlash training algorithm, and incorporate it seamlessly into our framework based on generalized Gaussian priors.