
Cheng Rui

postitive666
·
  • positive666

AI & ML interests

None yet

Organizations

None yet

New activity in Tengyunw/qwen3_8b_eagle3 3 months ago

Why is the TPS of eagle3-qwen lower than that of the original Qwen3 under SGLang inference on a single H20 card once the decoding algorithm is added?

1
#8 opened 3 months ago by
postitive666
New activity in fblgit/cybertron-v4-qw7B-MGS about 1 year ago

Some questions about post-training (function calling)

5
#4 opened about 1 year ago by
postitive666
New activity in rombodawg/Rombos-LLM-V2.5-Qwen-7b about 1 year ago

After model fusion, did you continue with finetuning?

6
#5 opened about 1 year ago by
postitive666
commented on 2 papers over 1 year ago

Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection

Paper • 2310.11511 • Published Oct 17, 2023 • 78 •
6
