Hugging Face
Dawei Zhu
dwzhu
15 followers · 6 following
dwzhu-pku
AI & ML interests
natural language processing
Recent Activity
Authored a paper about 12 hours ago: MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining
Upvoted a paper about 18 hours ago: MiMo: Unlocking the Reasoning Potential of Language Model -- From Pretraining to Posttraining
Authored a paper about 2 months ago: ConFiguRe: Exploring Discourse-level Chinese Figures of Speech
Papers (10)
arxiv:2505.07608
arxiv:2503.17407
arxiv:2502.13595
arxiv:2412.12706
Models (15)
dwzhu/e5rope-base • Sentence Similarity • Updated Sep 17, 2024 • 78 • 17
dwzhu/e5-base-4k • Sentence Similarity • Updated May 14, 2024 • 3.55k • 10
dwzhu/nomic-bert-2048 • Feature Extraction • Updated Feb 27, 2024 • 18
dwzhu/LLaMA2-7B-PoSE-YaRN-16k • Text Generation • Updated Nov 27, 2023 • 5 • 5
dwzhu/Baichuan2-7B-PoSE-YaRN-16k • Text Generation • Updated Oct 11, 2023 • 15 • 1
dwzhu/Baichuan2-7B-PoSE-NTK-16k • Text Generation • Updated Oct 11, 2023 • 5 • 1
dwzhu/LLaMA-7B-PoSE-Linear-16k • Text Generation • Updated Oct 11, 2023 • 6 • 1
dwzhu/LLaMA-7B-PoSE-YaRN-16k • Text Generation • Updated Oct 11, 2023 • 10 • 2
dwzhu/LLaMA-7B-PoSE-YaRN-96k • Text Generation • Updated Oct 11, 2023 • 5 • 1
dwzhu/LLaMA2-7B-PoSE-NTK-16k • Text Generation • Updated Oct 11, 2023 • 7 • 2
Datasets (3)
dwzhu/LongEmbed • Viewer • Updated Apr 21, 2024 • 29.6k • 614 • 7
dwzhu/needle_in_a_haystack_retrieval • Updated Mar 28, 2024 • 11 • 1
dwzhu/PoSE-Datasets • Preview • Updated Oct 26, 2023 • 45 • 7