Doge family of small language models!

Doge Face
community
AI & ML interests
A Family of Dynamic UltraFast Small Language Models Ready for Embodied Artificial General Intelligence!
Recent Activity
Organization Card

SmallDoge
Welcome to SmallDoge, where we pioneer the development of compact, high-performance small language models. Our focus is on creating ultra-fast SLMs using innovative dynamic algorithms. Committed to transparency and collaboration, all our training details and code are openly accessible on the SmallDoge GitHub repository.
Our Mission: To democratize access to advanced AI by developing efficient, open-source small language models that empower a wide range of applications and research.
Join our community on Discord!
Explore Our Projects
We offer a suite of resources and models:
- Small-Doges: A versatile series of SLMs, including pre-trained base models, supervised fine-tuned models, and models enhanced with reinforcement learning.
- Doge-CheckPoints: A collection of model checkpoints designed for seamless continued training on new datasets, ensuring smoother adaptation and minimizing training instability.
- Small-Datasets: Curated, multi-stage, high-quality datasets specifically engineered to effectively train small language models, boosting their capabilities and helpfulness.
- Doge-Downstream-Applications: A selection of SLMs optimized for various downstream tasks and real-world applications.
Collections
4
models
81

SmallDoge/Qwen2.5-14b-llmlingua-50
Text Generation
•
Updated
•
7

SmallDoge/Qwen2.5-14b-budget2048
Text Generation
•
Updated
•
2

SmallDoge/Qwen2.5-math-7b-budget2048
Text Generation
•
Updated
•
3

SmallDoge/Llama3.1-8b-110k
Text Generation
•
Updated
•
3

SmallDoge/Qwen2.5-math-14b-llmlingua-90
Text Generation
•
Updated
•
7

SmallDoge/Qwen2.5-math-7b-llmlingua-90
Text Generation
•
Updated
•
5

SmallDoge/Qwen2.5-math-7b-llmlingua-50
Text Generation
•
Updated
•
3

SmallDoge/Doge2-175M-checkpoint
Text Generation
•
Updated
•
7

SmallDoge/Doge2-tokenizer
Updated
•
1

SmallDoge/Qwen2.5-math-7b-chain-of-draft25k
Text Generation
•
Updated
•
14
datasets
32
SmallDoge/SmallThoughts
Viewer
•
Updated
•
102k
•
370
•
44
SmallDoge/Doge2-tokenizer-samples
Viewer
•
Updated
•
2M
•
83
SmallDoge/DMA-Pretrain
Viewer
•
Updated
•
17M
•
232
SmallDoge/smallcorpus
Viewer
•
Updated
•
69M
•
650
•
2
SmallDoge/CoD-25K
Viewer
•
Updated
•
25k
•
143
SmallDoge/SmallTalks
Viewer
•
Updated
•
4.48M
•
3.93k
•
9
SmallDoge/MiniCorpus
Viewer
•
Updated
•
3.4M
•
248
SmallDoge/OpenThoughts-920K
Viewer
•
Updated
•
927k
•
119
SmallDoge/OpenR1-Math-DPO
Viewer
•
Updated
•
88.5k
•
47
SmallDoge/SmallThoughts-25K
Viewer
•
Updated
•
25k
•
69