·
AI & ML interests
NER, LLM
Organizations
aalexchengg/gpt-oss-20b-Math10K-diff-info-Distill-forward-kl-32-experts
Updated
aalexchengg/gpt-oss-20b-Math10K-diff-info-Distill-forward-kl-28-experts
Updated
aalexchengg/gpt-oss-20b-Math10K-diff-info-Distill-forward-kl-24-experts
Updated
aalexchengg/gpt-oss-20b-Math10K-diff-info-Distill-forward-kl-20-experts
Updated
aalexchengg/gpt-oss-20b-Math10K-diff-info-Distill-forward-kl-16-experts
Updated
aalexchengg/gpt-oss-20b-Math10K-diff-info-Distill-forward-kl-12-experts
Updated
aalexchengg/gpt-oss-20b-Math10K-diff-info-Distill-forward-kl-8-experts
Updated
aalexchengg/gpt-oss-20b-Math10K-diff-info-Distill-forward-kl-4-experts
Updated
aalexchengg/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-20-experts-debugger
Updated
aalexchengg/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-6-experts-debugger
Updated
aalexchengg/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-32-experts-debugger
Updated
aalexchengg/OLMoE-Math10K-Distill-6-experts-test
Updated
aalexchengg/Deepseek-Coder-V2-Lite-13B-Instruct-Math10K-Distill-6-experts-test
Updated
aalexchengg/compute_loss_model
Text Generation
•
Updated
•
5
aalexchengg/train_step_model
Text Generation
•
Updated
•
3
aalexchengg/Deepseek-Coder-V2-Lite-13B-Instruct-Open-R1-Distill
Updated
Text Generation
•
0.5B
•
Updated
•
23
•
Sentence Similarity
•
0.1B
•
Updated
•
5