✨ The multimodal wave 🌊
- GLM-4.1V-Thinking: Image+Text > Text
- Intern-S1: Image+Text > Text
- Wan 2.2: Text+Image > Video
- Skywork-R1V3: Image+Text > Text
- Skywork-UniPic: Text > Image / Image > Text
- Tar-7B: Any-to-Any
- Ming-Lite-Omni-1.5: Any-to-Any
- Step3: Image+Text > Text
- HunyuanWorld-1: Image > 3D
- ThinkSound: Video > Audio
- Neta-Lumina: Text > Image
✨ Big month not only for models, but for policy too 🏛️
- Announced the Global Action Plan for AI Governance
- Proposed setting up a World AI Cooperation Organization in Shanghai
- Released the International AI Open Source Collaboration Initiative
- Published Risk Assessment Guidelines for Endpoint AI Agents
✨ Big event - WAIC
- 355K offline visitors
- 108 new releases in 4 days
- 145 sessions across key domains
I’ve been tracking things closely, but July’s open-source wave still blew me away. Can’t wait to see what’s coming next! 🚀
✨ 321B total / 32B active params - Apache 2.0
✨ MFA + AFD: cutting decoding cost by up to 70% vs. DeepSeek-V3
✨ 4T image-text pretraining: strong vision–language grounding
✨ Modular, efficient, deployable: runs on just 8×48GB GPUs
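The "321B total params on just 8×48GB GPUs" claim is easy to sanity-check with back-of-the-envelope arithmetic. A minimal sketch, assuming 1 byte per parameter (FP8 weights; the serving precision is my assumption, not stated above):

```python
def weight_memory_gb(n_params_billion: float, bytes_per_param: float) -> float:
    """Approximate GPU memory needed just to hold the model weights.
    n_params_billion * 1e9 params * bytes_per_param / 1e9 bytes-per-GB."""
    return n_params_billion * bytes_per_param

# Assumed FP8 serving: 1 byte per parameter.
total_weights_gb = weight_memory_gb(321, 1.0)   # ~321 GB for all weights
cluster_vram_gb = 8 * 48                        # 8 x 48GB GPUs = 384 GB total

print(total_weights_gb)                  # 321.0
print(total_weights_gb <= cluster_vram_gb)  # True
```

At FP8 the full 321B weights (~321 GB) fit inside the 384 GB of aggregate VRAM, with ~63 GB left over for KV cache and activations; at BF16 (2 bytes/param) they would not, which is why low-precision serving matters for a claim like this. Note that only the 32B active parameters are exercised per token, which is what drives the decoding-cost savings, not the memory footprint.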