Hugging Face for Legal

community

https://huggingface.co/HFforLegal

Activity Feed Request to join this org

AI & ML interests

Breaking the opacity of language models for legal professionals 📖 Join us by smashing the button at top right 🤗

Recent Activity

1024m authored a paper 6 days ago

Query Attribute Modeling: Improving search relevance with Semantic Search and Meta Data Filtering

alokabhishek authored a paper 6 days ago

Data and AI governance: Promoting equity, ethics, and fairness in large language models

1024m authored a paper 8 days ago

DSBC : Data Science task Benchmarking with Context engineering

View all activity

fdaudens

posted an update about 12 hours ago

Post

399

Want to learn to build an AI Agent? I put together a cookbook for creating your own news research agent with OpenAI GPT-OSS:

- Searches headlines & specific sites
- Pulls full articles when you need depth
- Summarizes with clickable sources
- Runs in a simple Gradio chat UI
- No GPU, no local setup — just open-weight GPT-OSS models via Hugging Face

If you’ve been wanting to try agents but weren’t sure where to start, this is an end-to-end example you can fork, run, and adapt.

Full guide + code https://huggingface.co/blog/fdaudens/openai-gpt-oss-agent-inference-providers

fdaudens

posted an update 2 days ago

Post

286

What can OpenAI’s new open models do with the news? I built a News Agent to find out.

It can answer questions about the news in real time, and every answer comes with original source links so you can dive deeper.

Ask it things like:
- "What are the top news stories today?"
- "What's the latest on artificial intelligence?"
- Follow-up questions on specific stories

Runs with Hugging Face inference providers, letting you compare results from the OpenAI 20B and 120B models

So far, I’m quite impressed by the capabilities of even the smaller 20B model. Definitely not a perfect project, but curious to hear your thoughts!

fdaudens/gpt-oss-news-agent

2 replies

fdaudens

posted an update 4 days ago

Post

3243

OpenAI’s GPT-OSS has sparked ~400 new models on Hugging Face and racked up 5M downloads in less than a week, already outpacing DeepSeek R1’s first-week numbers.

For comparison: when R1 launched, I tracked 550 derivatives (across 8 base models) in a week, with ~3M downloads. GPT-OSS is ahead on adoption and engagement.

It’s also the most-liked release of any major LLM this summer. The 20B and 120B versions quickly shot past Kimi K2, GLM 4.5, and others in likes.

Most-downloaded GPT-OSS models include LM Studio and Unsloth AI versions:
1️⃣ openai/gpt-oss-20b - 2.0M
2️⃣ lmstudio-community/gpt-oss-20b-MLX-8bit - 750K
3️⃣ openai/gpt-oss-120b - 430K
4️⃣ unsloth/gpt-oss-20b-GGUF - 380K
5️⃣ lmstudio-community/gpt-oss-20b-GGUF - 330K

The 20B version is clearly finding its audience, showing the power of smaller, faster, more memory- and energy-efficient models. (These numbers don’t include calls to the models via inference providers, so the real usage is likely even bigger, especially for the 120B version)

Open-weight models let anyone build on top. Empower the builders, and innovation takes off. 🚀

1 reply

1024m

authored a paper 6 days ago

Query Attribute Modeling: Improving search relevance with Semantic Search and Meta Data Filtering

Paper • 2508.04683 • Published 8 days ago

alokabhishek

authored a paper 6 days ago

Data and AI governance: Promoting equity, ethics, and fairness in large language models

Paper • 2508.03970 • Published 9 days ago • 1

clem

posted an update 7 days ago

Post

3176

Thread to gossip during the

openai GPT-5 livestream: https://www.youtube.com/watch?v=0Uu_VJeVVfo. Feel free to post your impressions below!

29 replies

1024m

authored a paper 8 days ago

DSBC : Data Science task Benchmarking with Context engineering

Paper • 2507.23336 • Published 15 days ago • 1

fdaudens

posted an update 9 days ago

Post

2570

Well, it took just 2 hours for openai/gpt-oss-120b to hit #1 on Hugging Face. Don’t remember seeing anything rise that fast!

1 reply

Tonic

posted an update 12 days ago

Post

3244

🫡 I am the first and only one to like the French Tax Code Dataset

that's it , that's the post

find the dataset here : louisbrulenaudet/code-impots
follow : @louisbrulenaudet

2 replies

AdinaY

posted an update 14 days ago

Post

1174

🔥 July highlights from Chinese AI community

zh-ai-community/july-2025-open-works-from-the-chinese-community-686586f1a8840797e477ae5a

✨ Another "DeepSeek moment" - Kimi K2 🙌

✨ Qwen goes fully matrixed - Instruct / Thinking / Coder models across 30B - 480B 🤯

✨ The multimodal wave🌊
- GLM-4.1V-Thinking: Image+Text > Text
- Intern-S1: Image+Text > Text
- Wan 2.2 - Text +Image > video
- Skywork-R1V3: Image+Text > Text
- Skywork-UniPic: Text > Image / Image > Text
- Tar-7B: Any-to-Any
- Ming-Lite-Omni-1.5: Any-to-Any
- Step3: Image+Text > Text
- HunyuanWorld-1: Image > 3D
- ThinkSound: Video > Audio
- Neta-Lumina: Text > Image

✨Tiny & deployable models 🤏
- SmallThinker runs on 1GB RAM

✨Agentic coding goes mainstream 💻
- Qwen3-Coder: fully spec'd tool calling
- GLM-4.5: browser agents, IDE assistant
- Qwen3 WebDev demo: text-to-frontend code

✨Domain-Specific & Utility Models/Tools/Dataset
- Science one S1: Scientific model
- Agentar DeepFinance: Finance dataset
- ObjectClear: Interactive Vision Tool
- Qwen3 MT Demo: Machine Translation Tool

✨ Big month not only for models, but for policy too🏛️
- Announced Global Action Plan for AI Governance
- Proposes to set up a World AI Cooperation Organization in Shanghai
- Released International AI Open Source Collaboration Initiative
- Published Risk Assessment Guidelines for Endpoint AI Agents

✨ Big event - WAIC
- 355K offline visitors
- 108 new released in 4 days
- 145 sessions across key domains

I’ve been tracking things closely, but July’s open-source wave still blew me away. Can’t wait to see what’s coming next! 🚀

AdinaY

posted an update 14 days ago

Post

1619

Qwen team did it again!!

They just released Qwen3-Coder-30B-A3B-Instruct on the hub🔥
Qwen/Qwen3-Coder-30B-A3B-Instruct

✨ Apache 2.0
✨30B total / 3.3B active (128 experts, 8 top-k)
✨ Native 256K context, extendable to 1M via Yarn
✨ Built for Agentic Coding

AdinaY

posted an update 15 days ago

Post

353

It’s here! After the WAIC announcement, StepFun has just dropped Step 3 🔥 their latest multimodal reasoning model on the hub.

Paper: Step-3 is Large yet Affordable: Model-system Co-design for Cost-effective Decoding (2507.19427)
Model: stepfun-ai/step3

✨ 321B total / 32B active - Apache 2.0
✨ MFA + AFD : cutting decoding cost by up to 70% vs. DeepSeek-V3
✨ 4T image-text pretraining: strong vision–language grounding
✨ Modular, efficient, deployable: runs on just 8×48GB GPUs

AdinaY

posted an update 15 days ago

Post

3518

Qwen3-30B-A3B-Thinking-2507 🔥 latest step in scaling thinking capabilities from Alibaba Qwen team.

Qwen/Qwen3-30B-A3B-Thinking-2507-FP8

✨ 30B total / 3B active - Apache 2.0
✨ Native 256K context
✨ SOTA coding, alignment, agentic reasoning

AdinaY

posted an update 16 days ago

Post

2713

Skywork UniPic 🔥a unified autoregressive multimodal model for image understanding, generation, & editing, by Skywork 天工

Skywork/skywork-unipic-6888c0789cdb82457b2acf32

✨ 1.5 B - MIT License
✨ Runs on RTX 4090
✨ Truly unified architecture

AdinaY

posted an update 16 days ago

Post

1719

Qwen just released Qwen3-30B-A3B-Instruct-2507 🔥 an upgrade to the non-thinking mode model

Qwen/Qwen3-30B-A3B-Instruct-2507

✨ 30B MoE / 3.3B active - Apache 2.0
✨ Strong gains in reasoning, math, coding, & multilingual tasks
✨ Native support for 256K long-context inputs

AdinaY

posted an update 18 days ago

Post

435

Wan2.2 🔥A video diffusion model with MoE just released by Alibaba_Wan

Wan-AI/Wan2.2-TI2V-5B
Wan-AI/Wan2.2-I2V-A14B-Diffusers

✨ 5B/14B - Apache2.0
✨ Cinematic-level aesthetics (lighting, tone, composition)
✨ Massive training data (+83% videos)→ smoother motion
✨ Supports image-only video generation, even without a prompt.

AdinaY

posted an update 18 days ago

Post

365

GLM-4.5 🔥 The largest open models yet from Zhipu.
Built for intelligent agents with unified capabilities: reasoning, coding, tool use.

zai-org/glm-45-687c621d34bda8c9e4bf503b

✨ 355B total / 32B active - MIT license
✨ Hybrid reasoning modes: Thinking mode for complex tasks/ Non-thinking mode for instant replies

AdinaY

posted an update 18 days ago

Post

324

Panshi 磐石 🪨 Scientific Foundation Model by the Chinese Academy of Sciences

ScienceOne-AI/S1-Base-8B
ScienceOne-AI/S1-Base-32B

✨ 8B/32B- Apache2.0
✨ Trained on scientific data & laws across math, physics, chemistry, bio, etc.
✨ Supports 300+ tools, 170M+ papers, autonomous scientific planning

3 replies

AdinaY

posted an update 18 days ago

Post

351

Tencent Hunyuan released their first 3D world model: Hunyuan World 1.0 🔥

tencent/HunyuanWorld-1

✨From a single prompt to explorable 3D scenes in minutes
✨ Supports Immersive roaming / Semantic-level interactivity / Physics-ready simulation

AdinaY

posted an update 21 days ago

Post

1708

Big respect to the Qwen team! They just dropped another model🔥

Qwen3-235B-A22B-Thinking-2507 🧠 new reasoning model by Qwen

Qwen/Qwen3-235B-A22B-Thinking-2507

✨ 235B total / 22B active (8 experts)
✨ 256K context window
✨ Agent-ready with tool use & <think> reasoning mode

Hope the team gets some well-deserved rest this weekend after all the massive releases 🙌

AI & ML interests

Recent Activity

Team members 83

HFforLegal's activity