Stefan Wiest

community

https://stefanwiest.de

stefanwiest

Activity Feed

AI & ML interests

AI Research & Engineering · Multi-Agent System Coordination

Recent Activity

skew202 updated a Space 2 days ago

stefanwiest/README

skew202 updated a collection 9 days ago

Agent Safety & Alignment

skew202 updated a collection 9 days ago

Agent Safety & Alignment

View all activity

stefanwiest 's collections 6

Multi-Agent Coordination & Signaling

Agent coordination resources: A2A, MCP, HCT signaling protocols, CAMEL, IoA orchestration patterns for multi-agent systems.

stefanwiest/hct-spec

Updated 10 days ago
Running

1

NerfStatus: HF Inference Monitor

📊

1
meta-llama/Llama-3.1-70B-Instruct

Text Generation • 71B • Updated Dec 15, 2024 • 505k • • 885
Qwen/Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Jan 12, 2025 • 6.02M • • 1.01k

Memory, Context & RAG

RAG architectures, hierarchical memory, semantic chunking, query rewriting, context compression for agents.

microsoft/ms_marco

Viewer • Updated Jan 4, 2024 • 1.11M • 12.9k • 220
sentence-transformers/all-nli

Viewer • Updated May 15, 2024 • 2.86M • 3.93k • 47
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Paper • 2005.11401 • Published May 22, 2020 • 14
Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models

Paper • 2402.04614 • Published Feb 7, 2024 • 3

Tool Use & Agent Execution

LLM tool use: Toolformer, ReAct, MCP protocol, dynamic tool selection, autonomous agent execution loops.

Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • 8B • Updated Jan 12, 2025 • 612k • • 592
meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 12M • • 5.25k
Toolformer: Language Models Can Teach Themselves to Use Tools

Paper • 2302.04761 • Published Feb 9, 2023 • 12
ReAct: Synergizing Reasoning and Acting in Language Models

Paper • 2210.03629 • Published Oct 6, 2022 • 31

LLM Reasoning & Planning Techniques

CoT, ToT, GoT, ReAct, ReWOO prompting techniques for LLM reasoning with implementation guidance and benchmarks.

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 379k • • 12.9k
Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • 33B • Updated Jan 12, 2025 • 373k • • 1.97k
google/gemma-2-27b-it

Text Generation • 27B • Updated Aug 27, 2024 • 539k • 556
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Paper • 2201.11903 • Published Jan 28, 2022 • 15

LLM Quality & Degradation Monitoring

LLM degradation detection, hallucination research, probe-based testing, model collapse, quality monitoring tools.

Running

1

NerfStatus: HF Inference Monitor

📊

1
meta-llama/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Oct 24, 2024 • 1.72M • • 1.93k
microsoft/Phi-3-mini-4k-instruct

Text Generation • 4B • Updated Dec 10, 2025 • 1.42M • 1.37k
Qwen/Qwen2.5-3B-Instruct

Text Generation • 3B • Updated Sep 25, 2024 • 10.6M • 363

Agent Safety & Alignment

AI agent security: prompt injection defense, jailbreak detection, guardrails, constitutional AI, zero trust architecture.

meta-llama/Llama-Guard-3-8B

Text Generation • 8B • Updated Oct 11, 2024 • 32.7k • • 257
Jailbroken: How Does LLM Safety Training Fail?

Paper • 2307.02483 • Published Jul 5, 2023 • 14
Constitutional AI: Harmlessness from AI Feedback

Paper • 2212.08073 • Published Dec 15, 2022 • 4
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations

Paper • 2312.06674 • Published Dec 7, 2023 • 8

Multi-Agent Coordination & Signaling

Agent coordination resources: A2A, MCP, HCT signaling protocols, CAMEL, IoA orchestration patterns for multi-agent systems.

stefanwiest/hct-spec

Updated 10 days ago
Running

1

NerfStatus: HF Inference Monitor

📊

1
meta-llama/Llama-3.1-70B-Instruct

Text Generation • 71B • Updated Dec 15, 2024 • 505k • • 885
Qwen/Qwen2.5-7B-Instruct

Text Generation • 8B • Updated Jan 12, 2025 • 6.02M • • 1.01k

LLM Reasoning & Planning Techniques

CoT, ToT, GoT, ReAct, ReWOO prompting techniques for LLM reasoning with implementation guidance and benchmarks.

deepseek-ai/DeepSeek-R1

Text Generation • 685B • Updated Mar 27, 2025 • 379k • • 12.9k
Qwen/Qwen2.5-Coder-32B-Instruct

Text Generation • 33B • Updated Jan 12, 2025 • 373k • • 1.97k
google/gemma-2-27b-it

Text Generation • 27B • Updated Aug 27, 2024 • 539k • 556
Chain-of-Thought Prompting Elicits Reasoning in Large Language Models

Paper • 2201.11903 • Published Jan 28, 2022 • 15

Memory, Context & RAG

RAG architectures, hierarchical memory, semantic chunking, query rewriting, context compression for agents.

microsoft/ms_marco

Viewer • Updated Jan 4, 2024 • 1.11M • 12.9k • 220
sentence-transformers/all-nli

Viewer • Updated May 15, 2024 • 2.86M • 3.93k • 47
Retrieval-Augmented Generation for Knowledge-Intensive NLP Tasks

Paper • 2005.11401 • Published May 22, 2020 • 14
Faithfulness vs. Plausibility: On the (Un)Reliability of Explanations from Large Language Models

Paper • 2402.04614 • Published Feb 7, 2024 • 3

LLM Quality & Degradation Monitoring

LLM degradation detection, hallucination research, probe-based testing, model collapse, quality monitoring tools.

Running

1

NerfStatus: HF Inference Monitor

📊

1
meta-llama/Llama-3.2-3B-Instruct

Text Generation • 3B • Updated Oct 24, 2024 • 1.72M • • 1.93k
microsoft/Phi-3-mini-4k-instruct

Text Generation • 4B • Updated Dec 10, 2025 • 1.42M • 1.37k
Qwen/Qwen2.5-3B-Instruct

Text Generation • 3B • Updated Sep 25, 2024 • 10.6M • 363

Tool Use & Agent Execution

LLM tool use: Toolformer, ReAct, MCP protocol, dynamic tool selection, autonomous agent execution loops.

Qwen/Qwen2.5-Coder-7B-Instruct

Text Generation • 8B • Updated Jan 12, 2025 • 612k • • 592
meta-llama/Llama-3.1-8B-Instruct

Text Generation • 8B • Updated Sep 25, 2024 • 12M • • 5.25k
Toolformer: Language Models Can Teach Themselves to Use Tools

Paper • 2302.04761 • Published Feb 9, 2023 • 12
ReAct: Synergizing Reasoning and Acting in Language Models

Paper • 2210.03629 • Published Oct 6, 2022 • 31

Agent Safety & Alignment

AI agent security: prompt injection defense, jailbreak detection, guardrails, constitutional AI, zero trust architecture.

meta-llama/Llama-Guard-3-8B

Text Generation • 8B • Updated Oct 11, 2024 • 32.7k • • 257
Jailbroken: How Does LLM Safety Training Fail?

Paper • 2307.02483 • Published Jul 5, 2023 • 14
Constitutional AI: Harmlessness from AI Feedback

Paper • 2212.08073 • Published Dec 15, 2022 • 4
Llama Guard: LLM-based Input-Output Safeguard for Human-AI Conversations

Paper • 2312.06674 • Published Dec 7, 2023 • 8

AI & ML interests

Recent Activity

Team members 1

stefanwiest 's collections 6

NerfStatus: HF Inference Monitor

NerfStatus: HF Inference Monitor

NerfStatus: HF Inference Monitor

NerfStatus: HF Inference Monitor