AI & ML interests
Run open-source LLMs locally across CPUs and GPUs without changing the binary, powered by Rust and Wasm!
Organization Card
Run open-source LLMs and create OpenAI-compatible API services for the Llama2 series of LLMs locally with LlamaEdge!
Give it a try
Run a single command in your command line terminal.
bash <(curl -sSfL 'https://raw.githubusercontent.com/LlamaEdge/LlamaEdge/main/run-llm.sh') --interactive
Follow the on-screen instructions to install the WasmEdge Runtime and download your favorite open-source LLM. Then, choose whether you want to chat with the model via the CLI or via a web UI.
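If you choose the API-server mode, the service speaks the standard OpenAI wire format. A minimal sketch of the workflow (the model file name, the `--prompt-template` value, and the port are assumptions based on common LlamaEdge defaults; check the docs for your model):

```shell
# Launch the OpenAI-compatible server with WasmEdge (file names illustrative):
#   wasmedge --dir .:. --nn-preload default:GGML:AUTO:Llama-2-7b-chat-Q5_K_M.gguf \
#     llama-api-server.wasm --prompt-template llama-2-chat
# Any OpenAI-style client can then talk to it. Build a chat request payload:
PAYLOAD='{"model":"default","messages":[{"role":"user","content":"What is WasmEdge?"}]}'
echo "$PAYLOAD"
# With the server running, send it to the chat-completions route
# (port 8080 is an assumed default):
#   curl -s http://localhost:8080/v1/chat/completions \
#     -H 'Content-Type: application/json' -d "$PAYLOAD"
```

Because the API shape matches OpenAI's, existing SDKs and tools usually work by pointing their base URL at the local server.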
See it in action | GitHub | Docs
Why?
LlamaEdge, powered by Rust and WasmEdge, provides a strong alternative to Python in AI inference.
- Lightweight. The total runtime size is 30MB.
- Fast. Full native speed on GPUs.
- Portable. A single cross-platform binary runs on different CPUs, GPUs, and OSes.
- Secure. Sandboxed and isolated execution on untrusted devices.
- Container-ready. Supported in Docker, containerd, Podman, and Kubernetes.
Learn more
Please visit the LlamaEdge project to learn more.
models (295)
- second-state/Qwen3-Reranker-0.6B-GGUF
- second-state/Seed-OSS-36B-Instruct-GGUF (Text Generation, 36B)
- second-state/embeddinggemma-300m-GGUF (Sentence Similarity, 0.3B)
- second-state/NVIDIA-Nemotron-Nano-9B-v2-GGUF (Text Generation, 9B)
- second-state/Nemotron-Mini-4B-Instruct-GGUF (4B)
- second-state/jina-embeddings-v3-GGUF (0.6B)
- second-state/MiniCPM-V-4-GGUF (Visual Question Answering, 4B)
- second-state/MiniCPM-V-4_5-GGUF (Visual Question Answering, 8B)
- second-state/Qwen3-Coder-30B-A3B-Instruct-GGUF (Text Generation, 31B)
- second-state/gemma-3-270m-it-GGUF (Text Generation, 0.3B)
datasets (0): none public yet