17 13 14

kas

shing3232

AI & ML interests

None yet

Recent Activity

liked a Space 1 day ago

akhaliq/voxel-deepseek-terminus

liked a model 6 days ago

Aleph-Alpha/llama-tfree-hat-pretrained-7b-dpo

new activity about 1 month ago

deepseek-ai/DeepSeek-V3.1:tool call for reasoning mode

View all activity

Organizations

None yet

Collections 1

spaces 1

No application file

Qwen2 Sakura

😻

models 9

datasets 2

shing3232/dataset_imatrix

Viewer • Updated Jan 15, 2024 • 1 • 9

shing3232/imatrix

Updated Jan 15, 2024 • 11

kas

AI & ML interests

Recent Activity

Organizations

Collections 1

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Beyond Language Models: Byte Models are Digital World Simulators

BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Kijai/PrecompiledWheels

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Beyond Language Models: Byte Models are Digital World Simulators

BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs

Kijai/PrecompiledWheels

spaces 1

Qwen2 Sakura

models 9

shing3232/Sakura-1.5B-Qwen2.5-v1.0-GGUF-IMX

shing3232/sakura-14b-qwen2beta-v0.9.2-IMX

shing3232/Sakura13B-LNovel-v0.9-qwen1.5-GGUF-IMX

shing3232/Sakura1.8B-LNovel-v0.9pre2-qwen1_GGUF-IMX

shing3232/Sakura13B-LNovel-v0.9b-GGUF-IMX-2.33_re

shing3232/Sakura1.8B-LNovel-v0.9-qwen1.5_GGUF-IMX_re

shing3232/Sakura13B-LNovel-v0.9b-GGUF-IMX-2.33

shing3232/Sakura-LNovel-v0.9b-GGUF-IMX-JPZH

shing3232/Sakura-13B-LNovel-v0.9b-GGUF-IMX-wikitest

datasets 2

shing3232/dataset_imatrix

shing3232/imatrix

kas

AI & ML interests

Recent Activity

Organizations

Collections 1

spaces 1

Qwen2 Sakura

models 9 Sort: Recently updated

datasets 2 Sort: Recently updated

models 9

datasets 2