-
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits
Paper • 2402.17764 • Published • 625 -
Beyond Language Models: Byte Models are Digital World Simulators
Paper • 2402.19155 • Published • 53 -
BitNet v2: Native 4-bit Activations with Hadamard Transformation for 1-bit LLMs
Paper • 2504.18415 • Published • 47 -
Kijai/PrecompiledWheels
Updated • 51
kas
shing3232
AI & ML interests
None yet
Recent Activity
liked
a Space
1 day ago
akhaliq/voxel-deepseek-terminus
liked
a model
6 days ago
Aleph-Alpha/llama-tfree-hat-pretrained-7b-dpo
new activity
about 1 month ago
deepseek-ai/DeepSeek-V3.1:tool call for reasoning mode
Organizations
None yet