AI & ML interests

LLM

Recent Activity

singularitys0  updated a Space 4 days ago
fnlp/MOSS-Speech
Phospheneser  updated a collection 6 days ago
MOSS-Speech
View all activity

fnlp 's collections 7

MHA2MLA-refactor
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"
MHA2MLA-refactor
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"
MHA2MLA
The MHA2MLA model published in the paper "Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-Based LLMs"