kuleshov-group/caduceus-ph_seqlen-1k_d_model-118_n_layer-4_lr-8e-3 Fill-Mask • 471k • Updated Oct 20, 2025 • 24 • 1
kuleshov-group/caduceus-ph_seqlen-1k_d_model-256_n_layer-4_lr-8e-3 Fill-Mask • 1.93M • Updated Oct 20, 2025 • 63 • 1
kuleshov-group/caduceus-ph_seqlen-131k_d_model-256_n_layer-16 Fill-Mask • 7.73M • Updated Oct 20, 2025 • 1.42k • 6
kuleshov-group/caduceus-ps_seqlen-1k_d_model-118_n_layer-4_lr-8e-3 Fill-Mask • 471k • Updated Oct 20, 2025 • 37 • 1
kuleshov-group/caduceus-ps_seqlen-1k_d_model-256_n_layer-4_lr-8e-3 Fill-Mask • 1.93M • Updated Oct 20, 2025 • 30 • 2
kuleshov-group/caduceus-ps_seqlen-131k_d_model-256_n_layer-16 Fill-Mask • 7.73M • Updated Oct 20, 2025 • 1.82k • 14
kuleshov-group/bd3lm-owt-block_size1024-pretrain Text Generation • 0.2B • Updated Mar 18, 2025 • 536 • 1