jln.shk_64x4: JSAE (resid_mid -> resid_out) 7.5k steps
j.mlp_layer.shk_64x4: JSAE (mlp_in -> mlp_out) 7.5k steps
David Quarel
davidquarel
·
AI & ML interests
None yet
Recent Activity
updated
a model
4 days ago
davidquarel/jaxgmg_ckpt_pt
published
a model
5 days ago
davidquarel/jaxgmg_ckpt_pt
updated
a model
13 days ago
davidquarel/jaxgmg_checkpoints