evanhanders's picture
pushes all monosemantic models
1ca938b verified
raw
history blame contribute delete
77 Bytes
act_fn: relu
d_head: 5
d_model: 20
d_vocab: 5
n_ctx: 15
n_layers: 2
seed: 42