evanhanders's picture
pushes all monosemantic models
1ca938b verified
raw
history blame contribute delete
77 Bytes
act_fn: relu
d_head: 8
d_model: 32
d_vocab: 5
n_ctx: 15
n_layers: 3
seed: 42