Jonathan Mitchell
jmitch5
AI & ML interests
Generative Modeling
Recent Activity
commented on
an
article
about 2 months ago
Efficient LLM Pretraining: Packed Sequences and Masked Attention