Kyle O'Brien PRO

Kyle1668

https://kyobrien.io

AI & ML interests

Interpretability, model editing, alignment

Recent Activity

authored a paper 1 day ago

Composable Interventions for Language Models

authored a paper 1 day ago

Recite, Reconstruct, Recollect: Memorization in LMs as a Multifaceted Phenomenon

authored a paper 1 day ago

Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs

New activity in EleutherAI/deep-ignorance-unfiltered-cb 1 day ago

Improve model card: Add pipeline tag, library name, and explicit links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-e2e-strong-filter-cb 1 day ago

Improve model card: Add pipeline tag, library name, and explicit links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-unfiltered-cb-lat 1 day ago

Improve model card: Add pipeline tag, library, paper, and code links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-pretraining-stage-strong-filter 1 day ago

Improve model card: Add pipeline tag, library name, and explicit links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-unfiltered 1 day ago

Improve model card: Add pipeline tag, library, paper, project, and code links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-e2e-strong-filter-weak-knowledge-corrupted 1 day ago

Improve model card: Add pipeline tag, library name, and prominent links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-strong-filter-pt-weak-filter-anneal-cb-lat 1 day ago

Improve model card: Add pipeline tag, library name, and key resource links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-e2e-strong-filter 1 day ago

Improve model card: Add pipeline tag, library name, and explicit links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-strong-filter-pt-weak-filter-anneal 1 day ago

Improve model card: Add pipeline tag, library name, and correct links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-pretraining-stage-unfiltered 1 day ago

Improve model card: Add pipeline tag, library name, and explicit links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-strong-filter-pt-weak-filter-anneal-cb 1 day ago

Improve model card: Add metadata, paper/project/code links, and abstract

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-e2e-weak-filter 1 day ago

Improve model card: Add pipeline tag, library name, and links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-weak-filter-pt-strong-filter-anneal 1 day ago

Improve model card: Add pipeline tag, library name, and explicit links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-e2e-strong-filter-strong-knowledge-corrupted 1 day ago

Improve model card: Add pipeline tag, library name, paper abstract, and explicit links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-e2e-strong-filter-cb-lat 1 day ago

Improve model card: Add pipeline tag, library name, and paper/project/code links

#1 opened 1 day ago by

New activity in EleutherAI/deep-ignorance-pretraining-stage-weak-filter 1 day ago

Improve model card: Add pipeline tag, library, and explicit links

#1 opened 1 day ago by

New activity in toxigen/toxigen-data over 1 year ago

add prompts

#26 opened over 1 year ago by

Update Default Config 3

#25 opened over 1 year ago by

Delete Legacy Files

#22 opened over 1 year ago by

add prompts

#21 opened over 1 year ago by