Apart Lab

GiveWiki
 Accepting donations
$0
0 donations
Support score: 0Organization

Apart Lab collaborates with scholars in AI safety to publish technical research into safer and more aligned machine learning systems.

Recent accepted papers from the Apart Lab

Neuron to Graph: Interpreting Language Model Neurons at Scale

Alex Foote, Neel Nanda, Esben Kran, Ionnis Konstas, Shay Cohen, Fazl Barez

Read paperVisit project site

May 5, 2023  |  RTML workshop at ICLR 2023

Detecting Edit Failures In Large Language Models: An Improved Specificity Benchmark

Jason Hoelscher-Obermaier, Julia Persson, Esben Kran, Ionnis Konstas, Fazl Barez

Visit project site

July 10, 2023  |  ACL 2023

0
0