Apart Lab
Support score: 0Organization
Apart Lab collaborates with scholars in AI safety to publish technical research into safer and more aligned machine learning systems.
Recent accepted papers from the Apart Lab
Neuron to Graph: Interpreting Language Model Neurons at Scale
Alex Foote, Neel Nanda, Esben Kran, Ionnis Konstas, Shay Cohen, Fazl Barez
Read paper | Visit project site
May 5, 2023 | RTML workshop at ICLR 2023
Detecting Edit Failures In Large Language Models: An Improved Specificity Benchmark
Jason Hoelscher-Obermaier, Julia Persson, Esben Kran, Ionnis Konstas, Fazl Barez
July 10, 2023 | ACL 2023
No comments yet