I'm a postdoc at Mila supervised by Yoshua Bengio. I'm currently the Scientific Lead of the first International AI Safety Report, a project backed by 33 nations and intergovernmental organizations.

My research in machine learning covers risk management, LLM honesty, health applications, and data selection for large-scale deep learning. Across these areas, my lead-author publications have been covered by TV and newspapers, including The Guardian and Time, while others have been discussed by ministers or incorporated into national legislation.

Before joining Mila, I did my PhD at the University of Oxford under Yarin Gal, funded by Google DeepMind. I also worked on learning human preferences and game-theoretical machine learning with David Duvenaud and Roger Grosse at Toronto’s Vector Institute and UC Berkeley, and with the Centre for the Governance of AI at Oxford. I studied machine learning (UCL), maths (Amsterdam) and Future Planet Studies (Amsterdam).

Contact me

soeren.mindermann ατ gmail.com

Publications as first author

*equal contribution to first authorship
International AI Safety Report (interim + final reports)
Yoshua Bengio (Chair), Sören Mindermann (Scientific Lead), Daniel Privitera (Lead Writer), Tamay Besiroglu, Rishi Bommasani, Stephen Casper, Yejin Choi, Philip Fox, Ben Garfinkel, Danielle Goldfarb, Hoda Heidari, Anson Ho, Sayash Kapoor, Leila Khalatbari, Shayne Longpre, Sam Manning, Vasilios Mavroudis, Mantas Mazeika, Julian Michael, Jessica Newman, Kwan Yee Ng, Chinasa T. Okolo, Deborah Raji, Girish Sastry, Elizabeth Seger, Theodora Skeadas, Tobin South, Emma Strubell, Florian Tramèr, Lucia Velasco, Nicole Wheeler, Daron Acemoglu, Olubayo Adekanmbi, David Dalrymple, Thomas G. Dietterich, Edward W. Felten, Pascale Fung, Pierre-Olivier Gourinchas, Fredrik Heintz, Geoffrey Hinton, Nick Jennings, Andreas Krause, Susan Leavy, Percy Liang, Teresa Ludermir, Vidushi Marda, Helen Margetts, John McDermid, Jane Munga, Arvind Narayanan, Alondra Nelson, Clara Neppel, Alice Oh, Gopal Ramchurn, Stuart Russell, Marietje Schaake, Bernhard Schölkopf, Dawn Song, Alvaro Soto, Lee Tiedrich, Gaël Varoquaux, Andrew Yao, Ya-Qin Zhang, Fahad Albalawi, Marwan Alserkal, Olubunmi Ajala, Guillaume Avrin, Christian Busch, André Carlos Ponce de Leon Ferreira de Carvalho, Bronwyn Fox, Amandeep Singh Gill, Ahmet Halit Hatip, Juha Heikkilä, Gill Jolly, Ziv Katzir, Hiroaki Kitano, Antonio Krüger, Chris Johnson, Saif M. Khan, Kyoung Mu Lee, Dominic Vincent Ligot, Oleksii Molchanovskyi, Andrea Monti, Nusu Mwamanzi, Mona Nemer, Nuria Oliver, José Ramón López Portillo, Balaraman Ravindran, Raquel Pezoa Rivera, Hammam Riza, Crystal Rugege, Ciarán Seoighe, Jerry Sheehan, Haroon Sheikh, Denise Wong, Yi Zeng
2025
JM Brauner*, S Mindermann*, M Sharma*, D Johnston, J Salvatier, ...
Science, 2021
Sören Mindermann*, Muhammed Razzak*, Winnie Xu*, Andreas Kirsch, Mrinank Sharma, Aidan Gomez, Sebastian Farquhar, Jan Brauner, Yarin Gal
ICML, 2022
M Sharma*, S Mindermann*, C Rogers-Smith, G Leech, B Snodin, J Ahuja, ...
Nature Communications, 2021
S Mindermann*, R Shah*, A Gleave, D Hadfield-Menell
ICML workshop Goals in Reinforcement Learning, 2018
A Jesson*, S Mindermann*, U Shalit, Y Gal
NeurIPS, 2020
S Mishra*, S Mindermann*, M Sharma*, C Whittaker*, T Mellan, T Wilton, ...
The Lancet: EClinicalMedicine, 2021
M Sharma*, S Mindermann*, J Brauner*, G Leech, A Stephenson, ...
NeurIPS (Spotlight talk), 2020


Publications as senior author

*equal contribution to senior authorship
Yoshua Bengio, Geoffrey Hinton, Andrew Yao, Dawn Song, Pieter Abbeel, Yuval Noah Harari, Ya-Qin Zhang, Lan Xue, Shai Shalev-Shwartz, Gillian Hadfield, Jeff Clune, Tegan Maharaj, Frank Hutter, Atılım Güneş Baydin, Sheila McIlraith, Qiqi Gao, Ashwin Acharya, David Krueger, Anca Dragan, Philip Torr, Stuart Russell, Daniel Kahneman, Jan Brauner*, Sören Mindermann*
(I'm not as 'senior' as the other authors here ;) but this is based on leading the project)
Science, 2024
G Leech, C Rogers-Smith, J Sandbrink, B Snodin, R Zinkov, B Rader, J Brownstein, Y Gal, S Bhatt*, M Sharma*, S Mindermann*, J Brauner*, L Aitchison*
Proceedings of the National Academy of Sciences (PNAS), 2022
G Altman, J Ahuja, JT Monrad, G Dhaliwal, C Rogers-Smith, G Leech, B Snodin, JB Sandbrink, L Finnveden, AJ Norman, SB Oehm, JF Sandkühler, J Kulveit, S Flaxman, Y Gal, S Mishra, S Bhatt, M Sharma*, S Mindermann*, J Brauner*
Nature Scientific Data, 2022


Publications as co-author

R Ngo, L Chan, S Mindermann
International Conference on Learning Representations, 2024
Evan Hubinger, Carson Denison, (many others) ... Sören Mindermann, Ryan Greenblatt, Buck Shlegeris, Nicholas Schiefer, Ethan Perez
arXiv, 2024
L Pacchiardi, AJ Chan, S Mindermann, I Moscovitz, AY Pan, Y Gal, O Evans, J Brauner
International Conference on Learning Representations, 2024
S Kundu, Y Bai, (many others) ... S Mindermann, N Joseph, S McCandlish, J Kaplan
arXiv, 2024
A Jesson, S Mindermann, Y Gal, U Shalit
International Conference on Machine Learning, 2021
G Meyerowitz-Katz, S Bhatt, O Ratmann, JM Brauner, S Flaxman, S Mishra, M Sharma, S Mindermann, V Bradley, M Vollmer, L Merone, G Yamey
BMJ Global Health, 2021
Tomáš Gavenčiak, Joshua Teperowski Monrad, Gavin Leech, Mrinank Sharma, Sören Mindermann, Jan Markus Brauner, Samir Bhatt, Jan Kulveit
PLOS Computational Biology, 2022

Policy impact

TV and newspaper interviews

Invited talks