Contact
Universitätsstraße 32
70569 Stuttgart
Deutschland
Room: 00.123
Subject
- AI Safety
- Machine Behavior / Machine Psychology
- Responsible AI
A complete list of my publications as well as books can be found here. The following is a selection:
- Jotautaitė, Monika; Caviola, Lucius; Brewster, David A.; Hagendorff, Thilo (2025): Speciesism in AI: Evaluating Discrimination Against Animals in Large Language Models. In arXiv:2508.11534, pp. 1–29. (Link)
- Hagendorff, Thilo; Derner, Erik; Oliver, Nuria (2025): Large Reasoning Models Are Autonomous Jailbreak Agents. In arXiv:2508.04039, pp. 1–54. (Link)
- Hagendorff, Thilo (2025): On the Inevitability of Left-Leaning Political Bias in Aligned Language Models. In arXiv:2507.15328, pp. 1–11. (Link)
- Hagendorff, Thilo; Fabi, Sarah (2025): Beyond Chains of Thought: Benchmarking Latent-Space Reasoning Abilities in Large Language Models. In arXiv:2504.10615, pp. 1–12. (Link)
- Vaugrante, Laurène; Carlon, Francesca; Menke, Maluna; Hagendorff, Thilo (2025): Compromising Honesty and Harmlessness in Language Models via Deception Attacks. In arXiv:2502.08301, pp. 1–14. (Link)
- Vaugrante, Laurène; Niepert, Mathias; Hagendorff, Thilo (2024): A Looming Replication Crisis in Evaluating Behavior in Language Models? Evidence and Solutions. In arXiv:2409.20303, pp. 1–23. (Link)
- Hagendorff, Thilo (2024): Mapping the Ethics of Generative AI. A Comprehensive Scoping Review. In Minds and Machines 34 (39), pp. 1–27. (Link)
- Hagendorff, Thilo; Dasgupta, Ishita; Binz, Marcel; Chan, Stephanie C. Y.; Lampinen, Andrew; Wang, Jane X. et al. (2024): Machine Psychology. In arXiv:2303.13988, pp. 1–17. (Link)
- Hagendorff, Thilo (2024): Deception abilities emerged in large language models. In Proceedings of the National Academy of Sciences 121 (24), pp. 1–8. (Link)
- Meding, Kristof; Hagendorff, Thilo (2024): Fairness Hacking: The Malicious Practice of Shrouding Unfairness in Algorithms. In Philosophy & Technology 37 (1), pp. 1–22. (Link)
- Hagendorff, Thilo; Fabi, Sarah; Kosinski, Michal (2023): Human-like intuitive behavior and reasoning biases emerged in large language models but disappeared in ChatGPT. In Nature Computational Science 3 (10), pp. 833–838. (Link)
- Hagendorff, Thilo; Fabi, Sarah (2023): Why we need biased AI: How including cognitive biases can enhance AI systems. In Journal of Experimental & Theoretical Artificial Intelligence, pp. 1–14. (Link)
- Hagendorff, Thilo; Bossert, Leonie N.; Tse, Yip Fai; Singer, Peter (2023): Speciesist bias in AI: how AI applications perpetuate discrimination and unfair outcomes against animals. In AI and Ethics 3 (3), pp. 717–734. (Link)
- Hagendorff, Thilo (2022): A Virtue-Based Framework to Support Putting AI Ethics into Practice. In Philosophy & Technology 35 (3), pp. 1–246. (Link)
- Hagendorff, Thilo; Danks, David (2022): Ethical and methodological challenges in building morally informed AI systems. In AI and Ethics, pp. 1–14. (Link)
- Hagendorff, Thilo (2022): AI ethics and its pitfalls: not living up to its own standards? In AI and Ethics, pp. 1–8. (Link)
- Hagendorff, Thilo (2022): Blind spots in AI ethics. In AI and Ethics 2 (4), pp. 851–867. (Link)
- Hagendorff, Thilo (2021): Linking human and machine behavior. A new approach to evaluate training data quality for beneficial machine learning. In Minds and Machines 31, pp. 563–593. (Link)
- Hagendorff, Thilo; Meding, Kristof (2021): Ethical considerations and statistical analysis of industry involvement in machine learning research. In AI & SOCIETY - Journal of Knowledge, Culture and Communication, pp. 1–11. (Link)
- Hagendorff, Thilo (2021): Forbidden knowledge in machine learning. Reflections on the limits of research and publication. In AI & SOCIETY - Journal of Knowledge, Culture and Communication 36 (3), pp. 767–781. (Link)
- Helm, Paula; Hagendorff, Thilo (2021): Beyond the Prediction Paradigm. Challenges for AI in the Struggle Against Organized Crime. In Law and Contemporary Problems 84 (3), pp. 1–17. (Link)
- Hagendorff, Thilo (2020): The Ethics of AI Ethics. An Evaluation of Guidelines. In Minds and Machines 30 (3), pp. 457–461. (Link)
- Hagendorff, Thilo (2019): From privacy to anti-discrimination in times of machine learning. In Ethics and Information Technology 33 (3), pp. 331–343. (Link)
- Hagendorff, Thilo; Wezel, Katharina (2019): 15 challenges for AI: or what AI (currently) can’t do. In AI & SOCIETY - Journal of Knowledge, Culture and Communication 35 (2), pp. 355–365. (Link)
Dr. Thilo Hagendorff is an expert in AI safety and machine behavior. He works as an Independent Research Group Leader at the University of Stuttgart. Previously, he worked for the Cluster of Excellence “Machine Learning” at the University of Tuebingen. He was a visiting scholar at Stanford University, UC San Diego, and ELLIS Alicante. As a lecturer, he teaches at the Hasso Plattner Institute in Potsdam, among other institutions.
More details can be found here.
Spokespersons
Managing Director
Phone:
+49 711 685 88100
Professor for Teaching and Learning with Intelligent Systems | Spokesperson of the Stuttgart Research Focus IRIS | Co-Director of the AI Software Academy
Phone:
+49 711 685 81176
Board of Directors
Chair of the Board
Phone:
+49 711 685 84350
Chair of the Board
Phone:
+49 711 685 69395
Managing Director
Phone:
+49 711 685 88100
Professor for Teaching and Learning with Intelligent Systems | Spokesperson of the Stuttgart Research Focus IRIS | Co-Director of the AI Software Academy
Phone:
+49 711 685 81176
Chair of the Board
Phone:
+49 711 685 81940
International Advisory Board
International Advisory Board
Phone:
+1 510-486-7134
International Advisory Board
Phone:
+46 90 786 63 08
International Advisory Board
Phone:
+49 7071 2970793
IRIS Members at the University of Stuttgart
Head of the institute
Phone:
+49 711 685 67733
Chair of the Board
Phone:
+49 711 685 84350
Director and Vice Rector for Science Transfer and International Affairs
Phone:
+49 711 685 61719
Professor for Human-Computer Interaction and Cognitive Systems
Phone:
+49 711 685 60048
Chair of the Board
Phone:
+49 711 685 69395
Independent Research Group Leader | Diversity-Aware NLP Intelligent Systems (DANIS)
Department Head Philosophy of Computational Sciences (HLRS)
Phone:
+49 711 685 87289
Independent Research Group Leader | AI Safety
Phone:
+49 711 685 84314
Head of Department SOWI VII
Phone:
+49 711 685 81132
Head of Department SOWI V
Phone:
+49 711 685 83941
Chair of Foundations of Computational Linguistics
Phone:
+49 711 685 81365
Director
Phone:
+49 711 685 66577
Professor of Practical Philosophy
Phone:
+49 711 685 83658
Founding Director
Phone:
+49 711 685 82786
Director of the Institute (Molecular Tumor Cell Biology)
Phone:
+49 711 685 69301
Project Coordination CampusConnect – Sharing Innovative Education
Phone:
+49 711 685 82026
Full Professor
Phone:
+49 711 685 83156
Head of Department
Phone:
+49 711 685 63612
Director
Phone:
+49 711 685 60484
Professor for Augmented Reality and Virtual Reality, Managing Director VIS
Phone:
+49 711 685 88603
Junior Professor (Assistant Professor) Computational Linguistics
Phone:
+49 711 685 84577
Independent Research Group Leader | Computational Digital Psychology
Phone:
+49 711 685 84430
Managing Director
Phone:
+49 711 685 88100
Professor English Linguistics
Phone:
+49 711 685 83121
Management Coordinator
Phone:
+49 711 685 82379
Holder of the Chair "Control technology and mechatronics for production systems"
Phone:
+49 711 685 82422
Chair of Digital Phonetics
Phone:
+49 711 685 81372
Managing Director
Professor
Professor
Professor for Teaching and Learning with Intelligent Systems | Spokesperson of the Stuttgart Research Focus IRIS | Co-Director of the AI Software Academy
Phone:
+49 711 685 81176
Chair of the Board
Phone:
+49 711 685 81940
Head of Department
Phone:
+49 711 685 65278
Postdoctoral Researcher focusing on organizational use dynamics of AI systems
Phone:
+49 711 685 81141
Administrative Support
Team Assistant IRIS3D
Phone:
+49 711 685 84432
IRIS Coordination Team
Scientific Coordinator of the Teaching and Learning Forum RISING | Doctoral Researcher focusing on trust in intelligent systems
Coordinator of the SRF IRIS
Phone:
+49 711 685 84371
Mental Health First Aider
Scientific Coordinator of the Teaching and Learning Forum RISING | Doctoral Researcher focusing on trust in intelligent systems
Independent Research Group Leaders
Independent Research Group Leader | AI Safety
Phone:
+49 711 685 84314
Independent Research Group Leader | Computational Digital Psychology
Phone:
+49 711 685 84430
Independent Research Group Leader | Diversity-Aware NLP Intelligent Systems (DANIS)
Researchers and Scientific Employees
Research Assistant
Phone:
+49 711 685 81161
Research Assistant
Phone:
+49 711 685 84317
Research Assistant
Academic staff member
Phone:
+49 711 685 81134
Research Assistant
Phone:
+49 711 685 67067
Research Assistant
Academic Staff
Research Assistant
Phone:
+49 711 685 84317
Research Assistant
Phone:
+49 711 685 84317
Student Employees | Student Members
Student Research Assistant
Student Research Assistant
Student Research Assistant
Master's Student
Phone:
+49 176 75639790
Student Research Assistant
Student Research Assistant
Student Research Assistant
Student Research Assistant | Teaching Assistant
Student Research Assistant
Student Research Assistant
Student Research Assistant
Student Research Assistant | Teaching Assistant
Associate Members
Professor of Computer Science, Hochschule der Medien Stuttgart | Director of the Humanoid Lab
Phone:
+49 711 89232882
Chair of Bioethics | ETH Zürich
Assistant Professor, Intelligence Community Fellow | Stevens Institute of Technology
Phone:
+1 201-216-5040
Communications, Culture, Diversity - Founder & Managing Partner | 789 Consulting
Phone:
+49 172 8252822
Professor of Philosophy of Science and Technology
Phone:
+49 6151 1657446
Research Group Leader | Max Planck Institute for Intelligent Systems
Phone:
+49 711 6893516
Team Lead | GESIS - Leibniz Institute for the Social Sciences (Cologne) and Heinrich Heine University Düsseldorf
Phone:
+49 221 47694262
Associate Professor of Education | University College London, Institute of Education
Faculty member of the Political Science Program of the Division of Social Sciences, University of the Philippines Tacloban College