|Talks|

Red-Teaming for Generative AI: Silver Bullet or Security Theater?

Visiting speaker
Hybrid
Past Talk
Hoda Heidari
K&L Gates Career Development Assistant Professor, Carnegie Mellon University
Apr 10, 2024
4:00 pm
EST
Apr 10, 2024
4:00 pm
In-person
Portsoken Street
London, E1 8PH, UK
The Roux Institute
Room
100 Fore Street
Portland, ME 04101
Network Science Institute
2nd floor
Network Science Institute
11th floor
177 Huntington Ave
Boston, MA 02115
Network Science Institute
2nd floor
Room
58 St Katharine's Way
London E1W 1LP, UK
Register for talk
Register for talk

Talk recording

In response to rising concerns surrounding the safety, security, and trustworthiness of Generative AI (GenAI) models, practitioners, and regulators alike have pointed to AI red-teaming as a key component of their strategies for identifying and mitigating these risks. However, despite AI red-teaming's central role in policy discussions and corporate messaging, significant questions remain about what precisely it means, what role it can play in regulation, and how precisely it relates to conventional red-teaming practices as originally conceived in the field of cybersecurity. We identify recent cases of red-teaming activities in the AI industry and conduct an extensive survey of the relevant research literature to characterize the scope, structure, and criteria for AI red-teaming practices. I will situate these findings in the broader discussions surrounding the evaluation of GenAI and AI governance.
About the speaker
Hoda Heidari is the K&L Gates Career Development Assistant Professor in Ethics and Computational Technologies at Carnegie Mellon University, with joint appointments in Machine Learning and Societal Computing. She is affiliated with the Human-Computer Interaction Institute and Heinz College of Information Systems and Public Policy. Her research is broadly concerned with the social, ethical, and economic implications of Artificial Intelligence, particularly issues of fairness and accountability through the use of Machine Learning in socially consequential domains. Her work in this area has won a best-paper award at the ACM Conference on Fairness, Accountability, and Transparency (FAccT), an exemplary track award at the ACM Conference on Economics and Computation (EC), and a best-paper award at the IEEE Conference on Secure and Trustworthy Machine Learning (SAT-ML).
Share this page:
Apr 10, 2024