Red-Teaming for Generative AI: Silver Bullet or Security Theater?
On-campus talk
Hoda Heidari
K&L Gates Career Development Assistant Professor, Carnegie Mellon University
Past Talk
Hybrid talk
Wednesday
Apr 10, 2024
Watch video
4:00 pm
Virtual
177 Huntington Ave.
11th floor
Online
Register here
In response to rising concerns surrounding the safety, security, and trustworthiness of Generative AI (GenAI) models, practitioners, and regulators alike have pointed to AI red-teaming as a key component of their strategies for identifying and mitigating these risks. However, despite AI red-teaming's central role in policy discussions and corporate messaging, significant questions remain about what precisely it means, what role it can play in regulation, and how precisely it relates to conventional red-teaming practices as originally conceived in the field of cybersecurity. We identify recent cases of red-teaming activities in the AI industry and conduct an extensive survey of the relevant research literature to characterize the scope, structure, and criteria for AI red-teaming practices. I will situate these findings in the broader discussions surrounding the evaluation of GenAI and AI governance.
About the speaker
About the speaker
Hoda Heidari is the K&L Gates Career Development Assistant Professor in Ethics and Computational Technologies at Carnegie Mellon University, with joint appointments in Machine Learning and Societal Computing. She is affiliated with the Human-Computer Interaction Institute and Heinz College of Information Systems and Public Policy. Her research is broadly concerned with the social, ethical, and economic implications of Artificial Intelligence, particularly issues of fairness and accountability through the use of Machine Learning in socially consequential domains. Her work in this area has won a best-paper award at the ACM Conference on Fairness, Accountability, and Transparency (FAccT), an exemplary track award at the ACM Conference on Economics and Computation (EC), and a best-paper award at the IEEE Conference on Secure and Trustworthy Machine Learning (SAT-ML).
Hoda Heidari is the K&L Gates Career Development Assistant Professor in Ethics and Computational Technologies at Carnegie Mellon University, with joint appointments in Machine Learning and Societal Computing. She is affiliated with the Human-Computer Interaction Institute and Heinz College of Information Systems and Public Policy. Her research is broadly concerned with the social, ethical, and economic implications of Artificial Intelligence, particularly issues of fairness and accountability through the use of Machine Learning in socially consequential domains. Her work in this area has won a best-paper award at the ACM Conference on Fairness, Accountability, and Transparency (FAccT), an exemplary track award at the ACM Conference on Economics and Computation (EC), and a best-paper award at the IEEE Conference on Secure and Trustworthy Machine Learning (SAT-ML).