GENAIPABENCH: A Benchmark for Generative AI-based Privacy Assistants

Abstract

In the age of data-driven technology, privacy has emerged as a critical concern for both users and organizations. Privacy policies are widely used to outline the data management practices of a company. However, it has been demonstrated that privacy policies, as well as regulations, are generally too complex for people, which motivates the need for privacy assistants. The emergence of generative AI (genAI) technology presents an opportunity to improve the capabilities of privacy assistants to answer questions people might have about their privacy. However, while textual genAI technology generates content that seems accurate, it is been shown that it can generate fake/incorrect information (e.g., due to its tendency to hallucinate) which might mislead the user. This paper presents, GENAIPABENCH, a novel privacy benchmark to evaluate the performance of generative AI-based privacy assistants (GenAIPAs). GENAIPABENCH includes: 1) A comprehensive set of questions about an organization’s privacy policy along with annotated answers and data protection regulation questions; 2) A set of metrics to evaluate the answers obtained from the generative AI system; 3) An evaluation tool that generates appropriate prompts to introduce the system to the privacy document and different variations of the privacy questions to evaluate its robustness. We present an evaluation of OpenAI’s ChatGPT using GENAIPABENCH. Our findings indicate that ChatGPT holds considerable promise as a privacy agent, although it experiences challenges such as handling complex tasks, ensuring response consistency, and accurately referencing the information provided.

Publication
arXiv
Aamir Hamid
Aamir Hamid
Ph.D Student

My research interests include Machine Leaning,Deep Learning, and Privacy.

Hemanth Reddy Samidi
Hemanth Reddy Samidi
MS Student

My research interests include Machine Learning, Data Science, Data Visualization, and Privacy.

Tim Finin
Tim Finin
Professor

Tim Finin is the Willard and Lillian Hackerman Chair in Engineering and a Computer Science and Electrical Engineering professor at the University of Maryland, Baltimore County (UMBC). He has over 50 years of experience in applying AI to problems in information systems and language understanding. His current research focuses on representing and reasoning with knowledge graphs, analyzing and extracting information from text, and enhancing security and privacy in information systems. He is an ACM fellow, a AAAI fellow, an IEEE technical achievement award recipient, and was selected as the UMBC Presidential Research Professor in 2012. Finin received an S.B. degree in Electrical Engineering from MIT and a Ph.D. in Computer Science from the University of Illinois at Urbana-Champaign. He has held positions at UMBC, Unisys, the University of Pennsylvania, Johns Hopkins University, and the MIT AI Laboratory. He has chaired the UMBC Computer Science department, served on the Computing Research Association board of directors, been a AAAI councilor, and chaired many major research conferences. He is a former editor-in-chief of the Elsevier Journal of Web Semantics.

Primal Pappachan
Primal Pappachan
Assistant Professor

My research interests include data management, privacy, and Internet of Things.

Roberto Yus
Roberto Yus
Assistant Professor

My research interests include Data Management, Knowledge Representation, the Internet of Things, and Privacy.

Related