Exploring the Use of Abusive Generative AI Models on Civitai

Authors: Yiluo Wei, Yiming Zhu, Pan Hui, Gareth Tyson

Published: 2024-07-16 06:18:03+00:00

AI Summary

This paper presents the first large-scale empirical study of the Civitai AIGC platform, focusing on the prevalence of abusive content. The authors analyze a dataset of 87,000 models and 2 million images to characterize the themes, popularity, and user engagement related to NSFW content and deepfakes, finding a significant presence of both.

Abstract

The rise of generative AI is transforming the landscape of digital imagery, and exerting a significant influence on online creative communities. This has led to the emergence of AI-Generated Content (AIGC) social platforms, such as Civitai. These distinctive social platforms allow users to build and share their own generative AI models, thereby enhancing the potential for more diverse artistic expression. Designed in the vein of social networks, they also provide artists with the means to showcase their creations (generated from the models), engage in discussions, and obtain feedback, thus nurturing a sense of community. Yet, this openness also raises concerns about the abuse of such platforms, e.g., using models to disseminate deceptive deepfakes or infringe upon copyrights. To explore this, we conduct the first comprehensive empirical study of an AIGC social platform, focusing on its use for generating abusive content. As an exemplar, we construct a comprehensive dataset covering Civitai, the largest available AIGC social platform. Based on this dataset of 87K models and 2M images, we explore the characteristics of content and discuss strategies for moderation to better govern these platforms.


Key findings
A significant portion of models and images contained NSFW content and deepfakes, with deepfakes frequently associated with NSFW material and targeting celebrities. NSFW models and images received significantly more engagement than non-NSFW content, and creators of abusive content held higher centrality within the Civitai social network.
Approach
The researchers collected metadata from Civitai using its API and Selenium, augmented the data with thematic labels using ChatGPT, and analyzed the themes, popularity (measured by image generation counts), user engagement (downloads, favorites, comments), and creator network centrality to understand the prevalence and characteristics of abusive content.
Datasets
A comprehensive dataset of 87,042 generative models and 2,740,149 AI-generated images from Civitai, along with existing datasets DiffusionDB and JourneyDB for comparative analysis of prompts.
Model(s)
ChatGPT (gpt-3.5-turbo-0125) for theme extraction and person name recognition; OpenAI's text-moderation-006 model for NSFW content detection in prompts.
Author countries
China, Hong Kong, United Kingdom, Finland