Please note that all submissions to the site are subject to the wiki's licence, CC 4.0 BY-SA, as found here

Generative AI: Difference between revisions

From Consumer Action Taskforce
Jump to navigationJump to search
No edit summary
Fixed some punctuation and improved readability.
 
(2 intermediate revisions by 2 users not shown)
Line 1: Line 1:
{{StubNotice}}
{{StubNotice}}
 
<!-- ADMIN COMMENT: This article is fine as a collection of consumer-facing GenAI issues and incidents, however we need to avoid straying into territory relating to workers rights, the ethics of web-scraping for training data, or other similar concerns. This wiki is to be strictly focussed on consumer affairs! If a company changes the terms of a contract to force users into letting their private data be used for AI training, that is relevant. If a company has laid off 500 workers to replace them with a chatbot, the treatment of those workers is not relevant to this wiki (although consumer-facing issues caused by the use of the chatbot might be!).-->


Generative AI, also referred to as GenAI or simply AI, is a program whose existence is to generate pieces of media based off of a simple prompt (e.g. "How long do I heat popcorn for in the microwave?" or "bowl of buttery popcorn, realistic, artstation, pretty") with various and random results. GenAI over its currently short existence being accessible to the public has garnered large amounts of concern across the various fields it has been applied to. <!-- I used to help operate a Kialo discussion covering Generative AI, that discussion may be beneficial for reference as a way to further flesh this page out. Just please take note that most claims are a few years old and may not be accurate, so please fact-check any statements from there before mentioning anything anti-consumer here
Generative AI, also referred to as GenAI or simply AI, is a program whose existence is to generate pieces of media based off of a simple prompt (e.g. "How long do I heat popcorn for in the microwave?" or "bowl of buttery popcorn, realistic, artstation, pretty") with various and random results. GenAI over its currently short existence being accessible to the public has garnered large amounts of concern across the various fields it has been applied to. <!-- I used to help operate a Kialo discussion covering Generative AI, that discussion may be beneficial for reference as a way to further flesh this page out. Just please take note that most claims are a few years old and may not be accurate, so please fact-check any statements from there before mentioning anything anti-consumer here
Line 18: Line 18:
|-
|-
|Replacing skilled workers with AI
|Replacing skilled workers with AI
|Due to its generalized nature, jobs across fields from digital art to writing and programming have had experienced staff replaced by lesser-paid (and often lesser-experienced) employees who would be tasked to use generative tools to do their work. Often, these are detrimental to the quality of the product released by the company.
|Due to its generalized nature, jobs across fields from digital art to writing and programming have had experienced staff replaced by lesser-paid (and often lesser-experienced) employees who would be tasked to use generative tools to do their work. To remain relevant to the wiki's purpose, the usage leads to the detriment of product quality for consumers, such as representatives replaced with chatbots, or products being sold by companies use poorly-generated content that may harm the consumer<ref>https://www.vox.com/24141648/ai-ebook-grift-mushroom-foraging-mycological-society</ref>.<!-- Reference included more to represent what is intended -->
|
|
|-
|-
Line 29: Line 29:


=== Reddit training AI off of posts ===
=== Reddit training AI off of posts ===
In late 2024, Reddit announced the release of 'Reddit Answers', a LLM that was publicly stated<ref>https://support.reddithelp.com/hc/en-us/articles/32026729424916-Reddit-Answers-Currently-in-Beta</ref> to use content created by users to train the tool, without requiring prior consent or prior public notice. <!-- Needs further coverage here -->
In late 2024, Reddit announced the release of 'Reddit Answers,' a Language Learning Model (LLM) that was publicly stated<ref>https://support.reddithelp.com/hc/en-us/articles/32026729424916-Reddit-Answers-Currently-in-Beta</ref> to use content created by users to train the tool, without requiring prior consent or prior public notice. <!-- Needs further coverage here -->


=== DeviantArt DreamUp <!-- Considering the over 2 year long history that continues to have new drama stir from this, we should look into eventually making a dedicated article focused on DreamUp --> ===
=== DeviantArt DreamUp <!-- Considering the over 2 year long history that continues to have new drama stir from this, we should look into eventually making a dedicated article focused on DreamUp --> ===
Line 35: Line 35:
https://www.deviantart.com/team/gallery --><!-- Due to my close familiarity with the situation, yes, I developed this section a lot more than initially planned. -->
https://www.deviantart.com/team/gallery --><!-- Due to my close familiarity with the situation, yes, I developed this section a lot more than initially planned. -->


=== Stability AI's mass scraping ===
When training their generative models, StabilityAI was frequently caught scraping massive amounts of the internet to fuel their training database. This has gone so far as to lead to Getty Images suing StabilityAI over using their content as training data.<ref>https://www.reuters.com/legal/getty-images-lawsuit-says-stability-ai-misused-photos-train-ai-2023-02-06/</ref>


=== LAION-5b training database ===
=== LAION-5b training database ===

Latest revision as of 00:13, 19 January 2025


Article Status Notice: This Article is a stub

Notice: This Article Requires Additional Expansion

This article is underdeveloped, and needs additional work to meet the wiki's Content Guidelines and be in line with our Mission Statement for comprehensive coverage of consumer protection issues. Issues may include:

  • This article needs to be expanded to provide meaningful information
  • This article requires additional verifiable evidence to demonstrate systemic impact
  • More documentation is needed to establish how this reflects broader consumer protection concerns
  • The connection between individual incidents and company-wide practices needs to be better established
  • The article is simply too short, and lacks sufficient content

How You Can Help:

  • Add documented examples with verifiable sources
  • Provide evidence of similar incidents affecting other consumers
  • Include relevant company policies or communications that demonstrate systemic practices
  • Link to credible reporting that covers these issues
  • Flesh out the article with relevant information

This notice will be removed once the article is sufficiently developed. Once you believe the article is ready to have its notice removed, visit the Discord (join here) and post to the #appeals channel, or mention its status on the article's talk page.

Generative AI, also referred to as GenAI or simply AI, is a program whose existence is to generate pieces of media based off of a simple prompt (e.g. "How long do I heat popcorn for in the microwave?" or "bowl of buttery popcorn, realistic, artstation, pretty") with various and random results. GenAI over its currently short existence being accessible to the public has garnered large amounts of concern across the various fields it has been applied to.

General Controversies Surrounding Generative AI[edit | edit source]

Controversy Brief Description Related Article(s)/Section(s)
Training data collected without consent Various platforms have scraped data ranging within the petabytes concerning content created by users and potentially owned by companies, without first obtaining an adequate license to use this data. This has gone so far as to not even request consent or even notifying users in advance that their content was used to train AI-powered tools.
Replacing skilled workers with AI Due to its generalized nature, jobs across fields from digital art to writing and programming have had experienced staff replaced by lesser-paid (and often lesser-experienced) employees who would be tasked to use generative tools to do their work. To remain relevant to the wiki's purpose, the usage leads to the detriment of product quality for consumers, such as representatives replaced with chatbots, or products being sold by companies use poorly-generated content that may harm the consumer[1].

Specific Controversies Involving Generative AI[edit | edit source]

Reddit training AI off of posts[edit | edit source]

In late 2024, Reddit announced the release of 'Reddit Answers,' a Language Learning Model (LLM) that was publicly stated[2] to use content created by users to train the tool, without requiring prior consent or prior public notice.

DeviantArt DreamUp[edit | edit source]

While more speculative, it is reasonable for users to assume[3] that when DeviantArt initially automatically opted all users into allowing their work to be training data for generative AI[4][5], that all content uploaded to DeviantArt was used as training data for their DreamUp tool, however according to statements from DeviantArt CEO Moti Levy[6], DeviantArt did not plan or intend to train their tool based on user-generated works and that any user-generated works that were used in their model, were introduced by StabilityAI. Regardless, the introduction of DreamUp to the art sharing platform has both stirred controversy on the platform[7], and also fractured the platform into 2 parties[8], those for generative AI (typically those who hold newer accounts) and those against (typically users who have existed on the platform for far longer.) Due to the introduction of DreamUp, the platform has been cluttered by AI generated images, and staff have historically, frequently, and intentionally featured multiple users who exclusively upload GenAI content[9][10][11] or post content that uses generative content as a base[12], with a majority of featured creators being ones who nearly or exclusively upload AI generated content.


LAION-5b training database[edit | edit source]

Many users have had their content scraped by LAION to power their training database, and the only way they can opt out is via a third party[13].

References[edit | edit source]