WordPress and Tumblr are selling user content to train AI from Midjourney and OpenAI
Feb 28, 2024 at 12:15 PM

WordPress and Tumblr are selling user content to train AI from Midjourney and OpenAI

Automattic, the parent company of Tumblr and WordPress, is reportedly in negotiation with AI firms Midjourney and OpenAI for providing training data sourced from user posts. This information comes from internal documents that 404 Media has gained access to.

According to 404 Media, Automattic has already assembled an “initial data dump” comprising all public Tumblr posts from the period 2014 to 2023, including content not visible on public blogs.

In response to the ensuing concerns, Automattic published a post titled “Protecting User Choice”, in which it clarifies its stance on AI platform crawlers. The company says it has decided to block these crawlers by default, including those operated by major tech corporations.

Automattic further clarified: “We are also collaborating directly with select AI firms only if their plans align with our community's values: attribution, opt-outs, and control. All opt-out settings will be respected in our partnerships. We also intend to regularly update our partners about users who newly opt out and request that their content be removed from past sources and future training.”

This move by Automattic mirrors a trend observed in several companies, including Reddit, which have struck deals with AI tool developers to provide training data, often sourced from publicly available online information.

Feb 28, 2024 by Paul

no
MaoholguinstoyangenovHeel
nolaray found this interesting
WordPress iconWordPress
  1652
  • ...

WordPress is a blog publishing software designed with an emphasis on accessibility, performance, security, and ease of use. It aims to function with minimum setup, allowing users to freely share their story, product, or services. Rated 4, WordPress offers customization, self-deployment, and the ability to extend functionality via plugins/extensions. Notable alternatives include Drupal, Joomla, and Ghost.

Comments

Norton
Feb 29, 2024
1

We are all just commodity. Welcome to the modern 'enlightened' world .

ddnn
Feb 28, 2024
3

So going back to Tumblr after being annoyed by IG was a waste of time after all... Guess nowhere is safe from this anyway; just a matter of whether the companies make a contract to do it as opposed to scraping the dating without the platform's consent. But that's the problem, it's not the platform's consent that matters - it's the user's.

Gu