cantankerous_cashew@lemmy.world to

Technology@lemmy.worldEnglish · 4 days ago

Meta Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Reveal

cross-posted to:
technology@lemmy.world

1

Meta Secretly Trained Its AI on a Notorious Piracy Database, Newly Unredacted Court Docs Reveal

cantankerous_cashew@lemmy.world to

Technology@lemmy.worldEnglish · 4 days ago

cross-posted to:
technology@lemmy.world

One of the most important AI copyright legal battles just took a major turn.

Chat

rumba@lemmy.zip
link
fedilink
English
arrow-up
0·
4 days ago
The notorious piracy database in question is Library Genesis.

Cached article:

https://web.archive.org/web/20250110075821/https://www.wired.com/story/new-documents-unredacted-meta-copyright-ai-lawsuit/
- CriticalMiss@lemmy.world
  link
  fedilink
  English
  arrow-up
  0·
  4 days ago
  Earlier reports suggested they trained it on books from Bibliotik.
  
  What changed?
  - BetaDoggo_@lemmy.world
    link
    fedilink
    English
    arrow-up
    0·
    4 days ago
    The llama-1 paper acknowledged the use of the books dataset, libgen isn’t mentioned in any of the papers so this is new info.
  - halcyoncmdr@lemmy.world
    link
    fedilink
    English
    arrow-up
    0·
    4 days ago
    Probably just both honestly.

Technology@lemmy.world

technology@lemmy.world

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: !technology@lemmy.world

This is a most excellent place for technology news and articles.

Our Rules

Follow the lemmy.world rules.
Only tech related content.
Be excellent to each another!
Mod approved content bots can post up to 10 articles per day.
Threads asking for personal tech support may be deleted.
Politics threads may be removed.
No memes allowed as posts, OK to post as comments.
Only approved bots from the list below, to ask if your bot can be added please contact us.
Check for duplicates before posting, duplicates may be removed

Approved Bots

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

609 users / day
2.19K users / week
3.24K users / month
3.38K users / 6 months
0 local subscribers
60.5K subscribers
626 Posts
12.3K Comments
Modlog