836
Meta admits using pirated books to train AI, but won't pay for it
(www.techspot.com)
This is a most excellent place for technology news and articles.
So ISPs are blocking Meta for their breaking of copyright?
Because ISPs block Sci-Hub.
No, one of them is having governments trying to kick off the internet, and the other is allowed to continue doing what they're doing and the worst they'll face is a fine. Not even close to the same, completely disproportionate. If they were blocking all Meta LLMs until they had removed all copyrighted material, maybe we could say the same.
ISPs may block sites to prevent unauthorized copying. It's not a punishment for past wrong-doing. I'm not sure about the details, I think this differs a lot between jurisdictions. But basically, as ISPs they are involved in the unauthorized act of copying. Their servers copy the data to the end user/customer. So, they may be on the hook for infringement themselves if they don't act.
Again, I am not aware of Meta sharing the copyrighted books in question. So, I don't know what the legal basis for blocking Meta would be. If ISPs block a site without a legal basis, they are probably on the hook for breach of contract.
IDK on what basis the sharing of Meta's LLMs could be stopped. If anyone could claim copyright it would be Meta itself and they allow sharing them. (I have doubts if AI models are copyrightable under current US law.)
https://www.nytimes.com/2024/01/08/technology/openai-new-york-times-lawsuit.html
If the New York Times' evidence is true (I haven't seen the evidence, so I can't comment on veracity), then you can recreate copyrighted works with LLMs, and as such, they're doing the same thing as the Pirate Bay, distributing copyrighted works without authorization and making money off the venture.
So far, no ISPs are blocking Meta for this.
I expect ISPs would get into a lot of legal trouble if they did.
The NYT sued OpenAI and MS. a) That doesn't involve Meta. b) It's a claim by the NYT.
Why should ISPs deny their paying customers access to Meta sites or sites hosting LLMs released by Meta? These customers have contracts with their service providers. On what grounds, would ISPs be in the right to stop providing these internet services?
Both Meta and ChatGPT used books3, it's functionally the same type of case.
In the countries where ISP blocking happens, its usually because a copyright holder has sued and demanded blocking at the ISP level and has won in court. Then, the government begins the path of working with ISPs to block the site.
Unless you think most governments that do this do it arbitrarily? No, they do it because a copyright holder sued, like the New York Times has. The NYT has not demanded ISP-level blocking, but that does not mean that they couldn't. I can't speak to their choice not to do so other than it seems that companies only save that for truly altruistic groups, and rarely do it for other big corporations.
IDK why you believe this. Breaking contracts is illegal. You get sued and have to pay damages. Some contracts, in some jurisdictions, may allow such arbitrary decisions. In other jurisdictions such clauses may be unenforceable.
Well, that's not something that copyright law cares about very much. Unfortunately, this community seems very pro-copyright; very Ayn Rand even. You're not likely to get much agreement for any sensible reforms; quite the opposite. I don't think arguing that Meta is doing the same as TPB is going to win anyone over. It's more likely to get people here to call for more onerous and more harmful IP laws.
FWIW, no. the NYT case and this is different in some crucial ways.