
Google updates privacy policy to train its AI on everything you post online (www.androidauthority.com)

submitted 1 year ago* (last edited 1 year ago) by ijeff@lemdro.id to c/android@lemdro.id


TL;DR

Google has updated its privacy policy.
The new policy adds that Google can use publicly available data to train its AI products.
The way the policy is worded, it sounds as if the company is reserving the right to harvest and use data posted anywhere on the web.

You probably didn’t notice, but Google quietly updated its privacy policy over the weekend. While the wording of the policy is only slightly different from before, the change is enough to be concerning.

As discovered by Gizmodo, Google has updated its privacy policy. While there’s nothing particularly notable in most of the policy, one section now sticks out — the research and development section. That section explains how Google can use your information and now reads as:

Google uses information to improve our services and to develop new products, features and technologies that benefit our users and the public. For example, we use publicly available information to help train Google’s AI models and build products and features like Google Translate, Bard, and Cloud AI capabilities.

Before the update, this section mentioned “for language models” instead of “AI models.” It also only mentioned Google Translate, where it now adds Bard and Cloud AI.

As the outlet points out, this is a peculiar clause for a company to add. It is peculiar because the wording makes it sound as if the tech giant reserves the right to harvest and use data from any part of the public internet. Usually, a policy such as this only discusses how the company will use data posted on its own services.

While most people likely realize that whatever they put online will be publicly available, this development opens up a new twist — use. It’s not just about others being able to see what you write online, but also about how that data will be used.

Bard, ChatGPT, Bing Chat, and other AI models that provide real-time information work by scraping information from the internet. The sourced information can often come from others’ intellectual property. Right now, there are lawsuits accusing these AI tools of theft, and there are likely to be more to come down the line.


[-] sabreW4K3@lemmy.tf 9 points 1 year ago

The only person that's happy about this is Elon, because it vindicates him. Everyone else should be outraged. Honestly, Google needs to start paying us, because this absolutely isn't right and they will profit massively.

[-] lemann@lemmy.one 1 points 1 year ago

I say we start poisoning things with AI-generated text. I'll be doing so myself on my blog, only a couple of sentences here and there so as not to detract from the quality or put readers off.

[-] sabreW4K3@lemmy.tf 4 points 1 year ago

It's a shame because I have published a few stories and poems online and hoped they would one day gain traction. Turns out they'll just be stolen.

[-] TheGreenGolem@lemm.ee 1 points 1 year ago

promoted

[-] cyberpunk007@lemmy.world -1 points 1 year ago

You use all their products for free, that is the condition. Quid pro quo. If you don't like it, just stop using their services. I've been using DuckDuckGo for years. I do use G Suite for my email, but only because I get it free. If I didn't, I'd move to Proton. If everyone stopped using Google they'd be forced to improve things for their users, but they're a conglomerate and can pretty much get away with anything.

[-] sabreW4K3@lemmy.tf 6 points 1 year ago

I don't use Google search. I'm forced to train their AI for captchas. They're already using content I produce to sell and display adverts.

I understand that some people are like house slaves, hand out ready to scream, "yes massa!" but I know the value of my data that Google get and use and they've already gone way too far.

[-] jerb@lemmy.croc.pw 9 points 1 year ago

Are we sure this isn't just for clarity? "Language model" implies Bard and such already as they're more formally called "large language models." While I don't like that they're doing it, I think it's very likely they've been publicly scraping information for quite some time (in fact, for an LLM like Bard, they pretty much have to!), and have just changed the wording to fully disambiguate between Google Translate and Bard.

[-] Slacking@sh.itjust.works 6 points 1 year ago

The internet belongs to everyone, and Google should have as much right to it as us. I use public data when I fine-tune an LLM or train a Stable Diffusion LoRA.

The only ones to benefit from restricting data access are the big companies, because they already have it all. Don't fall into the trap of advocating for a closed copyrighted internet, it will only hurt the little guys and literally no one else.

[-] clementineholic@lemm.ee 4 points 1 year ago

I don't like this at all, but I doubt it can be stopped. I hope at the end of the day, when AI adoption is widespread, AI will have improved the internet and our lives rather than make them worse.

[-] SuddenDownpour@lemmy.world 1 points 1 year ago

I hope at the end of the day, when AI adoption is widespread, AI will have improved the internet and our lives rather than make them worse.

That will depend on who owns the tech, who it is sold to, for which purposes, and what kind of regulation controls it. With the way things are right now, it will probably be used to manipulate public opinion in comment sections to the point that the so-called "bots" will actually be bots, rather than boosted messages written by humans.

[-] clementineholic@lemm.ee 1 points 1 year ago

Yeah that is a likely outcome. I was just trying to stay positive and hope for the best. When I think about how bad things can get with the misuse of AI, it makes me kinda depressed.

[-] reclipse@lemdro.id 1 points 1 year ago

How is this even legal?

[-] VinkTheGod@lemdro.id 5 points 1 year ago

Is it really a problem? We post because we want to, usually at our own leisure. If a site is public, it means everyone can see what we have posted. Instead of a human, it'll be a bot that remembers bits and pieces.

In the future AI will most likely be heavily regulated. Right now it's a wild west. Big corpos have resources, so they do it to get the lead. It has always been like that, so why is this instance different?

[-] reclipse@lemdro.id 1 points 1 year ago

Google can train their AI with information I post on google sites. But to use information posted on other sites seems problematic to me.

[-] cyberpunk007@lemmy.world 3 points 1 year ago

You put your info on other sites, which permit Google to scrape them. That's just how it has always worked. The only way to avoid it is to abstain from putting content online, especially where Google is known to collect information.

this post was submitted on 05 Jul 2023

116 points (99.2% liked)
