A.I.’s un-learning problem: Researchers say it’s virtually impossible to make an A.I. model ‘forget’ the things it learns from private user data : technology

[–] hansl@lemmy.ml 23 points 1 year ago (1 children)

It’s closer to how you (as a person) know things than, say, how a database know things.

I still remember my childhood home phone number. You could ask me to forget it a million times I wouldn’t be able to. It’s useless information today. I just can’t stop remembering it.

[–] Veraticus@lib.lgbt -4 points 1 year ago* (last edited 1 year ago) (4 children)

No, you knowing your old phone number is closer to how a database knows things than how LLMs know things.

LLMs don't "know" information. They don't retain an individual fact, or know that something is true and something else is false (or that anything "is" at all). Everything they say is generated based on the likelihood of a word following another word based on the context that word is placed in.

You can't ask it to "forget" a piece of information because there's no "childhood phone number" in its memory. Instead there's an increased likelihood it will say your phone number as the result of someone prompting it to tell it a phone number. It doesn't "know" the information at all, it simply has become a part of the weights it uses to generate phrases.

[–] Zeth0s@lemmy.world 15 points 1 year ago* (last edited 1 year ago) (1 children)

It's the same in your brain though. There is no number in your brain. Just a set of synapses that allows a depolarization wave to propagate across neurons, via neurotransmitters released and absorbed in a narrow space.

The way the brain is built allows you to "remember" stuff, reconstruct information incompletely stored as different, unique connections in a network. But it is not "certain", we can't know if it's the absolute truth. That's why we need password databases and phone books, because our memory is not a database. It is probably worse than gpt-4

[+] Veraticus@lib.lgbt -9 points 1 year ago (1 children)

It doesn't matter that there is no literal number in your brain and that there are instead chemical/electronic impulses. There is an impulse there signifying your childhood phone number. You did (and do) know that. And other things too presumably.

While our brains are not perfectly efficient, we can and do actually store information in them. Information that we can judge as correct or incorrect; true or false; extant or nonexistent.

LLMs don't know anything and never knew anything. Their responses are mathematical models of word likelihood.

They don't understand English. They don't know what reality is like or what a phone number represents. If they get your phone number wrong, it isn't because they "misremembered" or because they're "uncertain." It's because it is literally incapable of retaining a fact. The phone number you asked it for is part of a mathematical model now, and it will return the output of that model, not the correct phone number.

Conversely, even if you get your phone number wrong, it isn't because you didn't know it. It's because memory is imperfect and degrades over time.

[–] Zeth0s@lemmy.world 5 points 1 year ago* (last edited 1 year ago) (1 children)

There no such an impulse, there is a neural network in your brain. These AI stuff were born as a simulation of human neural networks.

And your neural network cannot tell if something is true or untrue, it might remember a phone number as true even if it is not. English has literally a word for that, that you used: misremembed. It is so common...

It is true that LLMs do not know in a human way, they do not understand, they cannot tell if what they say is true. But they do retain facts. Ask who won f1 championship in 2001 to chatgpt. It knows it. I have problem remembering correctly, I need to check. Gpt-4 knows better than me that was there. No shame in that

[–] Veraticus@lib.lgbt -5 points 1 year ago (1 children)

You can indeed tell if something is true or untrue. You might be wrong, but that is quite different -- you can have internal knowledge that is right or wrong. The very word "misremembered" implies that you did (or even could) know it properly.

LLMs do not retain facts and they can and frequently do get information wrong.

Here's a good test. Choose a video game or TV show you know really well -- something a little older and somewhat complicated. Ask ChatGPT about specific plot points in the video game.

As an example, I know Final Fantasy 14 extremely well and have played it a long time. ChatGPT will confidently state facts about the game that are entirely and totally incorrect: it confuses characters, it moves plot points around. This is because it chooses what is likely to say, not what is actually correct. Indeed, it has no ability to know what is correct at all.

AI is not a simulation of human neural networks. It uses the concept of mathematical neural networks, but it is a word model, nothing more.

[–] fsmacolyte@lemmy.world 2 points 1 year ago* (last edited 1 year ago)

The free version gets things wrong a bunch. It's impressive how good GPT-4 is. Human brains are still a million times better in almost every way (they cost a few dollars of energy to operate per day, for example) but it's really hard to believe how capable the state of the art of LLMs is until you've tried it.

You're right about one thing though. Humans are able to know things, and to know when we don't know things. Current LLMs (transformer-based architecture) simply can't do that yet.

[–] SpiderShoeCult@sopuli.xyz 6 points 1 year ago (1 children)

Genuinely curious how you would describe humans remembering stuff, because if I remember correctly my biology classes, it's about reinforced neural pathways that become more likely to be taken by an electrical impulse than those that are less 'travelled'. The whole notion of neural networks is right there in the name, based on how neurons work.

[+] Veraticus@lib.lgbt -6 points 1 year ago (1 children)

The difference is LLMs don't "remember" anything because they don't "know" anything. They don't know facts, English, that reality exists; they have no internal truths, simply a mathematical model of word weights. You can't ask it to forget information because it knows no information.

This is obviously quite different from asking a human to forget anything; we can identify the information in our brain, it exists there. We simply have no conscious control over our ability to remember it.

The fact that LLMs employ neural networks doesn't make them like humans or like brains at all.

[–] SpiderShoeCult@sopuli.xyz 2 points 1 year ago (1 children)

I never implied they "remembered", I asked you how you interpret humans remembering since you likened it to a database, which science says it is not. Nor did I make any claims about AI knowing stuff, you inferred that by yourself. I also did not claim they possess any sort of human like traits. I honestly do not care to speculate.

The modelling statement speaks to how it came to be and the intention of programmers and serves to illustrate my point regarding the functioning of the brain.

My question remains unanswered.

[–] Veraticus@lib.lgbt -3 points 1 year ago (1 children)

I said:

No, you knowing your old phone number is closer to how a database knows things than how LLMs know things.

Which is true. Human memory is more like a database than an LLM's "memory." You have knowledge in your brain which you can consult. There is data in a database that it can consult. While memory is not a database, in this sense they are similar. They both exist and contain information in some way that can be acted upon.

LLMs do not have any database, no memories, and contain no knowledge. They are fundamentally different from how humans know anything, and it's pretty accurate to say LLMs "know" nothing at all.

[–] SpiderShoeCult@sopuli.xyz 2 points 1 year ago (1 children)

Leaving aside LLMs, the brain is not a database. there is no specific place that you can point to and say 'there resides the word for orange'. Assuming that would be the case, it would be highly inefficient to assign a spot somewhere for each bit of information (again, not talking about software here, still the brain). And if you would, then you would be able to isolate that place, cut it out, and actually induce somebody to forget the word and the notion (since we link words with meaning - say orange and you think of the fruit, colour or perhaps a carrot). If we hade a database organized into tables and say orange was a member of colours and another table, 'orange things', deleting the member 'orange' would make you not recognize that carrots nowadays are orange.

Instead, what happens - for example in those who have a stroke or those who suffer from epilepsy (a misfiring of meurons) - is that there appears a tip-of-the tongue phenomenon where they know what they want to say and can recognize notions, it's just the pathway to that specific word is interrupted and causes a miss, presumably when the brain tries to go on the path it knows it should take because it's the path taken many times for that specific notion and is prevented. But they don't lose the ability to say their phone number, they might lose the ability to say 'four' and some just resort to describing the notion - say the fruit that makes breakfast juice instead. Of course, if the damage done is high enough to wipe out a large amout of neurons, you lose larger amounts of words.

Downsides - you cannot learn stuff instantly, as you could if the brain was a database. That's why practice makes perfect. You remember your childhood phone number because you repeated it so many times that there is a strong enough link between some neurons.

Upsides - there is more learning capacity if you just relate notions and words versus, for lack of a better term, hardcoding them. Again, not talking about software here.

Also leads to some funky things like a pencil sharpener being called literally a pencil eater in Danish.

[–] Veraticus@lib.lgbt 0 points 1 year ago (1 children)

I never said the brain (or memory) was a database. I said it was more like a database than what LLMs have, which is nothing.

[–] SpiderShoeCult@sopuli.xyz 1 points 1 year ago (1 children)

And human beings are more like a fungus (eukaryotes, saprophites) than an LLM is, that doesn't mean we're mushrooms.

However, the human brain is more like an LLM than a database, because the LLM was modelled after the human brain. It's also very similar in the way that nobody actually can tell precisely how it works, for some reason it just does.

Now I wouldn't worry about philosophical implications about the nature of consciousness and such, we're a long way and we'll find a way of screwing it up.

I do question why people are so vehement to always point out what we 'have' and how special we are. Nobody sane is saying LLMs are human consciousness 2.0. So why act threatened?

[–] Veraticus@lib.lgbt 0 points 1 year ago (1 children)

Lol what the fuck? We know exactly how LLMs work. It's not magic, and it's nothing like a human brain. They're literally word frequency algorithms. There's nothing special about them and I'm the opposite of threatened; I think it's absurd people who patently don't understand them are weighing on this debate disagreeing with me when it's obvious their position can best be described as ignorant.

[–] SpiderShoeCult@sopuli.xyz 1 points 1 year ago (1 children)

I'm just going to leave this here.

some random article

A quote from the article, I found especially interesting.

"As a result, no one on Earth fully understands the inner workings of LLMs. Researchers are working to gain a better understanding, but this is a slow process that will take years—perhaps decades—to complete."

Quite an interesting read and I'm sure you can find some others if you want to and try hard enough.

[–] Veraticus@lib.lgbt 0 points 1 year ago

This is a somewhat sensationalist and frankly uninteresting way to describe neural networks. Obviously it would take years of analysis to understand the weights of each individual node and what they're accomplishing (if it is even understandable in a way that would make sense to people without very advanced math degrees). But that doesn't mean we don't understand the model or what it does. We can and we do.

You have misunderstood this article if what you took from it is this:

It’s also very similar in the way that nobody actually can tell precisely how it works, for some reason it just does.

We do understand how it works -- as an overall system. Inspecting the individual nodes is as irrelevant to understanding an LLM as cataloguing trees in a forest tells you the name of the city to which the forest is adjacent.

[–] MarcoPogo@lemmy.world 5 points 1 year ago (1 children)

Are we sure that this is substantially different from how our brain remembers things? We also remember by association

[–] Veraticus@lib.lgbt -4 points 1 year ago

But our memories exist -- I can say definitively "I know my childhood phone number." It might be meaningless, but the information is stored in my head. I know it.

AI models don't know your childhood phone number, even if you tell them explicitly, even if they trained on it. Your childhood phone number becomes part of a model of word weights that makes it slightly more likely, when someone asks it for a phone number, that some digits of your childhood phone number might appear (or perhaps the entire thing!).

But the original information is lost.

You can't ask it to "forget" the phone number because it doesn't know it and never knew it. Even if it supplies literally your exact phone number, it isn't because it knew your phone number or because that information is correct. It's because that sequence of numbers is, based on its model, very likely to occur in that order.

[–] theneverfox@pawb.social 1 points 1 year ago

This isn't true at all - first, we don't know things like a database knows things.

Second, they do retain individual facts in the same sort of way we know things, through relationships. The difference is, for us the Eiffel tower is a concept, and the name, appearance, and everything else about it are relationships - we can forget the name of something but remember everything else about it. They're word based, so the name is everything for them - they can't learn facts about a building then later learn the name of it and retain the facts, but they could later learn additional names for it

For example, they did experiments using some visualization tools and edited it manually. They changed the link been Eiffel tower and Paris to Rome, and the model began to believe it was in Rome. You could then ask what you'd see from the Eiffel tower, and it'd start listing landmarks like the coliseum

So you absolutely could have it erase facts - you just have to delete relationships or scramble details. It just might have unintended side effects, and no tools currently exist to do this in an automated fashion

For humans, it's much harder - our minds use layers of abstraction and aren't a unified set of info. That mean you could zap knowledge of the Eiffel tower, and we might forget about it. But then thinking about Paris, we might remember it and rebuild certain facts about it, then thinking about world fairs we might remember when it was built and by who, etc

[–] dustyData@lemmy.world 12 points 1 year ago

Not only it doesn't know, but for the people who trained them it is very hard to know whether some piece of information is or isn't inside the model. Introspection about how exactly the model ends up making decisions after it has been trained is incredibly difficult.

[–] SatanicNotMessianic@lemmy.ml 10 points 1 year ago (1 children)

It’s actually because they do know things in a way that’s analogous to how people know things.

Let’s say you wanted to forget that cats exist. You’d have to forget every cat meme you’ve ever seen, of course, but your entire knowledge of memes would also have to change. You’d have to forget that you knew how a huge part of the trend started with “i can haz cheeseburger.”

You’d have to forget that you owned a cat, which will change your entire memory of your life history about adopting the cat, getting home in time to feed it, and how it interacted with your other animals or family. Almost every aspect of your life is affected when you own an animal, and all of those would have to somehow be remembered in a no-cat context. Depending on how broadly we define “cat,” you might even need to radically change your understanding of African ecosystems, the history of sailing, evolutionary biology, and so on. Your understanding of mice and rats would have to change. Your understanding of dogs would have to change. Your memory of cartoons would have to change - can you even remember Jerry without Tom? Those are just off the top of my head at 8 in the morning. The ramifications would be huge.

Concepts are all interconnected, and that’s how this class of AI works. I’ve owned cars most of my life, so it’s a huge part of my personal memory and self-definition. They’re also ubiquitous in culture. Hundreds of thousands to millions of concepts relate to cats in some way, and each one of them would need to change, as would each concept that relates to those concepts. Pretty much everything is connected to everything else and as new data are added, they’re added in such a way that they relate to virtually everything that’s already there. Removing cats might not seem to change your knowledge of quarks, but there’s some very very small linkage between the two.

Smaller impact memories are also difficult. That guy with the weird mustache you saw during your vacation to Madrid ten years ago probably doesn’t have that much of a cascading effect, but because Esteban (you never knew his name) has such a tiny impact, it’s also very difficult to detect and remove. His removal won’t affect much of anything in terms of your memory or recall, but if you’re suddenly legally obligated to demonstrate you’ve successfully removed him from your memory, it will be tough.

Basically, the laws were written at a time when people were records in a database and each had their own row. Forgetting a person just meant deleting that row. That’s not the case with these systems.

The thing is that we don’t compel researchers to re-train their models on a data set if someone requests their removal. If you have traditional research on obesity, for instance, and you have a regression model that’s looking at various contributing factors, you do not have to start all over again if someone requests their data be deleted. It should mean that the person’s data are removed from your data set it it doesn’t mean that you can’t continue to use that model - at least it never has, to my knowledge. Your right to be forgotten doesn’t translate to you being allowed to invalidate the scientific models generated that glom together your data with that of tens of thousands of others. You can be left out of the next round of research on that dataset, but I have never heard of people being legally compelled to regenerate a model based on that.

There are absolutely novel legal questions that are going to be involved here, but I just wanted to clarify that it’s really not a simple answer from any perspective.

[+] Veraticus@lib.lgbt -9 points 1 year ago (1 children)

No, the way humans know things and LLMs know things is entirely different.

The flaw in your understanding is believing that LLMs have internal representations of memes and cats and cars. They do not. They have no memories or internal facts... whereas I think most people agree that humans can actually know things and have internal memories and truths.

It is fundamentally different from asking you to forget that cats exist. You are incapable of altering your memories because that is how brains work. LLMs are incapable of removing information because the information is used to build the model with which they choose their words, which is then undifferentiatable when it's inside the model.

An LLM has no understanding of anything you ask it and is simply a mathematical model of word weights. Unless you truly believe humans have no internal reality and no memories and simply say things based on what is the most likely response, you also believe humans and LLM knowledge is entirely different to each other.

[–] SatanicNotMessianic@lemmy.ml 5 points 1 year ago (1 children)

No, I disagree. Human knowledge is semantic in nature. “A cat walks across a room” is very close, in semantic space, to “The dog walked through the bedroom” even though they’re not sharing any individual words in common. Cat maps to dog, across maps to through, bedroom maps to room, and walks maps to walked. We can draw a semantic network showing how “volcano” maps onto “migraine” using a semantic network derived from human subject survey results.

LLMs absolutely have a model of “cats.” “Cat” is a region in an N dimensional semantic vector space that can be measured against every other concept for proximity, which is a metric space measure of relatedness. This idea has been leveraged since the days of latent semantic analysis and all of the work that went into that research.

For context, I’m thinking in terms of cognitive linguistics as described by researchers like Fauconnier and Lakoff who explore how conceptual bundling and metaphor define and constrain human thought. Those concepts imply that a realization can be made in a metric space such that the distance between ideas is related to how different those ideas are, which can in turn be inferred by contextual usage observed over many occurrences.

The biggest difference between a large model (as primitive as they are, but we’re talking about model-building as a concept here) and human modeling is that human knowledge is embodied. At the end of the day we exist in a physical, social, and informational universe that a model trained on the artifacts can only reproduce as a secondary phenomenon.

But that’s world apart from saying that the cross-linking and mutual dependencies in a metric concept-space is not remotely analogous between humans and large models.

[–] Veraticus@lib.lgbt -3 points 1 year ago (1 children)

But that’s world apart from saying that the cross-linking and mutual dependencies in a metric concept-space is not remotely analogous between humans and large models.

It's not a world apart; it is the difference itself. And no, they are not remotely analogous.

When we talk about a "cat," we talk about something we know and experience; something we have a mental model for. And when we speak of cats, we synthesize our actual lived memories and experiences into responses.

When an LLM talks about a "cat," it does not have a referent. There is no internal model of a cat to it. Cat is simply a word with weights relative to other words. It does not think of a "cat" when it says "cat" because it does not know what a "cat" is and, indeed, cannot think at all. Think of it as a very complicated pachinko machine, as another comment pointed out. The ball you drop is the question and it hits a bunch of pegs on the way down that are words. There is no thought or concept behind the words; it is simply chance that creates the output.

Unless you truly believe humans are dead machines on the inside and that our responses to prompts are based merely on the likelihood of words being connected, then you also believe that humans and LLMs are completely different on a very fundamental level.

[–] SatanicNotMessianic@lemmy.ml 2 points 1 year ago (1 children)

Could you outline what you think a human cognitive model of “cat” looks like without referring to anything non-cat?

[–] Veraticus@lib.lgbt -2 points 1 year ago (1 children)

Yes; it is a cat. I can think of what that is. Can an LLM?

[–] SatanicNotMessianic@lemmy.ml 2 points 1 year ago (1 children)

Describe it. Imagine I’ve never encountered a cat, because I’m from Mars.

[–] Veraticus@lib.lgbt 0 points 1 year ago* (last edited 1 year ago) (1 children)

You can't! It's like describing fire to someone that's never experienced fire.

This is the root of experience and memory and why humans are different from LLMs. Which, again, can never understand or experience a cat or fire. But the difference is more fundamental than that. To an LLM, there is no difference between fire and cat. They are simply words with frequencies attached that lead to other words. Their difference is the positions they occupy in a mathematical model where sometimes it will output one instead of the other, nothing more.

Unless you're arguing my inability to express a mental construct to you completely means I myself don't experience it. Which I think you would agree is absurd?

[–] SatanicNotMessianic@lemmy.ml 2 points 1 year ago (1 children)

I have absolutely no idea what your model is for how humans understand, relate, and communicate concepts.

[–] Veraticus@lib.lgbt -1 points 1 year ago (1 children)

How is that germane to this question? Do you agree humans can experience mental phenomena? Like, do you think I have any mental models at all?

If so, then that is the difference between me and an LLM.

[–] SatanicNotMessianic@lemmy.ml 2 points 1 year ago (1 children)

I think you have a mental model and that it is analogous to the model created in an LLM in that it is representable by a semantic graph/n-dimensional matrix relating concepts that are realized via terms.

You have never in your life encountered a dodo. You know what a dodo is (using the present these because I’m talking about a concept). It is a bird, so it relates evolutionarily and ecologically to “bird.” It’s flightless, so it relates to “patriarch” and “emu.” It is extinct, so it relates to all of the species extinction ideas you have. Humans perhaps contributed to the extinction, so it links to human-caused ecological change, which in turn links to human-caused climate change. Human-introduced invasive species are are causing ecological change in Australia, and that may have been a major factor in driving the dodo to extinction. People ate them, so maybe in your head it has a relation to wild turkeys. And so on. That’s how minds work. That’s how the human cognitive model of the world works. That’s how LLMs work.

Visualize an n-dimensional space in which these semantic topics are embedded. The interpretation of the dimensions don’t matter. Instead, we’re just worried about the distances between concepts. Dodo is closer to turkey than it is to snake. Dodo is closer to snake than it is to rock. Dodo is closer to rock than it is to the feeling of melancholy I get when listening to Tori Amos. We can grasp this intuitively. We can mathematize it by formally placing the various concepts in a metric space.

There’s a lot more to unpack, from neural correlates of consciousness to cognitive linguistics and embodied learning using metaphorical reasoning, but that’s kind of the gist of it boiled down to an overly long post.

[–] Veraticus@lib.lgbt -1 points 1 year ago (1 children)

That’s how LLMs work.

This is not how LLMs work. LLMs do not have complex thought webs correlating concepts birds, flightlessness, extinction, food, and so on. That is how humans work.

An LLM assembles a mathematical model of what word should follow any other word by analyzing terabytes of data. If in its training corpus the nearest word to "dodo" is "attractive," the LLM will almost always tell you that dodos are attractive. This is not because those concepts are actually related to the LLM, because the LLM is attracted to dodos, or because LLMs have any thoughts at all. It is simply the output of bunch of math based on word proximity.

Humans have cognition and mental models. LLMs have frequency and word weights. While you have correctly identified that both of these things can be portrayed as n-dimensional matrixes, you can also use those tools to describe electrical currents or the movement of stars. But those things contain no more thought and have no more mental phenomenon occurring in them than LLMs.

[–] SatanicNotMessianic@lemmy.ml 2 points 1 year ago (1 children)

That is exactly how LLMs work. LOMs embed semantic concepts in metric spaces. That is what we’re talking about.

[–] Veraticus@lib.lgbt 0 points 1 year ago

No, they embed word weights in metric spaces. Human thought is more like semantic concepts in a metric space (though I don't think that's entirely unequivocal, human thought is not very well-understood). Even if the space is similar what's in them is definitely not.

[–] Zeth0s@lemmy.world 5 points 1 year ago (1 children)

Actually it is also impossible to ask people to forget. This is something we share with AI

[–] Veraticus@lib.lgbt -1 points 1 year ago (1 children)

Yes, but only by chance.

Human brains can't forget because human brains don't operate that way. LLMs can't forget because they don't know information to begin with, at least not in the same sense that humans do.

[–] Zeth0s@lemmy.world 1 points 1 year ago

See my other reply ;)

Technology

Our Rules

Approved Bots