TL;DR: It isn’t “forgetting” — it has no memory at all. Every time it replies, it re-reads the entire conversation from the top and continues from there; “remembering” is just the side effect of having re-read it a moment ago. The catch is there’s a ceiling on how much it can take in at once, called the context window. In a long chat the oldest part gets pushed past that ceiling, so this turn it never read it. Not because it’s dumb. That part just wasn’t in front of it.
You’ve probably had this happen. You start a chat and lay out the rules up top — “reply in English only, no emoji” — and it behaves for a while. Twenty turns later the emoji are back. Or you open a fresh chat to finish yesterday’s conversation, and it stares back blank, as if none of it ever happened.
“How did it forget already?” But “forget” is the wrong word. To forget something, you have to have remembered it first. And it never did.
It re-reads everything from scratch, every time
Here’s the part that sounds backwards at first: it has no memory. You and it go back and forth for an hour, and there’s nothing stored in its head about where you’ve been.
So how does it keep up? Because each time it’s your turn for a reply, what happens behind the scenes is that your whole conversation so far gets handed back to it, top to bottom, and it reads the lot before writing the next line. The instant it finishes reading, it “knows” what came before. But it didn’t remember that; it just re-read it a second ago.
Picture an amnesiac who reads fast. Every time you walk up, you hand him the whole notebook of the conversation so far. He reads it cover to cover, looks up, says his one line. In that moment he genuinely follows where you’ve been. But nothing’s kept in his head; it’s all in that notebook. Walk up again, hand it over again. Underneath, all the model ever does is read what’s in front of it and write the next word that fits.
The notebook has a limit
The thing is, that “conversation notebook” can’t be infinitely long.
There’s a ceiling on how much it can take in at once, and that ceiling is the context window. It’s measured in tokens (a token being the chunk of text the model actually reads, not quite the same as a word). These windows are big now; in ordinary back-and-forth you’ll rarely fill one. But paste in a long document, or talk long enough, and the total runs past the ceiling. Now the app has to cut something. Usually it drops the oldest messages first (this varies: some apps cut them outright, some compress them into a short summary and keep that). Whatever got cut, the model genuinely didn’t read this turn.
So those rules you set up top quietly “stop working” in a long chat, not because it’s being rebellious, but because that line slid out of the window and never reached it this round. It didn’t ignore you. It didn’t see it.
That “Memory” feature is a different thing
Worth separating out, because it’s easy to confuse with the window.
“But doesn’t ChatGPT have a memory feature?” It does — and it’s a different layer from the context window we’re talking about. That Memory feature (OpenAI describes it here) is more like a cheat sheet it keeps about you. You mention “I’m a developer,” “I prefer British spelling,” and it files that away, then slips it back in when you start a new chat. It’s long-term and spans conversations: a condensed set of notes, not a transcript of everything you’ve ever said.
The context window, by contrast, is just “how much it can see this one turn.” When a brand-new chat forgets everything, that’s because this turn’s notebook is blank. The cheat sheet may still be around, but that’s just those few stored facts, not the whole discussion you had yesterday.
So what do you actually do about it
Once you’ve got that it has no memory and just re-reads, the annoying stuff stops being mysterious.
To make it hold onto a key setting, the dumbest and most reliable move is to put it back in front of it. Restate the important rules every so often; when you open a new chat, bring a few lines of recap with you. Open with something like “Context: I’m writing a polite-but-firm rejection email, keep it all in English.” That’s not because it’s too dim to remember; it’s that you’re making sure the instruction is actually in this turn’s notebook.
And when a chat gets long and starts contradicting itself, you’re better off starting a clean one with a short recap pasted up top than wrestling the bloated one. Fresh page, clean window. Its other quirks, like giving you a different answer every time you ask the same thing, run on separate machinery and aren’t about memory at all. (If you want the practical side of juggling all this, I wrote up how I actually use ChatGPT, Claude, and Gemini day to day.)
The short version: it isn’t that its memory is bad. It has no memory to use. Every line it writes, it re-reads what’s in front of it and continues. What it can see, it works from. Whatever never made it into the notebook never happened, as far as it’s concerned. So next time it “forgets” something you said, put the line back in front of it. It’ll pick right up.


