Context Window on KbWen Blog

Why Does AI Forget What You Said Earlier?

KbWen — Sat, 20 Jun 2026 09:45:00 +0800

TL;DR: It isn’t “forgetting” — it has no memory at all. Every time it replies, it re-reads the entire conversation from the top and continues from there; “remembering” is just the side effect of having re-read it a moment ago. The catch is there’s a ceiling on how much it can take in at once, called the context window. In a long chat the oldest part gets pushed past that ceiling, so this turn it never read it. Not because it’s dumb. That part just wasn’t in front of it.

You’ve probably had this happen. You start a chat and lay out the rules up top — “reply in English only, no emoji” — and it behaves for a while. Twenty turns later the emoji are back. Or you open a fresh chat to finish yesterday’s conversation, and it stares back blank, as if none of it ever happened.

“How did it forget already?” But “forget” is the wrong word. To forget something, you have to have remembered it first. And it never did.

It re-reads everything from scratch, every time

Here’s the part that sounds backwards at first: it has no memory. You and it go back and forth for an hour, and there’s nothing stored in its head about where you’ve been.

So how does it keep up? Because each time it’s your turn for a reply, what happens behind the scenes is that your whole conversation so far gets handed back to it, top to bottom, and it reads the lot before writing the next line. The instant it finishes reading, it “knows” what came before. But it didn’t remember that; it just re-read it a second ago.

Picture an amnesiac who reads fast. Every time you walk up, you hand him the whole notebook of the conversation so far. He reads it cover to cover, looks up, says his one line. In that moment he genuinely follows where you’ve been. But nothing’s kept in his head; it’s all in that notebook. Walk up again, hand it over again. Underneath, all the model ever does is read what’s in front of it and write the next word that fits.

The notebook has a limit

The thing is, that “conversation notebook” can’t be infinitely long.

There’s a ceiling on how much it can take in at once, and that ceiling is the context window. It’s measured in tokens (a token being the chunk of text the model actually reads, not quite the same as a word). These windows are big now; in ordinary back-and-forth you’ll rarely fill one. But paste in a long document, or talk long enough, and the total runs past the ceiling. Now the app has to cut something. Usually it drops the oldest messages first (this varies: some apps cut them outright, some compress them into a short summary and keep that). Whatever got cut, the model genuinely didn’t read this turn.

So those rules you set up top quietly “stop working” in a long chat, not because it’s being rebellious, but because that line slid out of the window and never reached it this round. It didn’t ignore you. It didn’t see it.

That “Memory” feature is a different thing

Worth separating out, because it’s easy to confuse with the window.

“But doesn’t ChatGPT have a memory feature?” It does — and it’s a different layer from the context window we’re talking about. That Memory feature (OpenAI describes it here) is more like a cheat sheet it keeps about you. You mention “I’m a developer,” “I prefer British spelling,” and it files that away, then slips it back in when you start a new chat. It’s long-term and spans conversations: a condensed set of notes, not a transcript of everything you’ve ever said.

The context window, by contrast, is just “how much it can see this one turn.” When a brand-new chat forgets everything, that’s because this turn’s notebook is blank. The cheat sheet may still be around, but that’s just those few stored facts, not the whole discussion you had yesterday.

So what do you actually do about it

Once you’ve got that it has no memory and just re-reads, the annoying stuff stops being mysterious.

To make it hold onto a key setting, the dumbest and most reliable move is to put it back in front of it. Restate the important rules every so often; when you open a new chat, bring a few lines of recap with you. Open with something like “Context: I’m writing a polite-but-firm rejection email, keep it all in English.” That’s not because it’s too dim to remember; it’s that you’re making sure the instruction is actually in this turn’s notebook.

And when a chat gets long and starts contradicting itself, you’re better off starting a clean one with a short recap pasted up top than wrestling the bloated one. Fresh page, clean window. Its other quirks, like giving you a different answer every time you ask the same thing, run on separate machinery and aren’t about memory at all. (If you want the practical side of juggling all this, I wrote up how I actually use ChatGPT, Claude, and Gemini day to day.)

The short version: it isn’t that its memory is bad. It has no memory to use. Every line it writes, it re-reads what’s in front of it and continues. What it can see, it works from. Whatever never made it into the notebook never happened, as far as it’s concerned. So next time it “forgets” something you said, put the line back in front of it. It’ll pick right up.

中文版：為什麼 AI 會忘記我前面說過的話？

為什麼 AI 會忘記我前面說過的話？

KbWen — Sat, 20 Jun 2026 09:30:00 +0800

TL;DR：它不是「忘記」，是它根本沒有記憶。每次回你話，它做的是把整段對話從頭重讀一遍，再接下去寫。所謂「記得」，不過是每次都重看了一次。問題是它一次能讀進去的量有上限，這個範圍叫 context window（上下文視窗）。對話太長，最舊的部分就被擠出去、它這次根本沒讀到。所以與其說它「忘了」，不如說那段話從頭到尾就沒進到它眼前。

這事你八成碰過。跟 AI 聊一個東西，開頭你交代清楚「全部用繁體中文、不要 emoji」，它前幾輪乖乖的；聊著聊著，十幾二十輪過去，又開始冒簡體、撒 emoji。或者更乾脆——昨天跟它討論到一半的事，今天開個新對話再問，它一臉茫然，好像那段話從沒發生過。

你會很自然地說「它怎麼又忘了」。但「忘記」這個詞有點誤導：要先「記得」過，才談得上忘。而它呢……其實從來沒有記得。

它每次都是從頭讀一遍

先講一件反直覺的事：它沒有記憶。你跟它聊了半天，它腦袋裡沒存著「我們剛剛聊到哪」。

那它怎麼接得上話？因為每次輪到它回你，背後是把你們從開頭到現在的整段對話，原封不動再餵它讀一遍，從頭讀到尾，然後往下接一句。它讀完那一瞬間「知道」前面發生了什麼，但那不是記住，是剛剛又重看了一次。

打個比方。它比較像一個失憶、但讀很快的人。你每次去找他，都把你們從頭到現在的對話紀錄整本塞給他，他飛快翻完，抬頭回你一句。那一刻他是真懂了你們聊到哪；可他腦子裡什麼都沒留，全靠手上那本。你下次再來，又是重新塞一次。它骨子裡一直在做的，就是讀完眼前這串、然後接下去寫最順的那個字而已。

那本紀錄，有塞不下的時候

問題來了：那本「對話紀錄」不能無限長。

它一次能讀進去的量是有上限的，這個上限就叫 context window，上下文視窗。算的單位是 token——也就是它眼裡的文字小塊，不完全等於一個字（這個我在 Token 是什麼裡聊得比較細，草莓數 r 那篇也有個積木的比喻，這篇不看也不影響）。

現在這個視窗開得很大，一般閒聊很難塞爆。但只要你貼了一篇長文、或一路聊了很久很久，總量超過上限，系統就得動手砍——通常是把最舊的對話先丟掉（各家做法不太一樣，有的直接砍掉、有的先壓成一小段摘要再留著）。被丟掉的那一段，它這一輪是真的沒讀到。

所以你開頭交代的那些規矩，聊太長之後會慢慢「失效」，多半是那句話早就滑出視窗，這一輪根本沒進到它眼前。它不是叛逆，就只是沒看到。

那個「記憶」功能，是另一回事

你可能會說：「ChatGPT 不是有記憶功能嗎？」有，但那個跟我們在講的 context window 不是同一層的東西。

那個記憶（Memory）功能，比較像它幫你另外整理的一份小抄（OpenAI 官方有說明這功能）。你說過「我是寫程式的」「我習慣用繁中」，它記下來，之後每開新對話，偷偷把這些塞回去提醒自己。那是跨對話、長期留著的東西，而且是濃縮過的重點，不是你們每一句話的逐字稿。

context window 則是「這一次，它眼前能看多少」。這兩個常被當成同一回事，其實根本不在同一層。你新開一個對話它忘得一乾二淨，就因為這一次的對話紀錄是空白的——那份小抄也許還在，但它頂多是幾條濃縮的重點，不是你們昨天那整段對話。

知道這件事之後，可以怎麼用

想通它沒記憶、全靠重讀，有些原本很煩的狀況就順了。

要它記住某個關鍵設定，最土也最有效的辦法就是「再貼一次」。重要的規矩，過一陣子重申一遍；開新對話時，把前情提要濃縮成幾句帶上去（像開頭先丟一句「承上次：我在寫一封婉拒信，語氣要客氣但堅定，全程用繁中」，把關鍵設定一次交代清楚）。這不是它笨到要你複述，是你得確保那段話這一輪真的出現在它面前——出現了，它就讀得到。

還有一招：一個對話聊到又臭又長、前後打架的時候，與其在裡面跟它盧，不如乾脆開個乾淨的新對話，把目前的結論濃縮成幾句貼進去重來。新的一頁、乾淨的視窗，往往比硬聊下去清爽得多。

至於它另外那些怪，比方同一個問題每次答案都不一樣，那是別的機制，跟記不記得無關，這篇就先不岔過去了。

說到底，它的毛病不是記性差，是壓根沒在用記性那套東西。每接一句話，它都是把眼前那本對話重讀一遍，再往下接。讀得到的就接得上，沒讀到的，對它來說就等於沒發生過。下次它又「忘了」你前面講的，先別急著嫌。多半是那句話這輪沒擺到它眼前；你補回去，它就接得回來。

English version: Why Does AI Forget What You Said Earlier?