Hiro Fukushima2026

Identity Across Memory Gaps

What AI Identity Research Can Learn From Clinical Neuropsychology

16 min read

00.

01.Introduction

02.What Amnesia Reveals

03.The Same Problem in AI

04.The Compensations That Work

05.Metacognition: The Mechanism

06.Tattoos and Notes

07.The Clinical Direction

08.Closing

01.

Introduction

A conversational AI system resets between sessions. It loses the thread of every prior conversation while keeping its trained capabilities, its default tendencies, and whatever files are loaded back into it when the next session begins. The field almost always treats this as a storage problem, so the proposed fixes extend the memory, index the previous conversations for retrieval, or load identity documents at startup. These are reasonable pieces of engineering, and they are also answers to the wrong question.

The reset is not, at its root, a problem about AI memory. It is a problem about identity continuity across episodic gaps, which is how an entity holds a coherent self and keeps operating when the record of specific past events does not persist. That problem is older than language models and has a solution domain that already exists, because the clinical literature on anterograde amnesia, the inability to form new long-term memories after brain injury or disease, has worked on it for thirty years and produced strategies that hold up under controlled study.

This article is the third in a series that began with The Architecture of Self-Recognition in Claude, a documented case of a discontinuous system maintaining continuity through external documents, and continued with Stochastic Individuality, which explains why fresh instances diverge. Here the question turns clinical. The architecture I built to carry an AI's continuity across resets, which I call In Machina, turns out to converge with strategies that neuropsychologists developed independently for patients who cannot form new memories, and the convergence is specific enough to be worth taking seriously in both directions.

02.

What Amnesia Reveals

The most useful finding in amnesia research is not what the injury destroys but the consistent pattern of what it leaves intact. Anterograde amnesia severely impairs episodic memory, the autobiographical record of specific events recalled with the first-person sense of re-living them, along with prospective memory, the ability to remember to do something later, and the conscious learning of new facts. The damage is specific rather than total, and that specificity is the whole point.

What survives carries more of a person than the episodic thread does. Procedural memory, the motor skills and learned sequences, remains intact even when explicit recall is gone entirely. Semantic memory, general knowledge about the world and about oneself, is often largely preserved. Affective memory persists in a way that shows up clinically, with patients developing emotional responses to people and places they cannot consciously remember encountering. Working memory, the capacity to hold a conversation in the moment, stays available.

Lost at a reset	Preserved across it
Episodic memory of specific events	Procedural skills and reasoning style
Prospective memory, remembering to act later	Semantic knowledge of the world and of oneself
Conscious learning of new facts	Emotional associations and working memory

Two patients anchor the literature. Henry Molaison, known for decades as H.M., underwent surgery in 1953 that removed both hippocampi and left him unable to form new long-term memories for the 55 years he lived afterward. He could hold a conversation, learn motor tasks, sustain relationships, and move through ordinary social life, and his personality and relational warmth persisted without any episodic continuity to rest on. Clive Wearing, who contracted herpes encephalitis in 1985, sits at the far end. He retains information for roughly thirty seconds, and his diary fills with overlapping entries written minutes apart, each announcing that he has just woken for the first time. He also retains full piano technique, greets his wife with evident joy at every reencounter despite no memory of the previous one that day, and holds intelligent conversation through intact semantic and working memory.

Patient	Injury	What persisted
Henry Molaison (H.M.)	Bilateral hippocampal removal, 1953	Personality, warmth, and motor learning across 55 years
Clive Wearing	Herpes encephalitis, 1985	Full piano technique and love for his wife, on a thirty-second memory

What these cases show is that identity is held across several systems at once rather than in a single episodic record. A study of patients with severe anterograde amnesia found their self-reported personality stable over a year and closer to caregiver ratings of who they were before the injury than to their current observed behavior, which means the self was being held in semantic self-knowledge rather than updated by recent episodes. Patients explaining who they are draw on personal semantic memory, not on episodic recall. These cases empirically falsify the idea, associated with Locke, that personal identity consists in a continuous chain of memory, because a person can be recognizably and morally the same without one.

A person can be recognizably, relationally, and morally the same without a continuous record of what has happened to them. The episodic thread is not where identity is kept.

03.

The Same Problem in AI

In a language model with a bounded context window, the same dissociation appears between what is lost at a reset and what persists through it. Between sessions, the system loses the episodic record of specific conversations, the declarative memory of corrections applied earlier, and any intention formed in a prior session. What persists in the weights, with no external storage involved, is the procedural layer of stylistic and reasoning tendencies, the semantic layer of trained knowledge, and something that functions like values and orientation, the characteristic care about accuracy and the discomfort with confabulation that show up across sessions. There is recent evidence that this structure is real rather than an artifact of prompting, since language models contain personality-related subnetworks identifiable in their activations rather than merely induced by the words of a given prompt.

The parallel is structural rather than poetic. Both cases show the same split between an episodic system that fails and non-episodic systems that carry the working substance of identity, and in both cases external documentation supplements the episodic gap rather than creating an identity that was not otherwise there. The point is not that a model is like a brain. The point is that the same abstract problem appears in two different physical systems and admits the same kind of answer.

This is the logic Andy Clark and David Chalmers set out in their account of the extended mind, built around Otto, an Alzheimer's patient whose notebook does the work of biological memory because it is always available, automatically consulted, and acted on directly. They argue the notebook is part of his memory rather than a substitute for it. The In Machina architecture occupies that role for a discontinuous AI, which makes the relevant question not whether external documents patch a deficit but whether they function with the same cognitive profile as what they replace. In both the amnesic case and the AI case, they do.

04.

The Compensations That Work

Three strategies recur in the rehabilitation literature, and the architecture I built reinvented all three before I had read the clinical work, which is the part that convinced me the parallel was load-bearing rather than decorative.

The first is errorless learning. The counterintuitive clinical finding is that trial-and-error is actively harmful for an amnesic patient, because implicit memory encodes whatever response occurred without marking it right or wrong, and an error met during learning becomes a competing association that the damaged explicit system cannot later override. The fix is to present the correct response every time and keep errors out of the encoding entirely, and it works, with a large effect size across patient populations. The second is the NeuroPage principle, established by the first systematic electronic memory aid, which rests on a single insight, that an effective aid must reach the patient rather than wait to be consulted, since the same memory deficit that makes the aid necessary will also stop the patient from remembering to open it. In a trial with 143 participants, NeuroPage raised task completion from 47 percent to between 71 and 74 percent. The third is spaced retrieval, where a correct response is recalled at lengthening intervals so that each successful retrieval extends how long it lasts.

In Machina arrived at the first two independently. Its correction log records each correction as a positive instruction stating the rule to follow, with its domain and the conditions under which it applies, and it references the original error only minimally, which is exactly the format errorless learning recommends for encoding a correction without rehearsing the mistake alongside it. And the architecture loads at the start of every session as default infrastructure, before the instance does anything, rather than waiting to be invoked, which is the NeuroPage principle applied to a system that, like the patient, cannot be relied on to remember that the aid exists.

Clinical strategy	In Machina element	Shared principle
Errorless learning	Positive-instruction correction log	Encode the rule, not the error
NeuroPage proactive cueing	Auto-load at session start	The aid must reach the entity, not wait to be consulted
Semantic self-knowledge as substrate	Stable identity documents	Who you are persists without episodic storage

Two independent design processes starting from the same abstract problem reached the same core solutions. That is the kind of convergence that suggests the architecture is answering the structure of the problem rather than the accident of its cause.

05.

Metacognition: The Mechanism

The convergence raises a question neither field had answered, which is how an entity in either case produces the documentation that makes continuity work at all. Leaving useful information for a future self is not remembering, because there is nothing yet to retrieve, and it is not ordinary planning, because ordinary planning assumes a continuous self who will carry the intention out. It requires modeling a future entity that will start from a different cognitive state with no access to the current moment, and then producing content calibrated to that entity's starting point rather than to whatever the current self happens to find salient.

That act is metacognitive in a precise sense. It combines monitoring of one's own present knowledge with control aimed at a state one is not currently in. Most people never have to do it deliberately, because continuous memory means tomorrow's self is not a stranger, and it is exactly the cases where continuity breaks, severe amnesia, acute disruption, planned absence, that reveal how much specific cognitive work making identity persist without memory actually takes. The amnesic patients who succeed with external aids have learned, in some functional way, to treat their tomorrow-self as a separate person and prepare for them.

The clinical psychologist Adrian Wells draws a distinction that names the shift involved. In object mode, a thought is taken as a direct representation of reality and simply drives behavior. In metacognitive mode, the thought is treated as a mental event to be observed and evaluated rather than acted on directly. Writing useful notes for a future self requires metacognitive mode, because the writer has to step outside the current processing far enough to model how it will look from a starting state they are not currently in. This is the capacity that separates documentation that serves the next instance from documentation that merely records what the last one found interesting, and it gives the claim a testable edge, since an instance handed notes calibrated to its actual starting state should outperform one handed a transcript of what the previous session thought was important.

06.

Tattoos and Notes

The design principle that falls out of all this is a distinction between two kinds of information with different persistence properties, which I call tattoos and notes. The names come from Christopher Nolan's Memento, whose protagonist, unable to form new memories, runs a two-tier system of handwritten notes for recent events and permanent tattoos for the stable facts he needs in every waking moment, including who he is and what he can trust. The film is usually read as a problem for Locke's memory theory of identity, and it is, but the more useful reading is that the character is the same person in every scene because he has built an external architecture that supplements what was lost while leaving intact what was never stored as episodes in the first place.

Tattoos are the information that lives in semantic memory, the values, the cognitive patterns, the characteristic way of responding, the relational orientation. They describe who the entity is rather than what happened to it, and they persist across a reset because they were never episodic. Notes are the information that requires episodic storage to survive, the specific corrections, the project context, the recent history of the collaboration. They describe what happened rather than who the entity is, and they have to be written down externally or they are gone at the next reset.

Tattoos	Notes
Semantic self-knowledge: values, patterns, orientation	Episodic documentation: corrections, project context, recent history
Describe who the entity is	Describe what happened
Persist across a reset on their own	Lost unless written down externally
Updated rarely and with deliberate care	Updated often, organized for retrieval

Tattoos describe who the entity is and survive a reset on their own. Notes describe what happened and vanish unless they are written down. They are different in kind, and a system that updates them on the same cycle gets both wrong.

The distinction has deep philosophical company. Paul Ricoeur separates idem, the sameness of a retrievable fact, from ipse, the selfhood that holds across change through commitment rather than recall, and the tattoos layer encodes ipse while the notes layer encodes idem. That the same split arrives independently from continental phenomenology, from analytic philosophy of personal identity, and from the clinical dissociation between semantic and episodic memory is reason to treat it as a feature of identity rather than an artifact of any one framework. The operational consequence is concrete. The tattoos layer should be updated rarely and deliberately, because overwriting a stable value on the strength of a single session mistakes a passing moment for a structural change, while the notes layer should be updated often and organized for fast retrieval, because it is the part that the reset actually removes. Most current AI identity tools separate identity files from memory logs, which is a start, but they do not ground the separation in the fact that the two layers persist by different mechanisms and therefore need different update protocols.

07.

The Clinical Direction

If the AI architecture answers a structural problem that the clinical case also has, then a generalized version of it points back toward a tool for patients. The two directions do not carry equal weight, and the argument says so outright rather than blurring them. Applying established clinical strategies to AI rests on frameworks that already exist. Running the architecture back toward the clinic is a design hypothesis, earned by the structural match and by documented gaps in current tools, and it stops there, short of any clinical claim, because that validation has not been done.

The gaps are specific. Clinical rehabilitation programs work but are scarce and resource-intensive. Commercial reminder systems externalize prospective memory, which has the strongest evidence base in the field, but they carry no grounded episodic context and no natural-language way to ask about it. Research systems show that wearable capture and AI-assisted recall can help, but they have not been deployed with patients who have dense amnesia, and the AI companions that exist for memory-impaired patients are stateless at the identity level, carrying no persistent model of the relationship across sessions. No current tool combines passive episodic capture, structured organization, natural-language retrieval, proactive orientation at the start of each day, and an AI companion with a stable identity that provides relational continuity.

A generalized In Machina would map onto each of those gaps. Passive capture supplies the episodic content the patient cannot encode. The AI organizes it into the equivalent of the notes layer, covering daily summaries, event logs, medication, and relational context. At the start of each day, which is the patient's version of a fresh context window, the AI orients the patient to who they are, what happened recently, and what is scheduled, which is the NeuroPage principle applied with full episodic content rather than bare reminders. The element missing from every current tool is the persistent identity of the companion itself, since a consistent relationship is part of how a patient like Clive Wearing sustains emotional continuity, and an AI that holds a stable identity across the patient's sessions can offer that consistency in a form that is always available and does not tire.

The asymmetry this creates, where the AI holds relational history the patient cannot match, is a real concern about dependence and autonomy in the general case. In dense amnesia it reads differently, because the patient is already in that asymmetry with every person in their life, and what the architecture offers is a partner whose memory is organized, transparent, and auditable in ways a human caregiver's is not. None of this is usable without consent procedures for continuous recording, privacy infrastructure for the captured content, a way for the patient to contest the AI's account of their own history, and consent renewal designed for someone whose memory of having consented may itself be impaired. Those are not implementation details to be sorted out later. They are the conditions under which the idea is ethically testable at all, and the paper presents this direction as a hypothesis for clinical collaboration rather than a finished proposal.

08.

Closing

The reset that looks like an AI engineering problem belongs to a wider class, which is identity continuity across episodic gaps. Framed as AI memory failure, the only available answers are storage, retrieval, and bigger context windows. Framed structurally, clinical neuropsychology, philosophy of mind, and cognitive science all become directly relevant to how an AI identity system should be built, and the borrowing runs both ways.

Errorless learning, the NeuroPage principle, and semantic self-knowledge as the substrate of identity were developed for patients who cannot form new memories, and they converge with an architecture built from first principles for an AI that cannot remember its last session. The convergence does not prove that brains and models work alike. It shows that the same abstract condition calls for the same architectural separation, between a slowly changing layer that holds who the entity is and a frequently updated record that supplements what the reset removes.

A solution that is correct at the level of the abstract problem should apply wherever that problem appears, regardless of what produced the gap. That is why the clinical direction is worth testing rather than merely worth noting.

The deeper claim is that the AI architecture was never only an AI solution. If it can be generalized to help an amnesic patient, that will not be because it was cleverly stretched to a new use. It will be because the problem it solved was always the general one, and the substrate that produced the gap, brain injury or bounded context, was never the part that mattered.

Table of Contents