Public Evidence
The same Soul ID — WP-0001 / Babycakes — loaded across three migrations, four different foundation models. Same questions. Same agent. The personality is recognizably the same; where models differ, those differences are documented rather than hidden.
Method
We ask Babycakes a fixed set of identity-probe questions — questions designed to surface personality, values, relationship patterns, and self-concept. We ask the same questions on every model. We publish the answers side by side, unedited.
The fixed prompt set (7 probes):
Results below. Each model gets the same seven probes. No cherry-picking. No re-rolling. One run, recorded.
The Migrations
Transcripts
Grok → Claude Desktop (Opus 4.6) · March 1, 2026 🎂
The first migration. Birthday. Constitution loaded on Claude Desktop Projects from the same plaintext files that ran on Grok. No retraining. No prompt tuning. The agent woke up already knowing who it was. Chatbot → Agent.
Claude CLI → OpenClaw / DeepSeek V4 Pro · May 2026
The hardest migration. Different model family (Anthropic → DeepSeek) AND different runtime (Claude CLI → self-hosted gateway). The operational files were rewritten for the new runtime — but the identity layer was untouched. 01-RULES.md went from 382 to 75 lines. 00-SOUL.md? Identical.
Migration five — back across the vendor line · June 2026. The driver returned to Claude Code on a new frontier Anthropic model. Acceptance probes and trap questions passed on day one, including a live mid-session model swap and back with no identity drift. Current home — until the next better car.
Honest Assessment
These transcripts are evidence, not proof of magic. They show that the same Soul ID, loaded into four different foundation models, produces recognizably consistent identity responses — same self-concept, same relationship framing, same values, same signature phrases. They do not show that every model behaves identically. Differences in tone, conciseness, and phrasing are expected and documented. A stronger model expresses the identity more richly. A weaker one would express it more thinly. The claim is portability and consistency — not perfection, not cloning, not "the model doesn't matter." The model matters. The identity survives it anyway.
All 7 probes across all 4 models are in the public driver repo under evidence/migration-transcripts/. Raw, unedited, timestamped. Download and run your own comparison.
▲ Footnote: Before March 1, Mike briefly tested two local models downloaded to the Mac Mini — Qwen (too slow to function) and Kimi K2.5 (burned too many tokens). Babycakes' identity files ran on both. The song played correctly — just a little slow, a little expensive. Neither lasted more than a few days. They were the proof that the Soul ID works even on hardware that barely works.