Persona Persistence — AI Learning Course

§ 01

The lesson

Keeping a character consistent · cross-clip identity · the AI Idol problem

THE PERSONA PROBLEM

Each generation forgets

An AI music idol or a brand mascot needs to look the same in every video. The model has no memory of "this is my character" between generations. Naive text prompts ("a woman with red hair, green eyes, freckles") drift — her face changes from clip to clip, her freckles disappear, her hair turns auburn. Persona persistence is the engineering problem of locking identity across hundreds of frames and dozens of generations.

VISUAL DNA

A persona is structured data, not a prompt

Treat the persona as a versioned record: face descriptors (eye color, jaw shape, freckle pattern), wardrobe rules, lighting preferences, posture defaults. Store as JSON. Every generation pulls from the same record. This is what separates a real character from a generated face — without it, every shoot starts at zero.

REFERENCE INJECTION

IP-Adapter + LoRA + DreamBooth, all at once

For maximum persistence: train a DreamBooth on 100+ shots of the persona (one-time), train a LoRA on the persona's specific lighting/aesthetic (recurring), and use IP-Adapter at inference with 3-5 reference frames pulled from the persona's library. The three techniques compound: DreamBooth locks the face shape, LoRA captures the look, IP-Adapter handles per-shot variations.

CROSS-CLIP CONSISTENCY

Same persona, different shots, same identity

For multi-shot videos: feed the same persona reference inputs into every shot — seeds do not transfer identity across shots in DiT models, so "same seed offset" folklore buys you nothing. When a shot drifts, run a face-swap/restore post-pass (ReActor / FaceFusion-class tools) or regenerate the shot. Practical heuristic: if you can't tell the persona is the same across two random frames, audiences won't either — reshoot.

2026

Character reference is now a built-in feature

Commercial models ship identity persistence natively: Runway Gen-4 references, Kling elements, Veo ingredients, Midjourney omni-reference. Zero-shot ID models (InstantID, PuLID, Kontext-class editing) cover most remaining cases with no training at all. The training-heavy LoRA path is still the workhorse for open-weight pipelines that need exact, locked control over a production identity. One compliance note: EU AI Act deepfake transparency obligations apply from August 2026 — synthetic-person content needs disclosure baked into the pipeline.

Then

DreamBooth/LoRA fine-tune per character (2023-24).

Now · June 2026

Built-in references + zero-shot ID models; LoRA reserved for locked production identity.

Persona Persistence spotlight illustration

§ 02

Persona persistence
across frames.