arXiv · Video generation models

Training-free identity-aware memory method improves consistency in long video generation

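This paper proposes a training-free, identity-aware memory mechanism that targets long-term inconsistency and memory degradation in autoregressive video generation. Existing methods rely on predefined compression strategies or coarse retrieval; this method instead uses identity awareness to keep characters and scenes coherent, and it achieves better results on long-form narrative video generation.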

Domain: arxiv.org
Rating: 4 · Important update
Published: 2026-05-18

Original abstract

Autoregressive video generation has improved rapidly in visual fidelity and interactivity, but it still suffers from long-term inconsistency and memory degradation. Most existing solutions either compress historical frames using predefined strategies or retrieve keyframes based on coarse implicit a…
