Overview

2020-04-01 2026-07-15

CV / Overview

an hour read (About 6844 words)

There are the overall of paper with code for CV / AIGC / LLM / VLM.
https://github.com/Gojay001/paper-with-code-skills.
[Updating…]

DL, Overview

电商视频生成：任务版图、技术路线与研究空白

2026-07-15 2026-07-15

Research / Ecommerce Video

a few seconds read (About 12 words)

通用视频生成追求“看起来合理”；电商视频还要求“与正在售卖的那个 SKU 一致”。本综述梳理商品图直生视频、人—商品演示、广告规划剪辑、品牌与商业意图、生成—真实混合等路线，并给出可复用机制与 Gap Map（G11–G20）。

AIGC, Research, Video Generation, Wiki

Qwen-Image-Agent — Bridging the Context Gap in Real-World Image Generation

2026-07-09 2026-07-15

Agent / Agentic Image Generation

a few seconds read (About 15 words)

痛点：用户说「做张海报」往往缺细节——隐含意图、实时知识、历史对话。T2I 模型训练时吃「完整 prompt」，部署时吃「残缺 context」→ 作者称 Context Gap（用户 context ≠ 生成所需 context）。

AIG, Agent, DL

InnoAds-Composer — Efficient Condition Composition for E-Commerce Poster Generation

2026-07-04 2026-07-15

AIGC / Diffusion Model

a few seconds read (About 14 words)

任务：电商海报 = 一张图里同时摆对商品主体、促销文案、背景风格。多阶段 pipeline（先合成场景再贴字）常出现主体走样、文字错字、风格不统一。

AIGC, DL, DM

MoFu — Scale-Aware Modulation and Fourier Fusion for Multi-Subject Video Generation

2026-07-04 2026-07-15

AIGC / Video Generation

a few seconds read (About 13 words)

任务：多主体视频生成——给定文本 + 多张参考图，生成多主体一致、尺度自然的视频。

AIGC, DL, VG

Lay2Story — Extending Diffusion Transformers for Layout-Togglable Story Generation

2026-07-03 2026-07-15

AIGC / Video Generation

a few seconds read (About 14 words)

Storytelling：用一组 prompt 生成多帧图，主角外观要一致。现有 training-free（改 cross-frame attention）和 training-based 都难精细控制位置、衣着、表情、姿势，且缺大规模带 layout 标注的数据。

AIGC, DL, VG

Overview

电商视频生成：任务版图、技术路线与研究空白

Qwen-Image-Agent — Bridging the Context Gap in Real-World Image Generation

InnoAds-Composer — Efficient Condition Composition for E-Commerce Poster Generation

MoFu — Scale-Aware Modulation and Fourier Fusion for Multi-Subject Video Generation

Lay2Story — Extending Diffusion Transformers for Layout-Togglable Story Generation

Tag Cloud

Categories

Recent

Archives

Recent

Archives

Your browser is out-of-date!