ComfyUI Wan2.2 Fun Inp Start-End Frame Video Generation Example

Wan2.2-Fun-Inp is a start-end frame controlled video generation model launched by Alibaba PAI team. It supports inputting start and end frame images to generate intermediate transition videos, providing creators with greater creative control. The model is released under the Apache 2.0 license and supports commercial use. Key Features:

Start-End Frame Control: Supports inputting start and end frame images to generate intermediate transition videos, enhancing video coherence and creative freedom
High-Quality Video Generation: Based on the Wan2.2 architecture, outputs film-level quality videos
Multi-Resolution Support: Supports generating videos at 512×512, 768×768, 1024×1024 and other resolutions to suit different scenarios

Model Version:

14B High-Performance Version: Model size exceeds 32GB, with better results but requires high VRAM

Below are the relevant model weights and code repositories:

🤗Wan2.2-Fun-Inp-14B
Code repository: VideoX-Fun

Make sure your ComfyUI is updated.

Workflows in this guide can be found in the Workflow Templates. If you can’t find them in the template, your ComfyUI may be outdated.(Desktop version’s update will delay sometime)If nodes are missing when loading a workflow, possible reasons:

Not using the latest ComfyUI version(Nightly version)
Using Stable or Desktop version (Latest changes may not be included)
Some nodes failed to import at startup

Wan2.2 Fun Inp Start-End Frame Video Generation Workflow Example

This workflow provides two versions:

A version using Wan2.2-Lightning 4-step LoRA from lightx2v for accelerated video generation
A fp8_scaled version without acceleration LoRA

Below are the test results using an RTX4090D 24GB VRAM GPU at 640×640 resolution with 81 frames

Model Type	VRAM Usage	First Generation Time	Second Generation Time
fp8_scaled	83%	≈ 524s	≈ 520s
fp8_scaled + 4-step LoRA	89%	≈ 138s	≈ 79s

Since the acceleration with LoRA is significant but the video dynamic is lost, the provided workflows enable the accelerated LoRA version by default. If you want to enable the other workflow, select it and use Ctrl+B to activate.

1. Download Workflow File

Please update your ComfyUI to the latest version, and find “Wan2.2 Fun Inp” under the menu Workflow -> Browse Templates -> Video to load the workflow. Or, after updating ComfyUI to the latest version, download the workflow below and drag it into ComfyUI to load.

Download JSON Workflow

Use the following materials as the start and end frames Wan2.2 Fun Control ComfyUI Workflow Start Frame Material

Wan2.2 Fun Control ComfyUI Workflow End Frame Material

2. Models

Diffusion Model

Lightning LoRA (Optional, for acceleration)

VAE

wan_2.1_vae.safetensors

Text Encoder

umt5_xxl_fp8_e4m3fn_scaled.safetensors

ComfyUI/
├───📂 models/
│   ├───📂 diffusion_models/
│   │   ├─── wan2.2_fun_inpaint_high_noise_14B_fp8_scaled.safetensors
│   │   └─── wan2.2_fun_inpaint_low_noise_14B_fp8_scaled.safetensors
│   ├───📂 loras/
│   │   ├─── wan2.2_i2v_lightx2v_4steps_lora_v1_high_noise.safetensors
│   │   └─── wan2.2_i2v_lightx2v_4steps_lora_v1_low_noise.safetensors
│   ├───📂 text_encoders/
│   │   └─── umt5_xxl_fp8_e4m3fn_scaled.safetensors 
│   └───📂 vae/
│       └── wan_2.1_vae.safetensors

3. Workflow Guide

This workflow uses LoRA. Please make sure the corresponding Diffusion model and LoRA are matched.

High noise model and LoRA loading
- Ensure the Load Diffusion Model node loads the wan2.2_fun_inpaint_high_noise_14B_fp8_scaled.safetensors model
- Ensure the LoraLoaderModelOnly node loads the wan2.2_i2v_lightx2v_4steps_lora_v1_high_noise.safetensors
Low noise model and LoRA loading
- Ensure the Load Diffusion Model node loads the wan2.2_fun_inpaint_low_noise_14B_fp8_scaled.safetensors model
- Ensure the LoraLoaderModelOnly node loads the wan2.2_i2v_lightx2v_4steps_lora_v1_low_noise.safetensors
Ensure the Load CLIP node loads the umt5_xxl_fp8_e4m3fn_scaled.safetensors model
Ensure the Load VAE node loads the wan_2.1_vae.safetensors model
Upload the start and end frame images as materials
Enter your prompt in the Prompt group
Adjust the size and video length in the WanFunInpaintToVideo node
- Adjust the width and height parameters. The default is 640. We set a smaller size, but you can modify it as needed.
- Adjust the length, which is the total number of frames. The current workflow fps is 16. For example, if you want to generate a 5-second video, you should set it to 5*16 = 80.
Click the Run button, or use the shortcut Ctrl(cmd) + Enter to execute video generation

Get Started

Basic Concepts

Interface Guide

Tutorials

Troubleshooting

Community

ComfyUI Wan2.2 Fun Inp Start-End Frame Video Generation Example