Wan2.2-Fun-Inp is a start-end frame controlled video generation model released by the Alibaba PAI team. Given a start frame image and an end frame image, it generates the transition video between them, which gives creators more control over the result. The model is released under the Apache 2.0 license and supports commercial use.
Key Features:
  • Start-End Frame Control: Supports inputting start and end frame images to generate intermediate transition videos, enhancing video coherence and creative freedom
  • High-Quality Video Generation: Based on the Wan2.2 architecture, outputs film-level quality videos
  • Multi-Resolution Support: Supports generating videos at 512×512, 768×768, 1024×1024 and other resolutions to suit different scenarios
Model Version:
  • 14B High-Performance Version: The model is larger than 32GB; it produces better results but requires a large amount of VRAM
Below are the relevant model weights and code repositories:
Make sure your ComfyUI is updated. Workflows in this guide can be found in the Workflow Templates. If you can't find them in the templates, your ComfyUI may be outdated (updates to the Desktop version lag behind).
If nodes are missing when loading a workflow, possible reasons:
  1. Not using the latest ComfyUI version (Nightly version)
  2. Using the Stable or Desktop version (the latest changes may not be included)
  3. Some nodes failed to import at startup

Wan2.2 Fun Inp Start-End Frame Video Generation Workflow Example

This workflow provides two versions:
  1. A version using Wan2.2-Lightning 4-step LoRA from lightx2v for accelerated video generation
  2. A fp8_scaled version without acceleration LoRA
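Below are the test results using an RTX4090D 24GB VRAM GPU at 640×640 resolution with 81 frames:
| Model Type               | VRAM Usage | First Generation Time | Second Generation Time |
|--------------------------|------------|-----------------------|------------------------|
| fp8_scaled               | 83%        | ≈ 524s                | ≈ 520s                 |
| fp8_scaled + 4-step LoRA | 89%        | ≈ 138s                | ≈ 79s                  |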
Because the LoRA speeds up generation significantly, the provided workflows enable the accelerated LoRA version by default, even though it loses some video dynamics. If you want to enable the other workflow, select it and press Ctrl+B to activate it.
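To put the speed difference in numbers, the short sketch below divides the timings from the table above. The dictionary layout and the `speedup` helper are illustrative names, not part of ComfyUI; only the timings come from this guide.

```python
# Timings in seconds, copied from the benchmark table above
# (RTX4090D 24GB, 640x640, 81 frames).
timings = {
    "fp8_scaled": {"first": 524, "second": 520},
    "fp8_scaled + 4-step LoRA": {"first": 138, "second": 79},
}


def speedup(baseline: float, accelerated: float) -> float:
    """Return how many times faster the accelerated run is than the baseline."""
    return baseline / accelerated


base = timings["fp8_scaled"]
lora = timings["fp8_scaled + 4-step LoRA"]

first_speedup = speedup(base["first"], lora["first"])
second_speedup = speedup(base["second"], lora["second"])

print(f"First generation:  about {first_speedup:.1f}x faster with the LoRA")
print(f"Second generation: about {second_speedup:.1f}x faster with the LoRA")
```

On these measurements the LoRA version is roughly 3.8 times faster on the first run and roughly 6.6 times faster on the second.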

1. Download Workflow File

Update ComfyUI to the latest version, then find “Wan2.2 Fun Inp” under the menu Workflow -> Browse Templates -> Video to load the workflow. Alternatively, after updating, download the workflow below and drag it into ComfyUI to load it.

Download JSON Workflow

Use the following materials as the start and end frames:
[Image: Start Frame Material]
[Image: End Frame Material]

2. Models

Download the following models and save them to the folders shown below:
  • Diffusion Model
  • Lightning LoRA (Optional, for acceleration)
  • VAE
  • Text Encoder
ComfyUI/
├───📂 models/
│   ├───📂 diffusion_models/
│   │   ├─── wan2.2_fun_inpaint_high_noise_14B_fp8_scaled.safetensors
│   │   └─── wan2.2_fun_inpaint_low_noise_14B_fp8_scaled.safetensors
│   ├───📂 loras/
│   │   ├─── wan2.2_i2v_lightx2v_4steps_lora_v1_high_noise.safetensors
│   │   └─── wan2.2_i2v_lightx2v_4steps_lora_v1_low_noise.safetensors
│   ├───📂 text_encoders/
│   │   └─── umt5_xxl_fp8_e4m3fn_scaled.safetensors 
│   └───📂 vae/
│       └─── wan_2.1_vae.safetensors
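
Before loading the workflow, it can help to confirm that every file sits in the folder shown in the tree above. The following is a minimal sketch using only the Python standard library; the `find_missing_models` helper and the `"ComfyUI"` root path are assumptions for illustration, so point the path at your own installation.

```python
from pathlib import Path

# File names and subfolders copied from the directory tree above.
EXPECTED_FILES = {
    "diffusion_models": [
        "wan2.2_fun_inpaint_high_noise_14B_fp8_scaled.safetensors",
        "wan2.2_fun_inpaint_low_noise_14B_fp8_scaled.safetensors",
    ],
    "loras": [  # optional, only needed for the accelerated workflow
        "wan2.2_i2v_lightx2v_4steps_lora_v1_high_noise.safetensors",
        "wan2.2_i2v_lightx2v_4steps_lora_v1_low_noise.safetensors",
    ],
    "text_encoders": ["umt5_xxl_fp8_e4m3fn_scaled.safetensors"],
    "vae": ["wan_2.1_vae.safetensors"],
}


def find_missing_models(comfyui_root: str) -> list[Path]:
    """Return the expected model paths that do not exist under comfyui_root/models."""
    models_dir = Path(comfyui_root) / "models"
    missing = []
    for subfolder, names in EXPECTED_FILES.items():
        for name in names:
            path = models_dir / subfolder / name
            if not path.is_file():
                missing.append(path)
    return missing


if __name__ == "__main__":
    # "ComfyUI" is a placeholder; replace it with your installation directory.
    for path in find_missing_models("ComfyUI"):
        print(f"missing: {path}")
```

If the script prints nothing, all six files are in place. The two LoRA files can be ignored if you only use the workflow without acceleration.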

3. Workflow Guide

[Image: Workflow Step Image]
This workflow uses LoRA. Make sure each diffusion model is paired with its matching LoRA (high noise with high noise, low noise with low noise).
  1. High noise model and LoRA loading
    • Ensure the Load Diffusion Model node loads the wan2.2_fun_inpaint_high_noise_14B_fp8_scaled.safetensors model
    • Ensure the LoraLoaderModelOnly node loads the wan2.2_i2v_lightx2v_4steps_lora_v1_high_noise.safetensors
  2. Low noise model and LoRA loading
    • Ensure the Load Diffusion Model node loads the wan2.2_fun_inpaint_low_noise_14B_fp8_scaled.safetensors model
    • Ensure the LoraLoaderModelOnly node loads the wan2.2_i2v_lightx2v_4steps_lora_v1_low_noise.safetensors
  3. Ensure the Load CLIP node loads the umt5_xxl_fp8_e4m3fn_scaled.safetensors model
  4. Ensure the Load VAE node loads the wan_2.1_vae.safetensors model
  5. Upload the start and end frame images as materials
  6. Enter your prompt in the Prompt group
  7. Adjust the size and video length in the WanFunInpaintToVideo node
    • Adjust the width and height parameters. Both default to 640, a deliberately small size; modify them as needed.
    • Adjust the length, which is the total number of frames. The current workflow runs at 16 fps, so to generate a 5-second video, for example, set it to 5 × 16 = 80.
  8. Click the Run button, or use the shortcut Ctrl (Cmd on macOS) + Enter, to execute video generation
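
The length setting in step 7 is simple arithmetic: duration in seconds times the workflow's 16 fps. The sketch below shows that calculation. Both helper names are hypothetical. The second helper rests on an assumption this guide does not state: that Wan-family workflows tend to use frame counts of the form 4n + 1 (the benchmark above uses 81 frames). Treat it as optional.

```python
def frames_for_duration(seconds: float, fps: int = 16) -> int:
    """Total number of frames for a clip of the given duration at the given fps."""
    return int(round(seconds * fps))


def round_up_to_4n_plus_1(frames: int) -> int:
    """Round a frame count up to the next value of the form 4n + 1.

    Assumption: frame counts of the form 4n + 1 are a common choice for
    Wan-family workflows. This guide itself only gives the seconds * fps rule.
    """
    n = (frames + 2) // 4  # integer form of ceil((frames - 1) / 4)
    return 4 * n + 1


length = frames_for_duration(5)          # 5 seconds at 16 fps
adjusted = round_up_to_4n_plus_1(length)

print(f"length from the guide's formula: {length}")
print(f"rounded up to 4n + 1 (assumption): {adjusted}")
```

For a 5-second clip this gives 80 from the guide's formula and 81 after the optional rounding, which matches the frame count used in the benchmark.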