Gradio

Prompt (Supports sections: index:prompt;;;index:prompt)

Use '0:prompt;;;-1:prompt' or '0-2:prompt;;;3:prompt'. Index total sections -1 is last section.

Negative Prompt

Prompt Token Count

Batch Count

Switches to the F1 model (different DiT path and logic).

🏎️ Use F1 Model

Status

Input Image (Video Start)

If checked, 'Input Image (Video Start)' is hidden. Each batch item uses a random image from the folder.

Use Random Images from Folder

Resolution Options (Choose One)

Option 1: Target Resolution (Uses Buckets)

Target bucket size (e.g., 640 for 640x640). Uses input image aspect ratio. Final size divisible by 32.

Total Video Length (seconds)

1 120

Total Video Sections (Overrides seconds if > 0)

Specify exact number of sections. If set, 'Total Video Length (seconds)' is ignored by the backend.

Output FPS

1 60

Seed (-1 for random)

Steps

1 100

Generated Videos (Click to select)

Enable Latent Preview

Use Full Video Previews (slower)

Preview Every N Sections

Generates previews during the sampling loop.

1 50

Latest Preview

LoRa Folder

LoRA 1

Multiplier

0 2

LoRA 2

Multiplier

0 2

LoRA 3

Multiplier

0 2

LoRA 4

Multiplier

0 2

Distilled Guidance Scale (embedded_cfg_scale)

1 20

Guidance Scale (CFG)

Default 1.0 (no CFG), backend recommends not changing.

1 10

CFG Rescale (rs)

Default 0.0, backend recommends not changing.

0 1

Latent Window Size

Default 9

Sample Solver

Enable FP8 precision for the main Transformer model.

Use FP8 DiT

Requires FP8 DiT. Use scaled math (potential quality improvement).

Use Scaled FP8 DiT

Blocks to Swap (to Save VRAM, 0=disable)

Higher values = less VRAM usage but slower generation

0 39

Decode all frames at once instead of section by section.

Bulk Decode Frames (Faster Decode, Higher VRAM)

Attention Mode

VAE Chunk Size (CausalConv3d)

0 or None=disable (Default: None)

VAE Spatial Tile Min Size

0 or None=disable (Default: None)

Device Override (optional)

Enable TeaCache for faster generation (shits hands).

Use TeaCache

TeaCache Init Steps

Steps for TeaCache init (match Inference Steps)

TeaCache Threshold

Relative L1 distance threshold for skipping.

0 1

Prompt

Negative Prompt

Uses og model supports end frame. Default is F1 model.

Use Normal FramePack Model

Batch Count

Status

Progress

Input Video for Extension

Core Generation Parameters

Seed (-1 for random)

Resolution (Max Dimension)

Target max width/height for bucket.

Additional Video Length (seconds)

1 120

Latent Window Size

Default 9 for F1 model.

9 33

Inference Steps

1 100

CFG Scale

Usually 1.0 for F1 (no external CFG).

1 32

Distilled Guidance (GS)

1 32

GPU Memory Preserve (GB)

1 16

Use TeaCache

Force Original Video Resolution (No Resize)

If checked, only the newly generated extension part of the video will be saved.

Save Extension Only

MP4 CRF (Quality)

Lower is better quality, larger file.

0 51

Context Frames (1x from Input)

1 10

VAE Batch Size (Input Video Encoding)

4 128

Attention Mode (DiT)

VAE Chunk Size (CausalConv3d)

0 or None=disable

VAE Spatial Tile Min Size

0 or None=disable

Generated Extended Videos

Latest Section Preview

LoRA Configuration

LoRA Folder

LoRA 1

Multiplier

0 2

LoRA 2

Multiplier

0 2

LoRA 3

Multiplier

0 2

LoRA 4

Multiplier

0 2

Upload Video

Generation Parameters

Status

Input LoRA File

Output Name

Target Format

Choose 'default' for H1111/MUSUBI format, 'other' for diffusion pipe format, or 'Hunyuan to FramePack' for FramePack compatibility.

default other Hunyuan to FramePack

Status

Base DiT Model

Output Model Name

Exclude Single Blocks

Status

LoRA 1

Multiplier

0 2

LoRA 2

Multiplier

0 2

LoRA 3

Multiplier

0 2

LoRA 4

Multiplier

0 2

LoRA Folder

DiT Model Folder