logo
Docs

Workflows & Models

OpenFork workflows are Docker-backed GPU services. The website, desktop app, Python client, and cloud images all read from the same service registry so model availability, VRAM requirements, and routing stay aligned.

52

Enabled services

74

Workflow entries

2GB

Lowest VRAM

32GB

Highest VRAM

Source of truth: service metadata lives in services.json. Workflow processors and workflow JSON live in the Python client.

Video

Text-to-video, image-to-video, video-to-video, and native audio-video generation.

daVinci MagiHuman

daVinci-MagiHuman through Wan2GP with Magi Human Distill SR1080 quanto int8 weights. Requires a start image.

16-32GB180-220GB disk

DreamID-Omni

DreamID-Omni FP8 talking-head video with identity and voice reference inputs

24GB140GB disk

Hunyuan 1.5

Tencent's SOTA FP8 video model

16-24GB160-180GB disk

LTX-2.3

LTX-2.3 22B: native audio+video generation (24GB)

12-32GB165-170GB disk

SCAIL

Experimental 16GB SCAIL 14B through Wan2GP for pose-guided character animation at reduced resolution and duration.

16-24GB190GB disk

TurboDiffusion

Real-time prototyping video model

8-12GB160GB disk

Vista4D

Vista4D 384p49 through Wan2GP for source-video novel-view reshooting with predefined camera trajectories.

24GB200GB disk

WAN 2.2

High-quality video (Text/Image-to-Video)

8-24GB160-220GB disk

Image

Text-to-image, instruction editing, inpainting, and illustration models.

Anima

Anima text-to-image illustration model

8-16GB140GB disk

ERNIE-Image

Baidu ERNIE-Image text-to-image (FP16)

16-24GB100-120GB disk

FLUX Kontext

FLUX.1 Kontext [dev] GGUF Q4_K_M for 8GB VRAM; supports optional two-image composition by precomposing the source and identity reference

8-24GB120-140GB disk

Qwen Edit

Qwen-Image-Edit-2511 instruction-based image editing

8-12GB120GB disk

Qwen Turbo

Ultra-fast instruction-based editing with an optional second reference image (2 steps)

8GB120GB disk

Z-Image

High-quality Z-Image (Q4_K_M GGUF)

8-24GB120-160GB disk

Z-Image Turbo

ControlNet & Flux Image Gen

8GB120GB disk

Audio

TTS, voice clone/design, music, sound effects, foley, and speech restoration.

ACE-Step 1.5

ACE-Step 1.5 XL music generation for 16GB VRAM

8-16GB100GB disk

AudioX

AudioX text-to-audio and video-conditioned sound generation

16-24GB120GB disk

Chatterbox

High-quality voice cloning

8GB80GB disk

DiffRhythm

AI-powered music composition

8GB100GB disk

HeartMuLa

4-bit quantized music generation (~8GB VRAM)

16-24GB100GB disk

MMAudio V2A

Video-to-audio synthesis (MMAudio small_44k)

8-16GB80-100GB disk

PRiSM Audio

High-fidelity video-to-audio generation (PRiSM)

8-16GB70GB disk

Qwen3-TTS

Alibaba's multilingual TTS with 9 premium voices

8GB100GB disk

Qwen3-TTS 1.7B

Alibaba's multilingual TTS with voice design (1.7B)

16GB120GB disk

Speech Enhancement

High-efficiency speech restoration and enhancement

2GB60GB disk

Stable Audio

Sound FX and text-to-audio

8GB100GB disk

VibeVoice

Clone-ready text-to-speech

8GB80GB disk

Utilities

Upscaling, prompt helpers, and local utility services used by production workflows.

LLM

Qwen3 4B for workflow planning, scripts, prompts, and dialogue

4GB20GB disk

SparkVSR

State-of-the-art video super-resolution (24GB VRAM required)

24GB70GB disk

How Jobs Run

Submit

The website creates a DGN job with workflow type, inputs, storage target, policy, and billing plan.

Route

The orchestrator chooses a capable provider, preferring cached images and policy-compatible providers.

Process

The Python client pulls the Docker image if needed, runs the processor, uploads outputs, and settles credits or paid earnings.

Job Policies

Choose private, trusted, public, or paid routing