OpenFork Desktop
A decentralized AI video creation platform with compute marketplace and GitHub-style collaboration.
Integrated Services
Automated workflow orchestration, local Docker management, and seamless P2P job routing.
High-quality video (Text/Image-to-Video)
Tencent's SOTA FP8 video model
LTX-2.3 22B: native audio+video generation (24GB)
DreamID-Omni FP8 talking-head video with identity and voice reference inputs
ControlNet & Flux Image Gen
High-quality Z-Image (Q4_K_M GGUF)
Anima text-to-image illustration model
FLUX.1 Kontext [dev] GGUF Q4_K_M for 8GB VRAM; supports optional two-image composition by precomposing the source and identity reference
Qwen-Image-Edit-2511 instruction-based image editing
Ultra-fast instruction-based editing with an optional second reference image (2 steps)
Clone-ready text-to-speech
High-quality voice cloning
Alibaba's multilingual TTS with 9 premium voices
AI-powered music composition
Sound FX and text-to-audio
AudioX text-to-audio and video-conditioned sound generation
Qwen3 4B for workflow planning, scripts, prompts, and dialogue
Real-time prototyping video model
4-bit quantized music generation (~8GB VRAM)
ACE-Step 1.5 XL music generation for 16GB VRAM
Video-to-audio synthesis (MMAudio small_44k)
High-efficiency speech restoration and enhancement
High-fidelity video-to-audio generation (PRiSM)
daVinci-MagiHuman through Wan2GP with Magi Human Distill SR1080 quanto int8 weights. Requires a start image.
Experimental 16GB SCAIL 14B through Wan2GP for pose-guided character animation at reduced resolution and duration.
Vista4D 384p49 through Wan2GP for source-video novel-view reshooting with predefined camera trajectories.
State-of-the-art video super-resolution (24GB VRAM required)
Baidu ERNIE-Image text-to-image (FP16)