Daily · 16 May 2026
Top 100 AI Video Generation Models of 2026
Ranked from 1 to 100. Generated by /lad, illustrated by /iad.
#1
Google Veo 3.1
Google DeepMind's flagship video model, the only true 4K-native generator with synchronized native audio (dialogue, ambient, music) in a single generation. Strongest all-around quality in 2026 with 8s clips, scene continuation, and ingredient-based composition.
#2
OpenAI Sora 2
OpenAI's physics simulation produces the most realistic motion of 2026. Sora 2 was released alongside the Sora app; note that OpenAI has since announced that the Sora web and mobile apps will be discontinued, with the API to follow later in 2026.
#3
Kling 3.0
Kuaishou's Kling 3.0 leads in motion-heavy content and dance/sports realism. Kling 3.0 Omni adds native audio. Strong for action sequences and dynamic camera moves.
#4
Runway Gen-4.5
Runway's reference model for ads, client work, and tight creative control. Ecosystem advantage with director-style camera controls, motion brushes, and Act-One performance capture.
#5
Wan 2.2 (Alibaba)
Best overall open-source video generator in 2026. The 14B Mixture-of-Experts model outperforms several closed commercial models on VBench, with separate experts for high- and low-noise diffusion stages.
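As a rough illustration of what "separate experts for high- and low-noise diffusion stages" means, here is a minimal sketch of routing denoising steps between two experts. The linear noise schedule, the 0.5 boundary, and the names pick_expert and schedule are assumptions made for illustration; they are not taken from Wan 2.2's code.

```python
from typing import List


def pick_expert(noise_level: float, boundary: float = 0.5) -> str:
    """Choose an expert for one denoising step.

    The boundary value is a hypothetical placeholder, not Wan 2.2's setting.
    """
    return "high_noise" if noise_level >= boundary else "low_noise"


def schedule(num_steps: int, boundary: float = 0.5) -> List[str]:
    """List which expert handles each step of a toy linear schedule."""
    # Assumed schedule: noise falls linearly from 1.0 toward 0.0.
    levels = [1.0 - i / num_steps for i in range(num_steps)]
    return [pick_expert(level, boundary) for level in levels]


if __name__ == "__main__":
    for step, expert in enumerate(schedule(8)):
        print(step, expert)
```

Only one expert runs per step in a scheme like this, so per-step compute stays close to that of a single expert even though total parameters are higher.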
#6
Luma Ray3.14
Luma Labs' Ray family — Dream Machine successor with strong text-to-video, image-to-video, and keyframe interpolation. The favorite for short-form social creators.
#7
Seedance 2.0
ByteDance's high-end video model with native audio generation and strong consistency across long clips. Native to CapCut and ByteDance creator surfaces.
#8
HunyuanVideo (Tencent)
13B-parameter open-source video model, the largest open model at its release. Beats Runway Gen-3 on many cinematic-quality benchmarks, with strong motion accuracy and ecosystem support.
#9
Pika 2.5
Pika Labs' creator-focused model — best-in-class for daily Reels/TikTok/Shorts publishing. Pikaffects library and ingredient inputs make it the most fun-to-use of the consumer-tier tools.
#10
Hailuo / MiniMax Video-01
MiniMax's Hailuo Video produces remarkably smooth motion at a low price point. Frequently the top open-access model on text-to-video leaderboards.
#11
Mochi 1 (Genmo)
10B-parameter open-source video model released under the Apache 2.0 license. Built on an Asymmetric Diffusion Transformer design; the strategic choice when licensing freedom matters.
#12
LTX-Video (Lightricks)
Open-source model from Lightricks that generates 30fps at 1216×704 faster than real time on capable hardware. The speed leader for live and interactive use cases.
#13
CogVideoX (Tsinghua)
Open-source video model that is strong on image-to-video. CogVideoX-5B is the practical choice for moderate hardware; the smaller 2B variant lowers compute requirements at a quality cost.
#14
Stable Video Diffusion (Stability AI)
Stability's open image-to-video model. Heavily fine-tuned by the community; foundation for many ComfyUI workflows and downstream specialized models.
#15
Vidu (Shengshu)
Chinese long-form video model with strong character consistency across cuts. Reference cameo features make it popular for branded short series.
#16
HeyGen
Top-ranked AI avatar / talking-head generator. Realistic lip-sync, multilingual cloning, and Studio for full scene composition. The default for talking-head explainer content.
#17
Synthesia
Enterprise-grade AI avatar platform with broad language support and a large pre-built avatar library. Dominant in L&D, training, and internal corporate video.
#18
ElevenLabs Conversational Video
ElevenLabs' avatar product layered on their voice models. Tight lip-sync and best-in-class voice quality, with conversational turn-taking.
#19
D-ID
Photo-to-talking-head specialist. Animates portrait images with synced speech; widely embedded in marketing and customer-service products.
#20
Hedra
Audio-first character video generator — feed audio + a character image, get expressive lip-synced video. Strong for music videos and podcast clips.
#21
Sync Labs
Best-in-class dedicated lip-sync model. Drops dialogue onto existing video and translates speech with mouth-shape correction.
#22
Topaz Video AI
Upscaling and frame-interpolation suite for footage. The desktop standard for 4K/8K upscale, slow-mo, and stabilization in post.
#23
Descript Underlord
Descript's AI editor that operates on the transcript. Generates B-roll, fills awkward cuts, and now drafts entire scenes from a prompt.
#24
CapCut AI
ByteDance's mass-consumer creator tool with Seedance generation, AI scripting, and mobile-first editing. The most-used AI video app in the world by daily actives.
#25
InVideo AI
Web-based AI video creator that drafts full scripts, picks B-roll, and renders end-to-end from a prompt. Strong for marketers and faceless YouTube channels.
#26
Krea Video
Krea's video generation surface aggregates multiple underlying models (Veo, Kling, Wan, Hailuo) with a unified prompt UI and real-time generation feel.
#27
Hailuo S2V-01
MiniMax's subject-to-video — keeps a reference subject consistent across generated clips. Strong for branded character series.
#28
Runway Act-One
Runway's performance-capture feature — drive an animated character from your own facial expressions via webcam. Game-changer for animated short workflows.
#29
Pika Pikaffects
Pika's library of one-tap effects (squish, cake-ify, melt, crush) that go viral on TikTok. The effect-pack approach turned out to be a winning UX bet.
#30
Veo 3 Fast
Google's lower-latency, lower-cost variant of Veo 3 for high-volume use cases. Strong for production pipelines that need many shorter clips.
#31
Sora Storyboard
OpenAI's pre-shutdown storyboard interface for Sora — sequence multiple shots into a single timeline with consistency across cuts.
#32
Runway Aleph
Runway's in-context video editing model — describe a change in natural language and Aleph re-renders the existing clip with the edit applied.
#33
ByteDance OmniHuman
Single-image-to-full-body-animated-video model. Realistic gesture and body motion conditioned on speech audio.
#34
Genmo Mochi 2
Successor to Mochi 1 with longer clip length and stronger character consistency. Open weights with permissive licensing.
#35
AnimateDiff
Open-source motion module for Stable Diffusion that animates any SD checkpoint. The community workhorse for stylized animated content via ComfyUI.
#36
ComfyUI Video Workflows
ComfyUI's node-graph environment is the de-facto research and production surface for open-source video models. Workflows distributed via Civitai and Hugging Face.
#37
WaveSpeed AI
Cloud platform that hosts dozens of open-source video models with cheaper inference than the originals. Common backend for indie creators.
#38
fal.ai Video
Inference platform with one-click access to the major video models and a shared LoRA library. Tightly integrated with Replicate and HF Spaces.
#39
Replicate Video Models
Replicate hosts a wide library of video models with simple HTTP APIs and per-second billing. Default platform for engineers prototyping video features.
#40
Hailuo I2V-01
MiniMax image-to-video — turn a still into a 6-second clip. Praised for natural motion that respects the original composition.
#41
Genmo Replay
Genmo's image-to-video product with adjustable motion intensity and keyframe controls. Popular with stop-motion-style creators.
#42
Higgsfield AI
Cinematic-camera-motion-focused video generator. Pre-baked director-style camera moves (push-in, jib, drone) make it strong for film mood pieces.
#43
Viggle
Character-controllable video generator — feed a character image and a motion reference, output the character performing that motion. Massive on TikTok.
#44
Domo AI
Discord-native AI video tool with strong style transfer and animation generation. Large community for anime and stylized output.
#45
PixVerse
Consumer video generator with strong character animation and template-driven viral effects. Distributed via mobile and web.
#46
NotebookLM Video Overview
Google's NotebookLM produces narrated 'video overview' explainer assets directly from notebook sources. Default for daily explainer pipelines (used in /vad).
#47
Steve AI
Animated explainer and scribe-video generator with a deep template library. Strong for L&D and faceless YouTube.
#48
Pictory
Long-form-to-short-form AI editor that pulls highlights from podcasts and webinars into social clips. Used heavily by content marketers.
#49
Filmora AI
Wondershare's Filmora ships AI scene cutting, smart cutout, and text-to-video. The mass-market alternative to Premiere/DaVinci for prosumers.
#50
Adobe Firefly Video
Adobe's video model integrated into Premiere Pro for generative extend, object removal, and B-roll. Training on commercially safe data is the differentiator.
#51
Premiere Pro Generative Extend
Adobe's flagship in-app generative video feature — extend a clip by 2-4 seconds without reshoots. The feature most pro editors actually use daily.
#52
DaVinci Resolve AI
Blackmagic's AI feature set inside Resolve — magic mask, smart reframe, voice isolation, and now generative fill. Free tier makes it the most-used pro editor on earth.
#53
Magnific Relight Video
Magnific's video relighting model — change time-of-day, weather, and lighting in existing footage. Adopted into Freepik's video stack.
#54
Topaz Video AI Astra
Topaz's research-heavy upscaler that turns SD/HD into convincing 4K with frame interpolation. Standard for archival video restoration.
#55
RVM (Robust Video Matting)
Open-source real-time video matting model. Default choice for live and recorded green-screen-free background removal at the indie level.
#56
DreaMoving
Alibaba's pose-controlled human video generator. Drives a still character image with a motion reference video; a predecessor to the Wan family.
#57
AnimateAnyone
Alibaba research model that animates a reference character with a pose sequence. Influential in the avatar-from-image space.
#58
Stable Video 4D
Stability AI's 4D model — generates novel-view video from a single input video. Enables free-camera relighting and re-shooting in post.
#59
Phenaki
Google's variable-length text-to-video research model that pioneered long-form story generation. Influential precursor to Veo.
#60
Make-A-Video (Meta)
Meta's text-to-video research model — among the first to demonstrate quality long-form motion. Foundation for Meta's later video work.
#61
Emu Video (Meta)
Meta's two-stage text-to-video model. Used inside Reels for generative effects and tested as a creator tool inside Instagram.
#62
ModelScope Text2Video
Alibaba DAMO's early open text-to-video model on Hugging Face. Foundational for many community fine-tunes that followed.
#63
Tora (Alibaba)
Alibaba's trajectory-controlled video diffusion model — draw a path on a still image, get video that follows that motion path.
#64
MotionDirector
Open-source LoRA-style motion fine-tuning for video diffusion. Specialize a base model on a custom motion pattern with a handful of clips.
#65
ToonCrafter
Open-source cartoon in-between frame generator. Given two key cartoon frames it interpolates the in-between motion with style preservation.
#66
OpenSora
Open-source replication and extension of Sora-style models from HPC-AI Tech. Unlike the closed Sora, it is practical to fine-tune on consumer hardware.
#67
EasyAnimate
Alibaba PAI's open text-to-video / image-to-video model. Strong on Chinese-language prompts, with support for high resolutions.
#68
Allegro (Rhymes AI)
Open-source short-clip text-to-video model with a permissive license. Targeted at developers building custom video features in apps.
#69
Pyramid Flow
Open-source video model that uses pyramidal flow matching for efficient long-clip generation. Strong quality-per-VRAM ratio.
#70
Step-Video-T2V
StepFun's open 30B-parameter text-to-video model. Strong on Chinese-language scenes and complex multi-character composition.
#71
Lightricks Videoleap
Mobile-first AI video editor from Lightricks. AI dub, scene generation, and effects designed for vertical short-form on iOS/Android.
#72
VEED.IO AI
Browser-based AI video editor — auto-subtitle, AI avatar, magic edit. Default tool for many one-person SaaS marketing teams.
#73
Tavus
Personalized video generation for sales and CS — clone a sales rep's avatar and generate per-recipient personalized videos at scale.
#74
Colossyan
Avatar-led L&D video platform. Multi-avatar conversations, branching scenarios, and a corporate template library.
#75
DeepBrain AI
Korean AI avatar leader, especially in broadcast and news. Hyper-realistic studio anchors and 100+ language coverage.
#76
Hour One
Avatar video at enterprise scale — used in onboarding, training, and product walkthroughs. Heavy template library for SaaS use cases.
#77
AdCreative.ai Video
Performance-marketing-focused AI video generator. Generates ad creatives from a brand kit with CTR-prediction scoring built in.
#78
Argil
Influencer-clone avatar generator targeted at creators who want to scale their face across multiple shorts per day.
#79
Lumen5
Article-to-video conversion platform — paste a blog post, get a captioned short. Workhorse for newsrooms and content marketers.
#80
Animaker AI
Animated explainer and character video platform with an enormous template library. Strong for non-designers producing animated content.
#81
Vyond AI
Enterprise animated explainer tool with a script-to-video AI assistant. Common in compliance training and corporate comms.
#82
Plotaverse / Plotagraph
Niche 'living photo' animator — adds subtle motion to stills (waterfalls, hair, clothes). Heavy use on Instagram.
#83
Wonder Studio (Autodesk)
AI-driven CG character replacement and motion capture from regular video footage. Acquired by Autodesk for VFX pipelines.
#84
Move.ai
Markerless motion capture from regular cameras. The default mocap-from-iPhone solution for indie game studios and animation houses.
#85
Rokoko Vision
Markerless motion capture using webcam or phone, with Rokoko Studio integration. Popular for solo animators.
#86
Cuebric
AI virtual production tool — generates 2.5D backgrounds suitable for LED-volume virtual sets. Used in indie film and ad production.
#87
Krea Realtime Video
Krea's realtime canvas-to-video feature — paint or change a scene live and see the video animate accordingly. Live performance use cases.
#88
AI Voice + Video stack: ElevenLabs + Veo
The most-used end-to-end production combo in 2026: ElevenLabs Voice 2 for narration paired with Veo 3.1 for visuals. Drives most faceless YouTube channels.
#89
AI Studios (DeepBrain)
DeepBrain's studio product — script, avatar, voice, scenes in one timeline. Mid-market alternative to Synthesia and HeyGen.
#90
Cinemia Animatic
AI animatic generation for film pre-vis — convert a script into shot-listed storyboards plus motion previews. Used by indie filmmakers.
#91
MagicAnimate
ByteDance research model for high-fidelity human image animation from a pose sequence. Strong dance-video reproduction.
#92
GenTron / GenTube
Researcher-favorite open-source baseline for text-to-video diffusion experiments. Code released alongside academic papers.
#93
RIFE / Practical-RIFE
Open-source frame interpolation model. The community standard for converting 24fps animation to 60fps, used widely in restoration.
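The 24fps-to-60fps conversion comes down to simple arithmetic: each output frame falls at a fractional position between two source frames, and the interpolator synthesizes a frame at that fraction. The sketch below only computes that plan; interpolation_plan is a hypothetical helper, not part of RIFE, and it assumes constant frame rates.

```python
from fractions import Fraction
from typing import List, Tuple


def interpolation_plan(
    src_fps: int, dst_fps: int, num_output_frames: int
) -> List[Tuple[int, int, Fraction]]:
    """For each output frame, return (left_frame, right_frame, t).

    t is the blend position between the two source frames; t == 0 means
    the output frame coincides with the left source frame.
    """
    plan = []
    for k in range(num_output_frames):
        # Position of output frame k, measured in source-frame units.
        pos = Fraction(k * src_fps, dst_fps)
        left = pos.numerator // pos.denominator
        plan.append((left, left + 1, pos - left))
    return plan


if __name__ == "__main__":
    for left, right, t in interpolation_plan(24, 60, 6):
        print(f"between source frames {left} and {right} at t={t}")
```

For 24 to 60 the pattern repeats every five output frames (t = 0, 2/5, 4/5, 1/5, 3/5), so only one output frame in five is an untouched source frame.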
#94
Topaz Astra Frame Interpolation
Premium frame interpolation tier inside Topaz Video AI. Smarter than RIFE on real-world footage but paid.
#95
Real-ESRGAN Video
Real-ESRGAN-based open-source video super-resolution. Community-favorite fast 2× upscaler for anime and stylized footage.
#96
ElevenLabs Soundscape for Video
ElevenLabs' model for generated sound effects and ambient audio. Pair with a silent video clip to produce a full mixed scene.
#97
Suno Bark + Video Sync
Pair Suno-generated music with any of the major video models for full audiovisual generation. The default music side of the AI music-video pipeline.
#98
Reve Video
Reve Studio's video product — high-quality short clips with strong text-rendering inside frames. Targeted at design-led teams.
#99
Anthropic Skills with /vidai pipeline
Claude Code's /vidai skill orchestrates NotebookLM + ffmpeg + cinematic-mode video assembly into a one-command 'video a day' pipeline. The author-tool side of the stack.
#100
Remotion
React-based programmatic video framework. Increasingly the rendering layer for AI-generated content because LLMs are excellent at writing Remotion compositions.