Video Evaluation Dashboard
EvalForge v0.1.0 · 200 prompts · Evaluated 2026-03-15
Model Rankings
| # | Model | Provider | Grade | Overall | Subject Consistency | Background Consistency | Temporal Flickering | Motion Smoothness | Dynamic Degree | Aesthetic Quality | Imaging Quality | Overall Consistency | Text Alignment |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 | Veo 3.1 | google_ai | Excellent | 91.0 | 94.0 | 93.0 | 95.0 | 92.0 | 85.0 | 91.0 | 90.0 | 92.0 | 88.0 |
| 2 | Kling 2.6 Pro | fal_ai | Good | 87.0 | 91.0 | 89.0 | 92.0 | 88.0 | 82.0 | 87.0 | 86.0 | 88.0 | 83.0 |
| 3 | Seedance 1.5 | bytedance | Good | 84.0 | 89.0 | 87.0 | 90.0 | 86.0 | 80.0 | 84.0 | 83.0 | 85.0 | 81.0 |
| 4 | Wan 2.2 | alibaba | Good | 81.0 | 86.0 | 85.0 | 88.0 | 84.0 | 78.0 | 80.0 | 82.0 | 83.0 | 79.0 |
| 5 | LTX 2.3 | lightricks | Moderate | 78.0 | 83.0 | 82.0 | 86.0 | 81.0 | 84.0 | 72.0 | 74.0 | 78.0 | 76.0 |
Model Comparison Radar
Score Heatmap
| Model | Subject Consistency | Background Consistency | Temporal Flickering | Motion Smoothness | Dynamic Degree | Aesthetic Quality | Imaging Quality | Overall Consistency | Text Alignment |
|---|---|---|---|---|---|---|---|---|---|
| Veo 3.1 | 94.0 | 93.0 | 95.0 | 92.0 | 85.0 | 91.0 | 90.0 | 92.0 | 88.0 |
| Kling 2.6 Pro | 91.0 | 89.0 | 92.0 | 88.0 | 82.0 | 87.0 | 86.0 | 88.0 | 83.0 |
| Seedance 1.5 | 89.0 | 87.0 | 90.0 | 86.0 | 80.0 | 84.0 | 83.0 | 85.0 | 81.0 |
| Wan 2.2 | 86.0 | 85.0 | 88.0 | 84.0 | 78.0 | 80.0 | 82.0 | 83.0 | 79.0 |
| LTX 2.3 | 83.0 | 82.0 | 86.0 | 81.0 | 84.0 | 72.0 | 74.0 | 78.0 | 76.0 |
Category Breakdown
Veo 3.192.0
Subject Consistency
95
Text Alignment
90
Overall Consistency
93
Kling 2.6 Pro87.0
Subject Consistency
92
Text Alignment
85
Overall Consistency
89
Seedance 1.584.0
Subject Consistency
90
Text Alignment
82
Overall Consistency
86
Wan 2.281.0
Subject Consistency
87
Text Alignment
80
Overall Consistency
84
LTX 2.377.0
Subject Consistency
84
Text Alignment
77
Overall Consistency
79