ComfyUI-SurveillanceVL: AI Video Surveillance
What: A production-ready surveillance video analysis system for ComfyUI, built on Qwen3-VL vision-language models.
Why: Automate hours of manual video review and generate structured reports with timestamps and entity detection.
How: Zero-dependency architecture with 5 nodes that process video end-to-end.
Key Stats:
- Installation: 5 minutes, 3 files
- Processing: 2-3 hours of video per hour of compute
- Accuracy: State-of-the-art Qwen3-VL models
- Output: JSON, TXT, CSV formatted reports
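
To make the three output formats concrete, here is a minimal sketch of how a list of timestamped detection events could be serialized as JSON, TXT, and CSV. It is an illustration only: the field names (`timestamp`, `entity`, `description`), the sample events, and the helper functions are all assumptions, not the project's actual report schema or API.

```python
import csv
import io
import json

# Hypothetical detection events; field names and values are assumptions
# made for illustration, not the project's actual schema.
events = [
    {"timestamp": "00:01:15", "entity": "person",
     "description": "Person enters through the front door"},
    {"timestamp": "00:04:02", "entity": "vehicle",
     "description": "Car parks near the gate"},
]

FIELDS = ["timestamp", "entity", "description"]


def to_json(events):
    """Serialize events as a pretty-printed JSON document."""
    return json.dumps({"events": events}, indent=2)


def to_txt(events):
    """Render one human-readable line per event."""
    return "\n".join(
        f"[{e['timestamp']}] {e['entity']}: {e['description']}" for e in events
    )


def to_csv(events):
    """Render events as CSV with a header row."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(events)
    return buf.getvalue()


if __name__ == "__main__":
    print(to_json(events))
    print(to_txt(events))
    print(to_csv(events))
```

Keeping the events as plain dictionaries means the same data can feed all three writers, so the formats differ only in presentation.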
