top of page

ComfyUI-SurveillanceVL: AI-Video Surveillance

​

What: Production-ready surveillance video analysis system for ComfyUI using Qwen3-VL vision-language models.

Why: Automate hours of manual video review, generate structured reports with timestamps and entity detection.

How: Zero-dependency architecture with 5 nodes processing video end-to-end.

Key Stats:

  • Installation: 5 minutes, 3 files

  • Processing: 2-3 hours of video per hour of compute

  • Accuracy: State-of-the-art Qwen3-VL models

  • Output: JSON, TXT, CSV formatted reports

bottom of page