V3Trans-Crowd: A Video-based Visual Transformer for Crowd Management Monitoring

Yuqi Zuo, Aymen Hamrouni, Hakim Ghazzai, Yehia Massoud

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Autonomously monitoring and analyzing the behavior of the crowd is an open research topic in the transportation field. The real-time identification, tracking, and prediction of the crowd behavior is primordial to ensure smooth crowd management operations in many public areas such as public transport stations and streets. First, the complexity brought by the interaction and fusion from individual to group that needs to be assessed and analyzed. Second, the classification of these actions which might be useful in identifying danger and avoiding any undesired consequences. In this paper, we propose a transformer-based crowd management monitoring framework called V3Trans-Crowd that captures information from video data and extracts meaningful output to categorize the behavior of the crowd. We provide an improved hierarchical transformer for multi-modal tasks. Inspired by 3D visual transformer, our proposed 3D visual model, V3Trans-Crowd, has been shown to achieve great performances in terms of accuracy compared to state-of-the-art methods, all tested on the standard Crowd-11 dataset.

Original languageEnglish (US)
Title of host publication2023 IEEE International Conference on Smart Mobility, SM 2023
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages154-159
Number of pages6
ISBN (Electronic)9798350312751
DOIs
StatePublished - 2023
Event2023 IEEE International Conference on Smart Mobility, SM 2023 - Thuwal, Saudi Arabia
Duration: Mar 19 2023Mar 21 2023

Publication series

Name2023 IEEE International Conference on Smart Mobility, SM 2023

Conference

Conference2023 IEEE International Conference on Smart Mobility, SM 2023
Country/TerritorySaudi Arabia
CityThuwal
Period03/19/2303/21/23

Keywords

  • computer vision
  • Crowd behavior analysis
  • Crowd management
  • visual transformer

ASJC Scopus subject areas

  • Artificial Intelligence
  • Computer Networks and Communications
  • Computer Vision and Pattern Recognition
  • Control and Optimization
  • Transportation

Fingerprint

Dive into the research topics of 'V3Trans-Crowd: A Video-based Visual Transformer for Crowd Management Monitoring'. Together they form a unique fingerprint.

Cite this