AccidentBlip2: Accident Detection With Multi-View MotionBlip2

Shao, Yihua; Cai, Hongyi; Long, Xinwei; Lang, Weiyi; Wang, Zhe; Wu, Haoran; Wang, Yan; Yin, Jiayi; Yang, Yang; Lv, Yisheng; Lei, Zhen

arXiv:2404.12149 (cs)

[Submitted on 18 Apr 2024 (v1), last revised 7 May 2024 (this version, v4)]

Abstract: Intelligent vehicles have demonstrated excellent capabilities in many transportation scenarios, but the limited inference capability of camera-based neural networks constrains the accuracy of accident detection in complex transportation systems. This paper presents AccidentBlip2, a purely vision-based multi-modal large model built on Blip2 for accident detection. Our method first processes the multi-view images through ViT-14g and sends the resulting multi-view features into the cross-attention layer of the Q-Former. Unlike Blip2's Q-Former, our Motion Q-Former extends the self-attention layer with a temporal-attention layer. During inference, the queries generated from previous frames are fed into the Motion Q-Former to aggregate temporal information. The queries are updated with an auto-regressive strategy and sent to an MLP that detects whether an accident has occurred in the surrounding environment. AccidentBlip2 can be extended to a multi-vehicle cooperative system by deploying a Motion Q-Former on each vehicle and simultaneously fusing the generated queries into the MLP for auto-regressive inference. Our approach outperforms existing video large language models in detection accuracy in both single-vehicle and multi-vehicle systems.
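The auto-regressive query update described in the abstract can be illustrated with a toy NumPy sketch. This is not the authors' implementation: the function names (`motion_qformer_step`, `detect_accident`), the tensor shapes, the ordering of the attention stages, the absence of learned projections, and the mean-pooled two-layer MLP head are all assumptions made for illustration. Image features are taken as given, standing in for the output of the vision encoder.

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax.
    x = x - x.max(axis=axis, keepdims=True)
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attention(q, k, v):
    # Single-head scaled dot-product attention, no learned projections
    # (an illustrative simplification).
    scores = q @ k.T / np.sqrt(q.shape[-1])
    return softmax(scores) @ v

def motion_qformer_step(queries, prev_queries, view_feats):
    """One hypothetical Motion Q-Former step for a single frame.

    queries:      (num_queries, dim) initial query embeddings
    prev_queries: (num_queries, dim) queries produced for the previous frame
    view_feats:   (num_views, num_patches, dim) per-view image features
    """
    # Self-attention among the current queries.
    q = queries + attention(queries, queries, queries)
    # Temporal attention: current queries attend to the previous frame's queries.
    q = q + attention(q, prev_queries, prev_queries)
    # Cross-attention over the features of all camera views, flattened together.
    feats = view_feats.reshape(-1, view_feats.shape[-1])
    return q + attention(q, feats, feats)

def detect_accident(frames, init_queries, mlp_params):
    """Run the queries auto-regressively over frames, then classify."""
    w1, b1, w2, b2 = mlp_params
    prev = init_queries
    for view_feats in frames:
        # The output queries of frame t become the temporal context for frame t+1.
        prev = motion_qformer_step(init_queries, prev, view_feats)
    pooled = prev.mean(axis=0)                 # assumed pooling over queries
    hidden = np.maximum(0.0, pooled @ w1 + b1) # ReLU hidden layer
    logit = hidden @ w2 + b2
    return 1.0 / (1.0 + np.exp(-logit))        # accident probability

# Example with random stand-in features: 4 frames, 6 views, 16 patches, dim 32.
rng = np.random.default_rng(0)
dim, hidden_dim = 32, 16
init_queries = rng.normal(size=(8, dim))
frames = [rng.normal(size=(6, 16, dim)) for _ in range(4)]
mlp_params = (rng.normal(size=(dim, hidden_dim)) * 0.1, np.zeros(hidden_dim),
              rng.normal(size=hidden_dim) * 0.1, 0.0)
print(detect_accident(frames, init_queries, mlp_params))
```

Under the same assumptions, a multi-vehicle variant could run `motion_qformer_step` once per vehicle and combine the resulting queries before the MLP; how the paper performs that fusion is not specified in the abstract.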

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2404.12149 [cs.AI]
	(or arXiv:2404.12149v4 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2404.12149

Submission history

From: Yihua Shao
[v1] Thu, 18 Apr 2024 12:54:25 UTC (3,657 KB)
[v2] Fri, 19 Apr 2024 04:13:51 UTC (3,657 KB)
[v3] Mon, 22 Apr 2024 17:07:07 UTC (3,719 KB)
[v4] Tue, 7 May 2024 11:21:57 UTC (3,719 KB)
