Abstract
Detecting small targets in large fields of view is a challenging task. Nowadays, many targets detection models based on the convolutional neural network (CNN) achieve excellent performance. However, these CNN-based detectors are inefficient when applied to tasks of real-time detection of small targets. This paper proposes a small-target detection model in large fields of view based on the tectofugal–thalamofugal–accessory optic system of birds. Within this model, first, we design an unsupervised saliency algorithm to generate saliency regions to suppress background information according to the visual information processing mechanism of the tectofugal pathway of birds. Second, we design a super-resolution (SR) analysis method to enlarge small targets and improve image resolution by the information processing mechanism of the accessory optic system of birds. Then, according to the information processing mechanism of the thalamofugal pathway, we propose a CNN-based method to detect small targets. We further test our model on two public datasets (the VEDAI dataset and DLR 3 K dataset), and the experimental results demonstrate that the proposed detection model outperforms the state-of-the-art methods on small-target detection.
Similar content being viewed by others
References
Abadi M, Agarwal A, Barham P et al (2016) TensorFlow: large-scale machine learning on heterogeneous distributed systems. arXiv:1603.04467
Alexey AB (2018) How to improve object detection. IOP Publishing PhysicsWeb. https://github.com/AlexeyAB/darknet. Accessed on 21 July 2018
Bessette B, Hodos W (1989) Intensity, color, and pattern discrimination deficits after lesions of the core and belt regions of the ectostriatum. Vis Neurosci 2(1):27–34
Boehnke S, Munoz D (2008) On the importance of the transient visual response in the superior colliculus. Curr Opin Neurobiol 18(6):544–551
Bruhn A, Weickert J, Schnörr C (2005) Lucas/Kanade meets horn/Schunck: combining local and global optic flow methods. Int J Comput Vis 61(3):211–231
Butler AB, Hodos W (2005) Comparative vertebrate neuroanatomy: evolution and adaptation. Wiley
Cao Y, Wang G, Yan D, Zhao Z (2015) Two algorithms for the detection and tracking of moving vehicle targets in aerial infrared image sequences. Remote Sens 8(1):28
Chen C, Liu MY, Tuzel O et al (2016) R-CNN for small object detection. Asian conference on computer vision. Springer, Cham, pp 214–230
Ewert J, Buxbaum-Conradi H, Dreisvogt F, Glagow M, Merkel-Harff C, Röttgen A, Schürg-Pfeiffer E, Schwippert WW (2001) Neural modulation of visuomotor functions underlying prey-catching behaviour in anurans: perception, attention, motor performance, learning. Comp Biochem Physiol A Mol Integr Physiol 128(3):417–460
Fecteau JH, Munoz DP (2006) Salience, relevance, and firing: a priority map for target selection. Trends Cogn Sci 10(8):382–390
Fendrich R (1993) The merging of the senses. J Cogn Neurosci 5(3):373–374
Fu C, Liu W, Ranga A, Tyagi A Berg A (2017) Dssd: Deconvolutional single shot detector. arXiv:1701.06659
Gang L, Qianqian Z, Tao H et al (2012) Detecting for the aerial small target in infrared image based on the correlation coefficients of nonsubsampled contourlet transform. IEEE international conference on automation and logistics. IEEE, pp 363–367
Girshick R (2015) Fast r-cnn. Proceedings of the IEEE international conference on computer vision (ICCV), pp 1440–1448
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), pp 580–587
He K, Zhang X, Ren S et al (2015) Delving deep into rectifiers: surpassing human-level performance on imagenet classification. Proceedings of the IEEE international conference on computer vision, pp 1026–1034
He K, Gkioxari G, Dollár P et al (2017) Mask r-cnn. Proceedings of the IEEE international conference on computer vision, pp 2961–2969
Hosang J, Benenson R, Dollár P, Schiele B (2015) What makes for effective detection proposals? IEEE Trans Pattern Anal Mach Intell 38(4):814–830
Hui Z, Wang X, Gao X (2018) Fast and accurate single image super-resolution via information distillation network. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 723–731
Jia Y, Shelhamer E, Donahue J et al (2014) Caffe: convolutional architecture for fast feature embedding. Proceedings of the 22nd ACM international conference on multimedia. ACM, pp 675–678
Ju M, Luo J, Zhang P et al (2019) A simple and efficient network for small target detection. IEEE Access, pp 85771–85781
Khare M, Srivastava RK, Khare A (2014) Single change detection-based moving object segmentation by using Daubechies complex wavelet transform. IET Image Process 8(6):334–344
Kong T, Yao A, Chen Y et al (2016) Hypernet: towards accurate region proposal generation and joint object detection. Proceedings of IEEE conference on computer vision and pattern recognition, pp 845–853
Kong L, Zhu X, Wang G (2018) Context semantics for small target detection in large-field images with two cascaded faster R-CNNs. J. Phys Conf Ser, IOP publishing 1069(1):012138. https://doi.org/10.1088/1742-6596/1069/1/012138
Ku J, Mozifian M, Lee J et al (2018) Joint 3d proposal generation and object detection from view aggregation. 2018 IEEE/RSJ international conference on intelligent robots and systems (IROS). IEEE, pp 1–8
Lai W S, Huang J B, Ahuja N et al (2017) Deep laplacian pyramid networks for fast and accurate super-resolution. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 624–632
Li D, Li M, Zheng J et al (2017) Joint rotation invariant feature for vehicle detection in aerial images. Ninth international conference on digital image processing (ICDIP), International Society for Optics and Photonics, 10420: 104200W
Li J, Liang X, Wei Y et al (2017) Perceptual generative adversarial networks for small object detection. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1222–1230
Lin TY, Dollár P, Girshick R et al (2017) Feature pyramid networks for object detection. IEEE Conference on Computer Vision and Pattern Recognition, pp 2117–2125
Liu K, Mattyus G (2015) Fast multiclass vehicle detection on aerial images. Geoscience and Remote Sensing Letters 12(9):1938–1942
Liu W, Anguelov D, Erhan D et al (2016) Ssd: single shot multibox detector. European conference on computer vision. Springer, Cham, pp 21–37
Lou J, Zhu W, Wang H, Ren M (2016) Small target detection combining regional stability and saliency in a color image. Multimed Tools Appl 76(13):14781–14798
Mandal M, Shah M, Meena P et al (2019) SSSDET: simple short and shallow network for resource efficient vehicle detection in aerial scenes. IEEE international conference on image processing (ICIP), pp 3098–3102
Mandal M, Shah M, Meena P et al (2019) AVDNet: A Small-Sized Vehicle Detection Network for Aerial Visual Data. IEEE Geoscience and Remote Sensing Letters, pp 1–5
Medina L, Reiner A (2000) Do birds possess homologues of mammalian primary visual, somatosensory and motor cortices? Trends Neurosci 23(1):1–12
Mysore P, Asadollahi A, Knudsen E (2010) Global inhibition and stimulus competition in the owl optic Tectum. J Neurosci 30(5):1727–1738
Northmore D (2011) Optic tectum. Encyclopedia of fish physiology: from genome to environment. Elsevier, pp 131–142
Razakarivony S, Jurie F (2016) Vehicle detection in aerial imagery: a small target detection benchmark. J Vis Commun Image Represent 34:187–203
Redmon J, Farhadi A (2017) YOLO9000: better, faster, stronger. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 7263–7271
Redmon J, Farhadi A (2018) Yolov3: An incremental improvement. arXiv:1804.02767
Redmon J, Divvala S, Girshick R et al (2016) You only look once: unified, real-time object detection. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788
Reiner A, Yamamoto K, Karten H (2005) Organization and evolution of the avian forebrain. Anat Rec 287(1):1080–1102
Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39(6):1137–1149
Ren Y, Zhu C, Xiao S (2018) Small object detection in optical remote sensing images via modified faster R-CNN. Appl Sci 8(5):813
Sermanet P, Eigen D, Zhang X et al (2013) OverFeat: integrated recognition, localization and detection using convolutional networks. arXiv:1312.6229
Shi W, Caballero J, Huszár F et al (2016) Real-time single image and video super-resolution using an efficient sub-pixel convolutional neural network. Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1874–1883
Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv:1409.1556
Sobral A, Vacavant A (2014) A comprehensive review of background subtraction algorithms evaluated with synthetic and real videos. Comput Vis Image Underst 122:4–21
Suganyadevi K, Malmurugan N (2014) OFGM-SMED: an efficient and robust foreground object detection in compressed video sequences. Eng Appl Artif Intell 28:210–217
Uijlings J, van de Sande KE, Gevers T, Smeulders A (2013) Selective search for object recognition. Int J Comput Vis 104(2):154–171
Vedaldi A, Lenc K (2015) Matconvnet: convolutional neural networks for matlab. Proceedings of the 23rd ACM international conference on multimedia. ACM, pp 689–692
Wang P, Tian JW, Gao CQ (2009) Infrared small target detection using directional highpass filters based on LS-SVM. Electron Lett 45(3):156–158
Yang Y, Cao P, Yang Y, Wang S (2008) Corollary discharge circuits for saccadic modulation of the pigeon visual system. Nat Neurosci 11(5):595–602
Yang MY, Liao W, Li X et al (2019) Vehicle detection in aerial images. Photogramm Eng Remote Sens 85(4):297–304
Zhong J, Lei T, Yao G (2017) Robust vehicle detection in aerial images based on cascaded convolutional neural networks. Sensors 17(12):2720
Acknowledgments
This work was supported by the National Natural Science Foundation of China (NSFC) General program (61673353), Young Scientist Fund of NSFC (61603344) and Key research projects of Henan colleges and Universities(15A120017).
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Wang, Z., Liu, D., Lei, Y. et al. Small target detection based on bird’s visual information processing mechanism. Multimed Tools Appl 79, 22083–22105 (2020). https://doi.org/10.1007/s11042-020-08807-8
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-020-08807-8