Learning discriminative local contexts for person re-identification in vehicle surveillance scenarios

Lin, Xiangyu; Wang, Jing; Huang, Rufei; Wang, Cheng; Zhang, Huizhen

doi:10.1007/s10044-024-01219-6

Learning discriminative local contexts for person re-identification in vehicle surveillance scenarios

Industrial and Commercial Application
Published: 28 February 2024

Volume 27, article number 7, (2024)
Cite this article

Pattern Analysis and Applications Aims and scope Submit manuscript

Xiangyu Lin^1,2,
Jing Wang^1,2,
Rufei Huang^1,2,
Cheng Wang^1,2 &
…
Huizhen Zhang^1,2

56 Accesses
1 Altmetric
Explore all metrics

Abstract

In recent years, person re-identification (Re-ID) has been widely used in intelligent surveillance and security. However, Re-ID faces many challenges in the vehicle surveillance scenario, such as heavy occlusion, misalignment, and similar appearances. Most Re-ID methods focus on learning discriminative global features or dividing regions for local feature learning, which may ignore critical but subtle differences between pedestrians. In this paper, we propose a local context aggregation branch for learning discriminative local contexts at multiple scales, which can supplement the critical detailed information omitted in global features. Specifically, we exploit dilated convolutions to simulate spatial feature pyramid to capture multi-scale spatial contexts efficiently. The essential information that can distinguish different pedestrians is then emphasized. Besides, we construct a Re-ID dataset named BSV for vehicle surveillance scenarios and propose a triplet loss with station constraint enhancement, which utilizes additional valuable station information to construct penalty terms to improve the performance of Re-ID further. Extensive experiments are conducted on the proposed BSV dataset and two standard Re-ID datasets, and the results validate the effectiveness of our method.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Multi-scale Context Aggregation for Video-Based Person Re-Identification

Progressive spatial–temporal transfer model for unsupervised person re-identification

Article 03 April 2024

Learning convolutional multi-level transformers for image-based person re-identification

Article Open access 13 October 2023

Data availability

The Market-1501 and DukeMTMC-reID datasets are published datasets. The proposed BSV dataset is not publicly available due to copyright issues.

References

Ye M, Shen J, Lin G, Xiang T, Shao L, Hoi SC (2021) Deep learning for person re-identification: a survey and outlook. IEEE Trans Pattern Anal Mach Intell 44(6):2872–2893
Article Google Scholar
Luo H, Jiang W, Gu Y, Liu F, Liao X, Lai S, Gu J (2020) A strong baseline and batch normalization neck for deep person re-identification. IEEE Trans Multimedia 22(10):2597–2609
Article Google Scholar
Wang G, Yuan Y, Chen X, Li J, Zhou X (2018) Learning discriminative features with multiple granularities for person re-identification. In: Proceedings of the 26th ACM international conference on multimedia, pp 274–282
Chen G, Gu T, Lu J, Bao J-A, Zhou J (2021) Person re-identification via attention pyramid. IEEE Trans Image Process 30:7663–7676
Article ADS PubMed Google Scholar
Hermans A, Beyer L, Leibe B (2017) In defense of the triplet loss for person re-identification. arXiv preprint arXiv:1703.07737
Zhou K, Yang Y, Cavallaro A, Xiang T (2019) Omni-scale feature learning for person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 3702–3712
He S, Luo H, Wang P, Wang F, Li H, Jiang W (2021) TransReID: transformer-based object re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 15013–15022
Sun Y, Zheng L, Yang Y, Tian Q, Wang S (2018) Beyond part models: person retrieval with refined part pooling (and a strong convolutional baseline). In: Proceedings of the European conference on computer vision (ECCV), pp 480–496
Wang G, Yuan Y, Li J, Ge S, Zhou X (2020) Receptive multi-granularity representation for person re-identification. IEEE Trans Image Process 29:6096–6109
Article ADS Google Scholar
Cheng D, Gong Y, Zhou S, Wang J, Zheng N (2016) Person re-identification by multi-channel parts-based CNN with improved triplet loss function. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 1335–1344
Zheng F, Deng C, Sun X, Jiang X, Guo X, Yu Z, Huang F, Ji R (2019) Pyramidal person re-identification via multi-loss dynamic training. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 8514–8522
Sun Y, Xu Q, Li Y, Zhang C, Li Y, Wang S, Sun J (2019) Perceive where to focus: learning visibility-aware part-level features for partial person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 393–402
Chen T, Ding S, Xie J, Yuan Y, Chen W, Yang Y, Ren, Z, Wang Z (2019) ABD-Net: Attentive but diverse person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 8351–8361
Zhang Z, Lan C, Zeng W, Jin X, Chen Z (2020) Relation-aware global attention for person re-identification. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 3186–3195
Rao Y, Chen G, Lu J, Zhou J (2021) Counterfactual attention learning for fine-grained visual categorization and re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 1025–1034
Chen G, Lin C, Ren L, Lu J, Zhou J (2019) Self-critical attention learning for person re-identification. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 9637–9646
Chen G, Lu J, Yang M, Zhou J (2019) Spatial-temporal attention-aware learning for video-based person re-identification. IEEE Trans Image Process 28(9):4192–4205
Article ADS MathSciNet PubMed Google Scholar
Xu J, Zhao R, Zhu F, Wang H, Ouyang W (2018) Attention-aware compositional network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2119–2128
Li W, Zhu X, Gong S (2018) Harmonious attention network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2285–2294
Fang P, Zhou J, Roy SK, Petersson L, Harandi M (2019) Bilinear attention networks for person retrieval. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 8030–8039
Zheng L, Shen L, Tian L, Wang S, Wang J, Tian Q (2015) Scalable person re-identification: a benchmark. In: Proceedings of the IEEE international conference on computer vision, pp 1116–1124
Ristani E, Solera F, Zou R, Cucchiara R, Tomasi C (2016) Performance measures and a data set for multi-target, multi-camera tracking. In: European conference on computer vision. Springer, pp 17–35
Shimada Y, Takagi M, Taniguchi Y (2019) Person re-identification for estimating bus passenger flow. In: 2019 IEEE conference on multimedia information processing and retrieval (MIPR). IEEE, pp 169–174
Guo J, Xue Y, Cai J, Gao Z, Xu G, Zhang H (2021) A bus passenger re-identification dataset and a deep learning baseline using triplet embedding. Multimedia Tools Appl 80(11):16425–16440
Article Google Scholar
Florian L-C, Adam SH (2017) Rethinking atrous convolution for semantic image segmentation. In: Conference on computer vision and pattern recognition (CVPR), vol 6. IEEE/CVF
Zamir SW, Arora A, Khan S, Hayat M, Khan FS, Yang M-H (2022) Restormer: efficient transformer for high-resolution image restoration. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 5728–5739
Wieczorek M, Siłka J, Woźniak M, Garg S, Hassan MM (2021) Lightweight convolutional neural network model for human face detection in risk situations. IEEE Trans Ind Inf 18(7):4820–4829
Article Google Scholar
Woźniak M, Siłka J, Wieczorek M (2021) Deep learning based crowd counting model for drone assisted systems. In: Proceedings of the 4th ACM MobiCom workshop on drone assisted wireless communications for 5G and beyond, pp 31–36
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Wang H, Shen J, Liu Y, Gao Y, Gavves E (2022) Nformer: robust person re-identification with neighbor transformer. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 7297–7307
Somers V, De Vleeschouwer C, Alahi A (2023) Body part-based representation learning for occluded person re-identification. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, pp 1613–1623
Luo H, Jiang W, Zhang X, Fan X, Qian J, Zhang C (2019) Alignedreid++: dynamically matching local information for person re-identification. Pattern Recognit 94:53–61
Article ADS Google Scholar
Tan H, Liu X, Bian Y, Wang H, Yin B (2021) Incomplete descriptor mining with elastic loss for person re-identification. IEEE Trans Circuits Syst Video Technol 32(1):160–171
Article Google Scholar
Lin Y, Zheng L, Zheng Z, Wu Y, Hu Z, Yan C, Yang Y (2019) Improving person re-identification by attribute and identity learning. Pattern Recognit 95:151–161
Article ADS Google Scholar
Zhu Z, Jiang X, Zheng F, Guo X, Huang F, Sun X, Zheng W (2020) Aware loss with angular regularization for person re-identification. In: Proceedings of the AAAI conference on artificial intelligence, vol 34, pp 13114–13121
Chen W, Chen X, Zhang J, Huang K (2017) Beyond triplet loss: a deep quadruplet network for person re-identification. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 403–412
Ren S, He K, Girshick R, Sun J (2015) Faster R-CNN: toward real-time object detection with region proposal networks. Adv Neural Inf Process Syst 28
Felzenszwalb PF, Girshick RB, McAllester D, Ramanan D (2010) Object detection with discriminatively trained part-based models. IEEE Trans Pattern Anal Mach Intell 32(9):1627–1645
Article PubMed Google Scholar
Deng J, Dong W, Socher R, Li L-J, Li K, Fei-Fei L (2009) ImageNet: a large-scale hierarchical image database. In: 2009 IEEE conference on computer vision and pattern recognition. IEEE, pp 248–255
Radenović F, Tolias G, Chum O (2018) Fine-tuning cnn image retrieval with no human annotation. IEEE Trans Pattern Anal Mach Intell 41(7):1655–1668
Article PubMed Google Scholar

Download references

Acknowledgements

This work was supported by the Project of Science and Technology Plan of Fujian Province (Grant No. 2020H0016).

Author information

Authors and Affiliations

College of Computer Science and Technology, Huaqiao University, Xiamen, 361021, China
Xiangyu Lin, Jing Wang, Rufei Huang, Cheng Wang & Huizhen Zhang
Xiamen Key Laboratory of Computer Vision and Pattern Recognition, Huaqiao University, Xiamen, 361021, China
Xiangyu Lin, Jing Wang, Rufei Huang, Cheng Wang & Huizhen Zhang

Authors

Xiangyu Lin
View author publications
You can also search for this author in PubMed Google Scholar
Jing Wang
View author publications
You can also search for this author in PubMed Google Scholar
Rufei Huang
View author publications
You can also search for this author in PubMed Google Scholar
Cheng Wang
View author publications
You can also search for this author in PubMed Google Scholar
Huizhen Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Jing Wang.

Ethics declarations

Conflict of interest

The authors have no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Lin, X., Wang, J., Huang, R. et al. Learning discriminative local contexts for person re-identification in vehicle surveillance scenarios. Pattern Anal Applic 27, 7 (2024). https://doi.org/10.1007/s10044-024-01219-6

Download citation

Received: 06 January 2023
Accepted: 04 December 2023
Published: 28 February 2024
DOI: https://doi.org/10.1007/s10044-024-01219-6

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Learning discriminative local contexts for person re-identification in vehicle surveillance scenarios

Abstract

Access this article

Similar content being viewed by others

Multi-scale Context Aggregation for Video-Based Person Re-Identification

Progressive spatial–temporal transfer model for unsupervised person re-identification

Learning convolutional multi-level transformers for image-based person re-identification

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

Learning discriminative local contexts for person re-identification in vehicle surveillance scenarios

Abstract

Access this article

Similar content being viewed by others

Multi-scale Context Aggregation for Video-Based Person Re-Identification

Progressive spatial–temporal transfer model for unsupervised person re-identification

Learning convolutional multi-level transformers for image-based person re-identification

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation