Attention Based CNN-ConvLSTM for Pedestrian Attribute Recognition. 2020

Yang Li, Huahu Xu, Minjie Bian, and Junsheng Xiao
School of Computer Engineering and Science, Shanghai University, Shanghai 200444, China.

Pedestrian attribute recognition plays an important role in video surveillance and has become an active area of computer vision research. The task is challenging because of variations in viewpoint, illumination, and resolution, and because of occlusion. Existing methods perform unsatisfactorily because they ignore the correlations among pedestrian attributes and the associated spatial information. To address this, this paper treats the task as a spatiotemporal, sequential, multi-label image classification problem and proposes an attention-based neural network, CNN-CAtt-ConvLSTM, consisting of a convolutional neural network (CNN), channel attention (CAtt), and convolutional long short-term memory (ConvLSTM). First, a pre-trained CNN and CAtt extract the salient and correlated visual features of pedestrian attributes. Then, ConvLSTM further extracts spatial information and the correlations among pedestrian attributes. Finally, the attributes are predicted in optimized sequences ordered by attribute image area size and importance. Extensive experiments on two common pedestrian attribute datasets, the PEdesTrian Attribute (PETA) dataset and the Richly Annotated Pedestrian (RAP) dataset, show higher performance than other state-of-the-art (SOTA) methods, demonstrating the superiority and validity of our method.
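
The abstract does not spell out how the channel attention (CAtt) step is computed. As a rough illustration only, the sketch below assumes a squeeze-and-excitation style design: each channel of the CNN feature map is averaged to a single value, passed through a small two-layer bottleneck, and turned into a sigmoid gate that rescales the channel. The function name channel_attention, the weight matrices w1 and w2, and the toy input are hypothetical and are not taken from the paper.

```python
import math


def channel_attention(feature_map, w1, w2):
    """Assumed SE-style channel attention on a C x H x W nested-list feature map.

    w1 has shape (hidden, C) and w2 has shape (C, hidden); both are
    hypothetical learned weights (biases omitted for brevity).
    """
    # Squeeze: global average pooling gives one scalar per channel.
    squeezed = [
        sum(sum(row) for row in channel) / (len(channel) * len(channel[0]))
        for channel in feature_map
    ]
    # Excitation, step 1: bottleneck layer with ReLU.
    hidden = [max(0.0, sum(w * s for w, s in zip(row, squeezed))) for row in w1]
    # Excitation, step 2: project back to C channels and apply a sigmoid gate.
    gates = [
        1.0 / (1.0 + math.exp(-sum(w * h for w, h in zip(row, hidden))))
        for row in w2
    ]
    # Rescale: multiply every value in a channel by that channel's gate.
    scaled = [
        [[gate * value for value in row] for row in channel]
        for gate, channel in zip(gates, feature_map)
    ]
    return gates, scaled


# Toy input: 2 channels of size 2 x 2, one active and one empty.
feature_map = [
    [[1.0, 1.0], [1.0, 1.0]],
    [[0.0, 0.0], [0.0, 0.0]],
]
w1 = [[1.0, -1.0]]    # hidden = 1, C = 2
w2 = [[1.0], [-1.0]]  # C = 2, hidden = 1
gates, scaled = channel_attention(feature_map, w1, w2)
print(gates)
```

In the pipeline the abstract describes, a gated feature map of this kind would then be passed on to the ConvLSTM stage; that stage and the attribute ordering step are not sketched here.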

UI | MeSH Term | Description | Entries
D007091 | Image Processing, Computer-Assisted | A technique of inputting two-dimensional or three-dimensional images into a computer and then enhancing or analyzing the imagery into a form that is more useful to the human observer. | Biomedical Image Processing; Computer-Assisted Image Processing; Digital Image Processing; Image Analysis, Computer-Assisted; Image Reconstruction; Medical Image Processing; Analysis, Computer-Assisted Image; Computer-Assisted Image Analysis; Computer Assisted Image Analysis; Computer Assisted Image Processing; Computer-Assisted Image Analyses; Image Analyses, Computer-Assisted; Image Analysis, Computer Assisted; Image Processing, Biomedical; Image Processing, Computer Assisted; Image Processing, Digital; Image Processing, Medical; Image Processings, Medical; Image Reconstructions; Medical Image Processings; Processing, Biomedical Image; Processing, Digital Image; Processing, Medical Image; Processings, Digital Image; Processings, Medical Image; Reconstruction, Image; Reconstructions, Image
D010363 | Pattern Recognition, Automated | In INFORMATION RETRIEVAL, machine-sensing or identification of visible patterns (shapes, forms, and configurations). (Harrod's Librarians' Glossary, 7th ed) | Automated Pattern Recognition; Pattern Recognition System; Pattern Recognition Systems
D006801 | Humans | Members of the species Homo sapiens. | Homo sapiens; Man (Taxonomy); Human; Man, Modern; Modern Man
D000069636 | Pedestrians | Persons traveling on foot. |
D000465 | Algorithms | A procedure consisting of a sequence of algebraic formulas and/or logical steps to calculate or determine a given task. | Algorithm
D001288 | Attention | Focusing on certain aspects of current experience to the exclusion of others. It is the act of heeding or taking notice or concentrating. | Focus of Attention; Selective Attention; Social Attention; Attention Focus; Attention, Selective; Attention, Social; Selective Attentions
D014741 | Video Recording | The storing or preserving of video signals to be played back later via a transmitter or receiver. | Audiovisual Recording; Videorecording; Audiovisual Recordings; Recording, Audiovisual; Recording, Video; Recordings, Audiovisual; Recordings, Video; Video Recordings; Videorecordings
D016571 | Neural Networks, Computer | A computer architecture, implementable in either hardware or software, modeled after biological neural networks. Like the biological system in which the processing capability is a result of the interconnection strengths between arrays of nonlinear processing nodes, computerized neural networks, often called perceptrons or multilayer connectionist models, consist of neuron-like units. A homogeneous group of units makes up a layer. These networks are good at pattern recognition. They are adaptive, performing tasks by example, and thus are better for decision-making than are linear learning machines or cluster analysis. They do not require explicit programming. | Computational Neural Networks; Connectionist Models; Models, Neural Network; Neural Network Models; Neural Networks (Computer); Perceptrons; Computational Neural Network; Computer Neural Network; Computer Neural Networks; Connectionist Model; Model, Connectionist; Model, Neural Network; Models, Connectionist; Network Model, Neural; Network Models, Neural; Network, Computational Neural; Network, Computer Neural; Network, Neural (Computer); Networks, Computational Neural; Networks, Computer Neural; Networks, Neural (Computer); Neural Network (Computer); Neural Network Model; Neural Network, Computational; Neural Network, Computer; Neural Networks, Computational; Perceptron
D056667 | Biometric Identification | A method of differentiating individuals based on the analysis of qualitative or quantitative biological traits or patterns. Biometric identification, which has applications in forensics and identity theft prevention, includes DNA profiles or DNA FINGERPRINTS; FINGERPRINTS; AUTOMATED FACIAL RECOGNITION; IRIS scan; RETINA scan; hand geometry; vascular patterns; automated VOICE pattern recognition; ultrasound of fingers; and X-RAYS. | Automated Identity Recognition; Biometric Authentication; Authentication, Biometric; Identification, Biometric; Identity Recognition, Automated
D057567 | Memory, Long-Term | Remembrance of information from 3 or more years previously. | Memory, Longterm; Memory, Remote; Remote Memory; Long-Term Memories; Long-Term Memory; Longterm Memories; Longterm Memory; Memories, Long-Term; Memories, Longterm; Memories, Remote; Memory, Long Term; Remote Memories
