Paying Attention to the Features Extracted from the Image to Person Re-Identification



Authors	Zahiri S. H., Iranpoor R., Mehrshad N.
Source	Journal of Electrical and Computer Engineering Innovations - 2025 - Volume: 13 - Issue: 2 - Pages: 267-274
Abstract	Background and objectives: Person re-identification is an important application in computer vision, enabling the recognition of individuals across non-overlapping camera views. However, the large number of pedestrians, with varying appearances, poses, and environmental conditions, makes this task particularly challenging. Various learning approaches have been employed to address these challenges, and achieving a balance between speed and accuracy is a key focus of this research. Recently introduced transformer-based models have made significant strides in machine vision, though they have limitations in terms of computation time and input data. This research aims to balance these models by reducing the input information, focusing attention solely on features extracted by a convolutional neural network. Methods: This research integrates convolutional neural network (CNN) and transformer architectures. A CNN extracts the important features of a person in an image, and these features are then processed by the attention mechanism of a transformer model. The primary objective is to enhance computational speed and accuracy in transformer architectures. Results: The results demonstrate improved performance of the architectures under consistent conditions. On the Market-1501 dataset, the mAP metric increased from approximately 30% in the downsized transformer model to around 74% after applying the proposed modifications. Similarly, the Rank-1 metric improved from 48% to approximately 89%. Conclusion: Although it still has limitations compared to larger transformer models, the downsized transformer architecture has proven to be much more computationally efficient. Applying similar modifications to larger models could also yield positive effects. Balancing computational cost against detection accuracy remains a relative goal that depends on the specific domain and its priorities, and the choice of method may emphasize one aspect over the other.
Keywords	person re-identification, deep learning, image processing, convolutional neural network, computer vision, image detection
Address	University of Birjand, Faculty of Engineering, Department of Electrical Engineering, Iran (all three authors)
Email	nmehrshad@birjand.ac.ir
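
The Methods part of the abstract describes passing CNN features, rather than the raw image, to a transformer's attention mechanism. The following is a minimal sketch of that idea and not the authors' implementation: it assumes the CNN output is already available as a small H x W x C feature map, treats each spatial location as one token, and applies single-head scaled dot-product self-attention with identity projections (a trained model would use learned query, key, and value matrices). The function names and the toy feature map are hypothetical, chosen only for illustration.

```python
import math


def flatten_feature_map(fmap):
    """Turn an H x W x C feature map (nested lists) into H*W tokens of length C."""
    return [list(cell) for row in fmap for cell in row]


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Single-head scaled dot-product self-attention.

    Assumption: queries, keys, and values are the tokens themselves
    (identity projections), so no learned weights are involved.
    Returns the attended tokens and the attention weight matrix.
    """
    dim = len(tokens[0])
    scale = math.sqrt(dim)
    outputs, weights = [], []
    for query in tokens:
        scores = [sum(q * k for q, k in zip(query, key)) / scale for key in tokens]
        row = softmax(scores)
        weights.append(row)
        outputs.append(
            [sum(row[j] * tokens[j][c] for j in range(len(tokens))) for c in range(dim)]
        )
    return outputs, weights


def mean_pool(tokens):
    """Average all tokens into one descriptor vector."""
    count = len(tokens)
    return [sum(t[c] for t in tokens) / count for c in range(len(tokens[0]))]


# Hypothetical 2 x 2 feature map with 3 channels, standing in for CNN output.
feature_map = [
    [[1.0, 0.0, 0.5], [0.2, 0.8, 0.1]],
    [[0.0, 1.0, 0.3], [0.9, 0.1, 0.4]],
]

tokens = flatten_feature_map(feature_map)   # 4 tokens, one per spatial location
attended, attn = self_attention(tokens)     # attention over CNN features only
descriptor = mean_pool(attended)            # one vector describing the person
```

Because attention compares every token with every other token, its cost grows with the square of the token count. A CNN feature map usually has far fewer spatial locations than the input image has patches, which is one plausible reading of how reducing the input information lowers the computational cost mentioned in the abstract.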


