Video Prediction Using Multi-Scale Deep Neural Networks
   
Authors: Shayanfar Nima, Derhami Vali, Rezaeian Mehdi
Source: Journal of AI and Data Mining, 2022, Volume 10, Issue 3, Pages 423-431
Abstract: In video prediction, the goal is to predict the next frame of a video given a sequence of input frames. Although numerous studies address frame prediction, satisfactory performance has not yet been achieved, so the task remains an open problem. In this work, multi-scale processing is studied for video prediction, and a new network architecture for multi-scale processing is presented. The architecture belongs to the broad family of autoencoders and comprises an encoder and a decoder. A pretrained VGG network serves as the encoder and processes a pyramid of input frames at multiple scales simultaneously. The decoder is based on 3D convolutional neurons. The architecture is evaluated on three datasets of varying difficulty and compared with two conventional autoencoders. The results indicate that combining the pretrained network with multi-scale processing yields a well-performing approach.
Keywords: deep learning, convolutional autoencoder, video prediction, multiscale processing
Address: Yazd University, Computer Engineering Department, Iran; Yazd University, Computer Engineering Department, Iran; Yazd University, Computer Engineering Department, Iran
Email: mrezaeian@yazd.ac.ir
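
The "pyramid of input frames at multiple scales" mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not state the number of scales, the downsampling method, or the frame format, so the three scales, the 2x2 average pooling, and the grayscale nested-list frames below are all assumptions. The function names are hypothetical and are not taken from the paper.

```python
from typing import List

Frame = List[List[float]]  # assumed format: grayscale frame as rows of pixel intensities


def downsample_2x(frame: Frame) -> Frame:
    """Halve height and width by averaging each non-overlapping 2x2 block."""
    # Ignore a trailing odd row or column so every block is complete.
    h = len(frame) // 2 * 2
    w = len(frame[0]) // 2 * 2
    return [
        [
            (frame[y][x] + frame[y][x + 1] + frame[y + 1][x] + frame[y + 1][x + 1]) / 4.0
            for x in range(0, w, 2)
        ]
        for y in range(0, h, 2)
    ]


def build_pyramid(frame: Frame, num_scales: int = 3) -> List[Frame]:
    """Return the frame at full, 1/2, 1/4, ... resolution (num_scales levels).

    Assumes the frame is large enough that no level shrinks below 1x1.
    """
    levels = [frame]
    for _ in range(num_scales - 1):
        levels.append(downsample_2x(levels[-1]))
    return levels


def pyramid_for_sequence(frames: List[Frame], num_scales: int = 3) -> List[List[Frame]]:
    """Group a frame sequence by scale: result[s][t] is frame t at scale s."""
    per_frame = [build_pyramid(f, num_scales) for f in frames]
    return [[levels[s] for levels in per_frame] for s in range(num_scales)]
```

In an architecture of the kind the abstract describes, each scale's frame sequence would presumably be passed to the encoder, but how the paper's encoder combines the scales is not specified in this record.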
 
     
   
  
 
 

Copyright 2023
Islamic World Science Citation Center
All Rights Reserved