Academia.eduAcademia.edu

Video Signal Processing

description211 papers
group0 followers
lightbulbAbout this topic
Video Signal Processing is the field of study focused on the manipulation, enhancement, and analysis of video signals. It encompasses techniques for encoding, decoding, compression, and transmission of video data, aiming to improve visual quality and efficiency in storage and transmission across various media.
lightbulbAbout this topic
Video Signal Processing is the field of study focused on the manipulation, enhancement, and analysis of video signals. It encompasses techniques for encoding, decoding, compression, and transmission of video data, aiming to improve visual quality and efficiency in storage and transmission across various media.
Video compression standards are implemented in wireless data transmission technologies to provide multimedia services efficiently. These compression standards generally utilize the Discrete Cosine Transform (DCT) in conjunction with... more
The video compression standards commonly adopted in wireless multimedia services utilize variable length codes (VLC) in order to attain high compression ratios. While providing the high data rates required, this technique makes the system... more
This paper describes our recently developed system which captures pen strokes on whiteboards in real time using an off-the-shelf video camera. Unlike many existing tools, our system does not instrument the pens or the whiteboard. It... more
This paper describes our recently developed system which captures pen strokes on whiteboards in real time using an off-the-shelf video camera. Unlike many existing tools, our system does not instrument the pens or the whiteboard. It... more
This paper proposes a multiple-source multiple-sample fusion approach to identity verification. Fusion is performed at two levels: intramodal and intermodal. In intramodal fusion, the scores of multiple samples (e.g. utterances or video... more
Image Inpainting is the technique of filling out a photo with missing details. The purpose of inpainting is to visualize the realistic reconstruction of lost areas in a way that looks to the human eye natural.We present a novel algorithm... more
Fish-cage dysfunction in aquaculture installations can trigger significant negative consequences affecting the operational costs. Low oxygen levels, due to excessive fooling's, leads to decrease growth performance, and feed efficiency.... more
Rate control is a complicated problem in the H.264/AVC coding standard, extra computation is usually needed for the existing rate control schemes to estimate the complexity of frames or macroblocks (MBs). However, during transcoding,... more
Rate control is a complicated problem in the H.264/AVC coding standard, extra computation is usually needed for the existing rate control schemes to estimate the complexity of frames or macroblocks (MBs). However, during transcoding,... more
In this paper, a CMOS realization of the current differencing transconductance amplifier (CDTA) is given, which is a newly reported active building block for current-mode signal processing. Current differencing stage of the CDTA element... more
In this paper we present a novel method for gesture video decomposition based on the depicted content. From the initial content the key-frames are extracted and the neighboring frames are assigned to key-frames of similar content. The... more
What contextual and demographic factors predict drivers' decision to engage in secondary tasks? IET Intelligent Transport Systems, 13(8), pp. 1218-1223.
Deep learning (DL) model performance is intricately tied to the quality of training, influenced by several parameters. Of these, the computing unit employed significantly impacts training efficiency. Traditional setups use central... more
A saliency-based method for generating video summaries is presented, which exploits coupled audiovisual information from both media streams. Efficient and advanced speech and image processing algorithms to detect key frames that are... more
In this paper, we present a comparative study of several state of the art background subtraction methods. Approaches ranging from simple background subtraction with global thresholding to more sophisticated statistical methods have been... more
This paper proposes a multiple-source multiple-sample fusion approach to identity verification. Fusion is performed at two levels: intramodal and intermodal. In intramodal fusion, the scores of multiple samples (e.g. utterances or video... more
This paper proposes a multiple-source multiple-sample fusion approach to identity verification. Fusion is performed at two levels: intramodal and intermodal. In intramodal fusion, the scores of multiple samples (e.g. utterances or video... more
In mixed-resolution (MR) stereoscopic video, one view is presented with a lower resolution compared with the other one; therefore, a lower bitrate, a reduced computational complexity, and a decrease in memory access bandwidth can be... more
This paper presents a configurable Convolutional Neural Network Accelerator (CNNA) for a System on Chip design (SoC). The goal was to accelerate inference of different deep learning networks on an embedded SoC platform. The presented CNNA... more
Providing real-time estimates of building occupancy to first responders during emergency events can help in search and rescue, and egress management. This paper addresses the estimation of occupancy in each zone of a building, where the... more
Providing real-time estimates of building occupancy to first responders during emergency events can help in search and rescue, and egress management. This paper addresses the estimation of occupancy in each zone of a building, where the... more
We present a computationally efficient algorithm for the eigenspace decomposition of correlated images. Our approach is motivated by the fact that for a planar rotation of a twodimensional image, analytical expressions can be given for... more