Nadakudity, Sai Sita Anusha (2015) Object Tracking from Audio and Video data using Linear Prediction method. MTech thesis.
PDF 2326Kb |
Abstract
Microphone arrays and video surveillance by camera are widely used for detection and tracking of a moving speaker. In this project, object tracking was planned using multimodal fusion i.e., Audio-Visual perception. Source localisation can be done by GCC-PHAT, GCC-ML for time delay estimation delay estimation. These methods are based on spectral content of the speech signals that can be effected by noise and reverberation. Video tracking can be done using Kalman filter or Particle filter. Therefore Linear Prediction method is used for audio and video tracking. Linear prediction in source localisation use features related to excitation source information of speech which are less effected by noise. Hence by using this excitation source information, time delays are estimated and the results are compared with GCC PHAT method. The dataset obtained from [20] is used in video tracking a single moving object captured through stationary camera. Then for object detection, projection histogram is done followed by linear prediction for tracking and the corresponding results are compared with Kalman filter method.
Item Type: | Thesis (MTech) |
---|---|
Uncontrolled Keywords: | Linear prediction,source localisation,video tracking |
Subjects: | Engineering and Technology > Electronics and Communication Engineering > Image Processing Engineering and Technology > Electronics and Communication Engineering > Signal Processing |
Divisions: | Engineering and Technology > Department of Electronics and Communication Engineering |
ID Code: | 6775 |
Deposited By: | Mr. Sanat Kumar Behera |
Deposited On: | 30 Dec 2015 15:21 |
Last Modified: | 30 Dec 2015 15:21 |
Supervisor(s): | Roy, L P |
Repository Staff Only: item control page