{Analysis and implementation of the position-pitch source localization algorithm on a hybrid reconfigurable CPU}
Abstract: Source localization is an important component of speech processing systems: combined with beamforming, automated source localization makes it possible to enhance the speech of individual speakers in reverberant environments, providing signals with an improved signal-to-noise ratio for subsequent speech processing algorithms. The source localization algorithm used in this thesis consists of four main parts: a Gammatone filterbank, cross-correlation, Position-Pitch decomposition, and a final evaluation with interframe tracking. This thesis presents an analysis of the algorithm with the aim of finding reasonable calibrations with respect to its accuracy and time performance. To achieve a real-time implementation, the algorithm is ported to reconfigurable, FPGA-based hardware. The hardware used is a Stretch S6 hybrid reconfigurable CPU, on which certain computationally expensive parts of the code can be offloaded to the reconfigurable hardware. Cross-correlation is identified as the most computationally expensive part of the source localization, and several approaches to it have been implemented on the hardware. To weigh the advantages and disadvantages of the different approaches, their measured time performance is compared.
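
To illustrate why cross-correlation tends to dominate the computational cost, the following is a minimal sketch of a direct time-domain cross-correlation between two microphone channels, followed by a delay estimate taken from the correlation peak. It is not the implementation used in the thesis: the function names `cross_correlation` and `estimate_delay`, the `max_lag` parameter, and the toy signals are assumptions made for illustration. In the algorithm described above, the correlation would operate on the outputs of the Gammatone filterbank rather than on raw samples.

```python
def cross_correlation(x, y, max_lag):
    """Direct cross-correlation of two equally sampled channels.

    Returns a dict mapping each lag in [-max_lag, max_lag] to
    sum_i x[i] * y[i + lag], skipping indices outside y.
    (Hypothetical helper, not taken from the thesis.)
    """
    result = {}
    for lag in range(-max_lag, max_lag + 1):
        acc = 0.0
        for i in range(len(x)):
            j = i + lag
            if 0 <= j < len(y):
                acc += x[i] * y[j]
        result[lag] = acc
    return result


def estimate_delay(x, y, max_lag):
    """Return the lag (in samples) at which the correlation peaks."""
    cc = cross_correlation(x, y, max_lag)
    return max(cc, key=cc.get)


# Toy example: y is x delayed by two samples.
x = [0, 0, 1, 2, 1, 0, 0, 0]
y = [0, 0, 0, 0, 1, 2, 1, 0]
delay = estimate_delay(x, y, max_lag=3)
```

In this direct form, each frame of N samples costs on the order of N multiply-accumulate operations per lag, and the work is repeated for every lag, every filterbank channel, and every microphone pair. That repetition is one plausible reason this stage is a natural candidate for hardware acceleration.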
