Research And Implementation On FPGA Based Real-time Specific Speech Recognition System

Posted on:2014-09-03

Degree:Master

Type:Thesis

Country:China

Candidate:Z H Yu

Full Text:PDF

GTID:2268330401476813

Subject:Communication and Information System

Abstract/Summary:

PDF Full Text Request

Specific speech recognitionï¼Œmeans to recognize the same or similar speech segments whichare given in the template library at the dynamic voice stream. As a branch of the specific audioretrieval, specific speech recognition can used to detect garbage voice in the telecommunicationsnetwork and other fields, such as advertisement detection and copyright protection. With theincreasing number of templates, the specific speech recognition system is faced with thechallenge of real-time performance. While the real-time performance means multi-channelreal-time processing capabilities of the system, which is the key point for the system to be putinto practicality.This dissertation relies on the key project in information and technology domain of theNational863Program, combining with the real-time and multi-access needs over thetelecommunications network. Aiming at reaching the real-time performance under the conditionof high-capacity voice template library (template numbers more than8000), the main work andachievements of this dissertation can be outlined as follows:1ï¼ŽA multi-processing element parallel architecture based on the superscalar architecture isproposed. This architecture fully excavates the parallelism of the specific speech recognitionalgorithm. It employs the super-word parallel technology in the processing element. We designeda data processing unit of256bits, in the condition of the units of frame in the voice signalprocessing and the max parallelism degree8of the super-word parallel. The parallelmulti-processing units use the superscalar architecture. As every processing element canaccomplish the matching processing, the architecture is scalable.2ï¼ŽPutting forward data storage and scheduling mechanism to make the each processingelement in the architecture to do their best anytime. For the template data storage, we study ahybrid storage structure of the combination of centralized shared storage and distributed storage,and we design the secondary storage mechanism of the template data and PE internal doublebuffer (Double buffer DB) structure, to achieve the efficient data storage system. We study thetemplate distribution strategy of a round robin distribution mechanism and a first request firstdistribution mechanism for the template data distribution.3ï¼ŽIn order to solve the problem of excessive IP (Intellectual Property) usage whencalculating large-size squares using embedded multipliers in FPGAs directly, this thesis presentsan efficient method for square calculation. The proposed method first through a simple iterativelogic circuit to reduce the width of operands, then combined with the embedded DSP48E tocomplete square calculation. Experimental and simulation results show that the method can getgood results in reducing the usage of DSP48E. 4ï¼ŽCombining with MicroBlaze processor embedded in Xilinxâ€™s FPGA, making the proposedprocessing architecture as a co-processor, we realize the specific speech recognition SOPCsystem on an FPGA platform. The system uses the shared RAM interface mode based on theAXI4Bus protocol to realize the control and data exchange between the main processor and thecoprocessor. By testing the performance of the system and doing the resources analysis, we getthe results that the system fulfill the22-access real-time recognition tasks instantaneously withthe number of templates is8192, realize the multi-access real-time processing under thecondition of large-capacity template library.

Keywords/Search Tags:

Specific Speech Recognition, Real-time, Multiple Processing Element, ParallelProcessing, AXI4Bus, FPGA

PDF Full Text Request

Related items

1	Applied Research On Specific Word Chinese Speech Recognition System
2	A Real-Time Speech Recognition System Based On The Implementation Of FPGA
3	The Research And Implementation Of High Performance Embedded Real-time Image Processing And Recognition System
4	Speech Recognition Of Scene-specific Words For Non-specific People
5	The Research Of Real-time Image Detection Technology Based On FPGA
6	Study Of Speech Recognition Of Isolated Words Which Is From Non-specific Based On ESN
7	Speech Recognition Research Of Non-specific Person’s Isolated Words Based On DTW Model
8	Automatic Speech Recognition Technology (asr) Software And Real-time Implementation
9	Development Of A DSP & FPGA Based Real Time MIMO Prototyping Testbed
10	Research Of SAR Real-time Signal Processing Based On FPGA