Font Size: a A A

Video Channel Selecting System Based On Voice Recognition

Posted on:2009-12-18Degree:MasterType:Thesis
Country:ChinaCandidate:R ZhangFull Text:PDF
GTID:2178360272976379Subject:Software engineering
Abstract/Summary:PDF Full Text Request
The rapid development of computer technology in the next few years, a variety of computer-related technology will be rapidly developped and used ,so that the computer will become more humane and intelligent. The multi-media and network technologies in the future is even more useful in the development of computer technology. Voice recognition technology will allow machines to adapt to human language. Voice recognition technology research from the early 1950s and started in the mid-60 as an important issue to work. Today, Voice recognition technology has been applied in many areas, For example: in the field of telephone communications, such as Taiwan, paging, telephone switching centers or information; in the financial sector, such as home banking or stock information; in the field of tourism services, such as plane tickets or hotels, such as scheduled. With the advent of the Internet information age, Voice recognition technology and multimedia network technology will become inevitable.Flash technology is the most popular network among multimedia technology, Flash is a vector-based graphics systems, only a small amount of vector data can describe a complex object, the storage space occupied by only a few thousand parts per bitmap, A very suitable for usage in the network. At the same time, it has the support of the bitmap, sound support, support for gradual change color, Alpha transparent, and other functions. Interactive Flash animation is even more important feature, users can click on the button, select the menu to control the playback of Flash files. Flash has become increasingly mainstream multimedia network, but also the future development of the WEB's a big school. The system will be voice recognition technology and Flash multimedia network technology combined with the application of the project, the system through the introduction of the corresponding Flash control to achieve the Flash video player, and then to voice recognition engine based on the realization of voice signals with the interactive Flash movie. System server-side and client communication using Socket technology to achieve in order to facilitate the system to other operating systems transplantation. ATK use voice recognition engine for voice recognition. ATK is a research and development of the University of Cambridge English voice recognition engine, can achieve a specific English phrases and sentences for the identification, which is the predecessor of the HTK in the Linux environment, the compiler development. The subject of this system is by ATK engine in the process of the initial load to achieve the recognition of the voice signal. In the client system start-up, ATK start the engine at the same time, Mike waiting for input, to achieve recognition.System C / S structure, that is client / server model, ATK voice recognition engine from the client load. Service to complete identification of the main results of the search to deal with. System C / S structure can take full advantage of client and server hardware at both ends of the advantages of the environment, the task will be assigned to a reasonable Client-side and Server side to achieve, can only text string identifying the text and format text in the answer systems Transport, communications systems in order to reduce costs.Client system to achieve recognition of the voice signal, send the text to identify, Flash video playback functions; recognition of the results of the deal, Flash and the Treasury to expand the database of information from the maintenance of services to complete the process. The traditional C / S structure of the software needed for different operating systems to develop different versions of the system in the development of the follow-up, the service program in the Linux operating system environments, and client development for the network in the form of plug-in Applications and Web pages, so that the realization of the B / S structure that is the browser / server model. In this structure, the main option in the Server-side logic to realize that this can greatly simplify the client computer load, to reduce development costs while also maintaining and upgrading the system is relatively simple and convenient system in the network .In the voice recognition technology applications, reliability depends largely on voice recognition engine recognition rate. This Yougen system in the area, when the system has only been used to identify a specific word or phrase, can create a better and stable model of language, have a higher recognition. When the system is used to identify more complex and the content of the law is not strong, its language model on the relatively more difficult to identify a relatively low rate. Flash-based video-on-demand structure of the system, could be the flash player to be defined as: Unit1, Unit2, Lesson1, Lesson2 and so on, the system for their definition of special semantic tags. This will create a stable model of language, the system will have a stronger reliability and high availability. When this does not include the need to identify the semantic content of the mark, the system's recognition rate will be relatively low, reliability is also weak.System using Socket Interface for the creation of sets of data, will create a good server-side socket with the local transport layer address bound to wait for client connections. Client to create the same socket, and then connect the server to open the fracture when the connection can achieve success when the data transfer. System used a client server model, but the system in the development of the follow-up, you can use a number of client server model. On the server to connect all customers use the various polling View state.Select Windows XP system as a development platform, the system has a stable, reliable, highly integrated and user-friendly features, is the more common users of the system. Other Windows 98, Widows 2000, also can be compatible. But in a follow-up to the development of the system, server-side program will be chosen under the Linux operating system.Visual C++ is the system to be used is the main language development. VC++ is a very powerful programming language, its flexibility and efficiency in the field of software development has been widely used. The system server-side program in the selection of Visual C + + 6.0 compiler development environment, the client due to engine ATK compiler development environment for Visual C + +. NET, the client is therefore the same choice under the circumstances to achieve.The system functions by the Flash player in the VC program, the introduction of ActiveX controls to complete. Controls and Visual C + + are not directly related, is the control of the Windows operating system interface elements, and the development of the VC is a language, but in the VC program can be introduced to control the process run more in line with the Windows interface standard application process. There are a number of self-control is not a Windows system, but by the third-party release, but in line with the norms of Windows, which controls ActiveX control that.The client system Flash player is called by the control shockwaveflash to complete, which can be assigned to the URL of the file under the Flash player, Flash player, but also the appropriate use of its function to control the course of play. Flash player and the realization of VC control module to be a way to receive the news service of process engines to identify the text of the results, and then have to deal with the results of a series of judgments, including the content of the flag and so on, and then judge the results based on the realization of the specified URL, or under Flash files or use the player controls a specific function to achieve the target.
Keywords/Search Tags:Voice recognition, ATK, Flash
PDF Full Text Request
Related items