Study On Automatic Generation Of Chinese Course Video Subtitles

Posted on:2017-03-20

Degree:Master

Type:Thesis

Country:China

Candidate:Y L Hui

Full Text:PDF

GTID:2348330488969857

Subject:Agricultural Extension

Abstract/Summary:

Video subtitles are an auxiliary tool for understanding video content, and with the development of the Internet they are playing an increasingly important role. This paper studies the automatic generation of video subtitles and the technical principles behind each stage: extracting the audio stream from a course video, segmenting the audio stream, speech recognition, and generating text-format files. Chinese speech recognition technology is discussed in particular depth. The process of Chinese speech recognition comprises four parts: feature extraction, the acoustic model, the language model, and pattern matching. The technologies used in each of these four parts are compared and analyzed, and Mel-frequency cepstral coefficients (MFCC), the hidden Markov model (HMM), the N-gram model, and related algorithms are chosen for the study of Chinese speech recognition. The MFCC feature extraction method, the HMM acoustic model and its related algorithms, and the N-gram language model with its smoothing methods are described in detail. In light of the rules of Chinese pronunciation, this paper takes initials and finals as the phonemes and uses the Sphinx speech recognition system, developed by Carnegie Mellon University, to build the acoustic model, the language model, and the dictionary. The HMM is used for acoustic modeling and the N-gram statistical model for language modeling; in the dictionary, each entry corresponds to a set of phones. Nearly 30 thousand audio files in total were included in the modeling process, with nearly 30 thousand corresponding entries.
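As an illustration of the dictionary format described above, the following is a minimal Python sketch that splits toneless pinyin syllables into an initial and a final and emits one line per entry, in the word-followed-by-phones layout that Sphinx dictionaries use. The initial inventory, the treatment of y and w as initials, the omission of tones, and the function names split_syllable and dict_line are all assumptions made for this example; the abstract does not give the phone set actually used in the thesis.

```python
# Pinyin initials, with two-letter initials listed first so that "zh" is
# matched before "z". Treating "y" and "w" as initials is an assumption.
INITIALS = [
    "zh", "ch", "sh",
    "b", "p", "m", "f", "d", "t", "n", "l",
    "g", "k", "h", "j", "q", "x", "r", "z", "c", "s",
    "y", "w",
]


def split_syllable(syllable):
    """Split one toneless pinyin syllable into [initial, final].

    Syllables with no initial (for example "an") are returned as a
    single phone.
    """
    for initial in INITIALS:
        if syllable.startswith(initial):
            final = syllable[len(initial):]
            if final:
                return [initial, final]
            break
    return [syllable]


def dict_line(word, syllables):
    """Build one dictionary line: the word followed by its phones."""
    phones = []
    for syllable in syllables:
        phones.extend(split_syllable(syllable))
    return word + " " + " ".join(phones)


if __name__ == "__main__":
    print(dict_line("字幕", ["zi", "mu"]))
    print(dict_line("语音", ["yu", "yin"]))
```

A real phone set would also need a decision on tones and on syllables whose written final differs from its pronunciation; the sketch leaves both out.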
This paper also describes the acoustic modeling and language modeling processes in detail. For acoustic modeling, the emphasis is on the data preparation that precedes modeling and on the training process; for language modeling, the emphasis is on the model training process. Through the establishment of a corpus, the study of the Sphinx speech recognition system, and the design and development of the subtitle generation system, an automatic subtitle generation system was finally built. Testing and comparison experiments show that the Chinese recognition rate of this system is about 51%. Analysis of the results indicates that the corpus is the most important factor limiting the recognition rate in this study.
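The last stage of such a pipeline, turning recognized audio segments into a text-format subtitle file, can be sketched in Python as follows. The abstract does not say which subtitle format the system writes, so the SubRip (SRT) layout is an assumption here, as are the function names format_timestamp and to_srt and the (start, end, text) tuple representation of a recognized segment.

```python
def format_timestamp(seconds):
    """Convert a time in seconds to the SRT form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments):
    """Render (start_seconds, end_seconds, text) tuples as SRT text.

    Each segment is assumed to be one audio slice together with the
    text the recognizer produced for it.
    """
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n"
            f"{format_timestamp(start)} --> {format_timestamp(end)}\n"
            f"{text}\n"
        )
    # Subtitle blocks are separated by a blank line.
    return "\n".join(blocks)


if __name__ == "__main__":
    recognized = [
        (0.0, 2.5, "大家好"),
        (2.5, 5.0, "今天我们开始上课"),
    ]
    print(to_srt(recognized))
```

Keeping the segment boundaries from the audio segmentation step is what lets the recognized text be aligned with the video; the recognizer itself only needs to return text for each slice.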

Keywords/Search Tags:

subtitles, speech recognition, feature parameter extraction, acoustic model, language model

Related items

1	ASR Research Based On CTC
2	A Study On The Extraction Of Speech Depth In Tibetan Language And Its Speech Recognition
3	Study And Improve On The Mongolian Speech Recognition System
4	Research And Implementation Of Mongolian-Chinese Mixed Language Speech Recognition System Based On Deep Learning
5	Research On Speech Recognition Based On Deep Learning
6	Research On Discriminative Techniques Of Feature Extraction And Acoustic Model Training In Continuous Speech Recognition
7	Research On Speech Recognition Method In Strong Noise Environment
8	Chinese Continuous Speech Recognition Based On Sphinx
9	Researching Of The Mongolian Language Model Based On Speech Recognition
10	Design And Implementation Of Intelligent Speech Interaction