Abstract:
PROBLEM TO BE SOLVED: To provide a system capable of giving natural reading and accents of a text. SOLUTION: The system for outputting the reading and the accent of the text, includes a storage section for storing a first corpus in which notation, the reading and the accent which are input beforehand, are recorded for each separation of a phrase contained in the text. Then, an object text which is an object for outputting the reading and the accent is acquired, and at least one group of the notation which matches the notation of the object text from groups of consecutive notation in the first corpus, is searched. In combined groups of the reading and the accent, corresponding to the group of the notation, which is searched, the combined group of the reading and the accent where the appearance probability for appearing in the first corpus is higher than a reference probability, which has been defined beforehand, is selected as the reading and the accent of the object text. COPYRIGHT: (C)2007,JPO&INPIT
Abstract:
PROBLEM TO BE SOLVED: To provide a data processing method suitable for transcribing speeches obtained in a special situation such as a trial and a meeting into a text by establishing proper correspondence between a text having been corrected and an original speech even if the text written down through speech recognition is corrected, and a system using the same. SOLUTION: The system is equipped with: a speech recognition processing part 32 which specifies utterance sections in speech data, performing speech recognition of respective utterance sections, and correlates the obtained character strings of recognition data of each utterance section and the speech data according to information on utterance time; and an output control part 34 which displays a text created by sorting recognition data for each utterance section. The system is further equipped with: a text editing part 35 which edits the created text; and a speech correspondence estimation part 36 which correlates character strings in the edited text to the speech data by using a dynamic programming technique. COPYRIGHT: (C)2005,JPO&NCIPI
Abstract:
PROBLEM TO BE SOLVED: To simultaneously estimate a word and a syntactic structure with a high precision by providing a probability model allowing selection of a range of a history used for estimation and using this probability model as a structural language model with respect to processing for estimating the next data element on the basis of the history having a tree structure. SOLUTION: With respect to a word estimating method for voice recognition using a computer, the tree structure of the history of words preceding a word as the estimation object is specified, and a context tree which is stored in a tree-like context tree storage part 40 and has information related to structures allowed for a sentence and appearance probabilities of words for these structures as nodes is referred to, and a word is estimated on the basis of the context tree and the specified sentence structure of the history.
Abstract:
PROBLEM TO BE SOLVED: To provide a device and a method for voice recognition which have a higher recognition rate than conventionally. SOLUTION: Words are divided into redundant words and other normal words and any of a predicted word and a precedent word as a condition are predicted discriminatingly between those two to improve the precision of the word prediction at a redundant word peripheral part. To this end, the voice recognition device has an acoustic processing means which converts an analog voice input signal into a digital signal, a storage means which stores acoustic models having learnt features of sounds, a storage means having a dictionary which has both 1st language models learnt on the basis of a document containing redundant words and normal words other than the redundant words in advance and 2nd language models learnt on the basis of a document of only normal words, while ignoring redundant words, and a means which recognizes as an inputted voice the word having the highest probability by calculating probability, by using the acoustic models and dictionary for the digital signal.
Abstract:
PURPOSE:To execute the adaptation of a vector quantization use code book with high accuracy and simply by providing a prototype adaptation means for correcting a prototype vector of each label in a label group of the vector quantization code book in accordance with a degree of relation between the label and a displacement vector by each displacement vector. CONSTITUTION:By bringing the generation of a word for adaptation learning to fre quency analysis at every prescribed period, a sequence of a feature vector is derived. Subsequently, a feature vector sequence is divided into two pieces of section 1 and section 2 on a time base, and a word base form is also divided into two pieces of sections L1, L2 in the same way, by which the corresponding relation of each part is obtained. On the basis of the corresponding relation of each section, a difference of representative values S1, S2 and B1, B2 of the feature quantity in the respective sections is derived. On the other hand, strength of the correspondence of each level and each section is derived as appearance probability of each section with a condition of the lavel, and by setting the conditional probability as weight and synthesizing a moving vector of the feature quantity of every section, code vectors F1, F2 correspond ing to each label are brought to adaptation. In such a way, the adaptation of a voice recognition system can be executed simply by small data.