Abstract:
The present invention relates to the provision of natural-soundingphonemes and accents for text. There is provided a system that outputs phonemes and accents of texts.The system has a storage section storing a first corpus in which spellings, phonemes, and accents of a text input beforehand are recorded separately for individual segmentations of the words that are contained in the text. A text for which phonemes and accents are to be output is acquired and the first corpus is searched to retrieve at least one set of spellings that match the spellings in the text from among sets of contiguous spellings. Then, the combination of a phoneme and an accent that has a higher probability of occurrence in the first corpus than a predetermined reference probability is selected as the phonemes and accent of the text.
Abstract:
The present invention relates to the provision of natural-soundingphonemes and accents for text. There is provided a system that outputs phonemes and accents of texts.The system has a storage section storing a first corpus in which spellings, phonemes, and accents of a text input beforehand are recorded separately for individual segmentations of the words that are contained in the text. A text for which phonemes and accents are to be output is acquired and the first corpus is searched to retrieve at least one set of spellings that match the spellings in the text from among sets of contiguous spellings. Then, the combination of a phoneme and an accent that has a higher probability of occurrence in the first corpus than a predetermined reference probability is selected as the phonemes and accent of the text.
Abstract:
PROBLEM TO BE SOLVED: To insert punctuation marks on suitable positions in a sentence. SOLUTION: An acoustic processing part 20 processes inputted voice data and converts the data into characteristic vectors. When punctuation mark automatic insertion is not executed, a language mark-reproduction part 22 processes the characteristic vectors by using only a versatile language model 320, and inserts a punctuation mark on a part where insertion of a punctuation mark is shown clearly, for example, 'a comma' or the like, by voice data. When the punctuation mark automatic insertion is executed, the language mark- reproduction part 22 discriminates a pause part having no voice as a comma ',' or the like by using the versatile language model 320 and a punctuation language model 322.
Abstract:
A system that outputs phonemes and accents of texts. The system has a storage section storing a first corpus in which spellings, phonemes, and accents of a text input beforehand are recorded separately for individual segmentations of the words that are contained in the text. A text for which phonemes and accents are to be output is acquired and the first corpus is searched to retrieve at least one set of spellings that match the spellings in the text from among sets of contiguous spellings. Then, the combination of a phoneme and an accent that has a higher probability of occurrence in the first corpus than a predetermined reference probability is selected as the phonemes and accent of the text.
Abstract:
PROBLEM TO BE SOLVED: To search a new phrase to be registered in a dictionary of a dividing means which breakes down a text into phrases. SOLUTION: This system inputs a text for learning into a dividing means to break down into phrases to produce break down candidates including the phrases different in combination according to the obtained break down reliability. It sums up the reliability of the break down candidates including those phrases for each phrase to find out their likelihood. Then, it finds out the combination minimizing the information entropy of the phrase considered to appear at the frequency matching the likelihood of the phrases in the combination within the extent that the text can be expressed by using the phrases included in a combination among the combinations of phrases included at least in one candidate, and to outputs it as a combination of phrases including the new phrase. COPYRIGHT: (C)2008,JPO&INPIT
Abstract:
PROBLEM TO BE SOLVED: To provide an apparatus and a technique for increasing the accuracy of recognition in natural language processing by calculating the n-gram probability of words with high precision while making effective use of a first corpus where words are separated from one another and a second corpus where words are not separated. SOLUTION: In a method for using the corpus where words are separated from one another, the first corpus (words separated) is used in the calculation of n-gram and the probability (division probability) with which a space between two adjacent characters becomes a word boundary; the second corpus (words unseparated) is assigned with probabilistic word boundaries based upon information in the first corpus (words separated) and used in the calculation of word n-gram. For the calculation of the probabilistic word boundaries, the second corpus (words unseparated) assigns the division probabilities calculated via the first corpus (words separated) to every space between characters. An unknown-word model based on character units models the correspondence between each character and how it is read in character units. In this way, a model of kana-kanji conversion for unknown words is proposed. COPYRIGHT: (C)2006,JPO&NCIPI
Abstract:
PROBLEM TO BE SOLVED: To provide a system capable of giving natural reading and accents of a text. SOLUTION: The system for outputting the reading and the accent of the text, includes a storage section for storing a first corpus in which notation, the reading and the accent which are input beforehand, are recorded for each separation of a phrase contained in the text. Then, an object text which is an object for outputting the reading and the accent is acquired, and at least one group of the notation which matches the notation of the object text from groups of consecutive notation in the first corpus, is searched. In combined groups of the reading and the accent, corresponding to the group of the notation, which is searched, the combined group of the reading and the accent where the appearance probability for appearing in the first corpus is higher than a reference probability, which has been defined beforehand, is selected as the reading and the accent of the object text. COPYRIGHT: (C)2007,JPO&INPIT
Abstract:
PROBLEM TO BE SOLVED: To provide a data processing method suitable for transcribing speeches obtained in a special situation such as a trial and a meeting into a text by establishing proper correspondence between a text having been corrected and an original speech even if the text written down through speech recognition is corrected, and a system using the same. SOLUTION: The system is equipped with: a speech recognition processing part 32 which specifies utterance sections in speech data, performing speech recognition of respective utterance sections, and correlates the obtained character strings of recognition data of each utterance section and the speech data according to information on utterance time; and an output control part 34 which displays a text created by sorting recognition data for each utterance section. The system is further equipped with: a text editing part 35 which edits the created text; and a speech correspondence estimation part 36 which correlates character strings in the edited text to the speech data by using a dynamic programming technique. COPYRIGHT: (C)2005,JPO&NCIPI
Abstract:
PROBLEM TO BE SOLVED: To simultaneously estimate a word and a syntactic structure with a high precision by providing a probability model allowing selection of a range of a history used for estimation and using this probability model as a structural language model with respect to processing for estimating the next data element on the basis of the history having a tree structure. SOLUTION: With respect to a word estimating method for voice recognition using a computer, the tree structure of the history of words preceding a word as the estimation object is specified, and a context tree which is stored in a tree-like context tree storage part 40 and has information related to structures allowed for a sentence and appearance probabilities of words for these structures as nodes is referred to, and a word is estimated on the basis of the context tree and the specified sentence structure of the history.
Abstract:
PROBLEM TO BE SOLVED: To acquire a characteristic to be recognized as a phrase and its pronunciation more accurately than before. SOLUTION: A system selects a plurality of candidate character strings as candidates to be recognized as phrases from an input text, combines predetermined pronunciations with respective characters included in each of the selected candidate character strings to generate a plurality of candidates for pronunciations of the candidate character string, combines data wherein the respective generated candidates for the pronunciations are made to correspond to respective candidate character strings with language model data wherein numerals indicative of frequencies of appearance of the respective phrases in the text are recorded to generate frequency data indicative of frequencies of appearance by pairs of character strings representing the phrases and pronunciations, speech-recognizes an input speech based upon the generated frequency data to generate recognition data wherein character strings indicative of a plurality of phrases included in the input speech are made to correspond to pronunciations, and selects and outputs a combination included in the recognition data among combinations of candidate character strings and candidates for pronunciations. COPYRIGHT: (C)2008,JPO&INPIT