SYSTEM, PROGRAM, AND CONTROL METHOD FOR SPEECH SYNTHESIS

    公开(公告)号:CA2614840A1

    公开(公告)日:2007-01-18

    申请号:CA2614840

    申请日:2006-07-10

    Applicant: IBM

    Abstract: The present invention relates to the provision of natural-soundingphonemes and accents for text. There is provided a system that outputs phonemes and accents of texts.The system has a storage section storing a first corpus in which spellings, phonemes, and accents of a text input beforehand are recorded separately for individual segmentations of the words that are contained in the text. A text for which phonemes and accents are to be output is acquired and the first corpus is searched to retrieve at least one set of spellings that match the spellings in the text from among sets of contiguous spellings. Then, the combination of a phoneme and an accent that has a higher probability of occurrence in the first corpus than a predetermined reference probability is selected as the phonemes and accent of the text.

    SYSTEM, PROGRAM, AND CONTROL METHOD FOR SPEECH SYNTHESIS

    公开(公告)号:CA2614840C

    公开(公告)日:2016-11-22

    申请号:CA2614840

    申请日:2006-07-10

    Applicant: IBM

    Abstract: The present invention relates to the provision of natural-soundingphonemes and accents for text. There is provided a system that outputs phonemes and accents of texts.The system has a storage section storing a first corpus in which spellings, phonemes, and accents of a text input beforehand are recorded separately for individual segmentations of the words that are contained in the text. A text for which phonemes and accents are to be output is acquired and the first corpus is searched to retrieve at least one set of spellings that match the spellings in the text from among sets of contiguous spellings. Then, the combination of a phoneme and an accent that has a higher probability of occurrence in the first corpus than a predetermined reference probability is selected as the phonemes and accent of the text.

    MARK INSERTION DEVICE AND ITS METHOD

    公开(公告)号:JP2001083987A

    公开(公告)日:2001-03-30

    申请号:JP24331199

    申请日:1999-08-30

    Applicant: IBM

    Abstract: PROBLEM TO BE SOLVED: To insert punctuation marks on suitable positions in a sentence. SOLUTION: An acoustic processing part 20 processes inputted voice data and converts the data into characteristic vectors. When punctuation mark automatic insertion is not executed, a language mark-reproduction part 22 processes the characteristic vectors by using only a versatile language model 320, and inserts a punctuation mark on a part where insertion of a punctuation mark is shown clearly, for example, 'a comma' or the like, by voice data. When the punctuation mark automatic insertion is executed, the language mark- reproduction part 22 discriminates a pause part having no voice as a comma ',' or the like by using the versatile language model 320 and a punctuation language model 322.

    4.
    发明专利
    未知

    公开(公告)号:BRPI0614034A2

    公开(公告)日:2011-03-01

    申请号:BRPI0614034

    申请日:2006-07-10

    Applicant: IBM

    Abstract: A system that outputs phonemes and accents of texts. The system has a storage section storing a first corpus in which spellings, phonemes, and accents of a text input beforehand are recorded separately for individual segmentations of the words that are contained in the text. A text for which phonemes and accents are to be output is acquired and the first corpus is searched to retrieve at least one set of spellings that match the spellings in the text from among sets of contiguous spellings. Then, the combination of a phoneme and an accent that has a higher probability of occurrence in the first corpus than a predetermined reference probability is selected as the phonemes and accent of the text.

    Technique to search new phrase to be registered in voice processing dictionary
    5.
    发明专利
    Technique to search new phrase to be registered in voice processing dictionary 有权
    在语音处理词典中注册新技术的技巧

    公开(公告)号:JP2008151926A

    公开(公告)日:2008-07-03

    申请号:JP2006338454

    申请日:2006-12-15

    CPC classification number: G06F17/2735 G06F17/277

    Abstract: PROBLEM TO BE SOLVED: To search a new phrase to be registered in a dictionary of a dividing means which breakes down a text into phrases.
    SOLUTION: This system inputs a text for learning into a dividing means to break down into phrases to produce break down candidates including the phrases different in combination according to the obtained break down reliability. It sums up the reliability of the break down candidates including those phrases for each phrase to find out their likelihood. Then, it finds out the combination minimizing the information entropy of the phrase considered to appear at the frequency matching the likelihood of the phrases in the combination within the extent that the text can be expressed by using the phrases included in a combination among the combinations of phrases included at least in one candidate, and to outputs it as a combination of phrases including the new phrase.
    COPYRIGHT: (C)2008,JPO&INPIT

    Abstract translation: 要解决的问题:搜索要将文本破解成短语的分割装置的字典中注册的新短语。

    解决方案:该系统将用于学习的文本输入到分割装置中以分解成短语,以根据获得的分解可靠性产生包括不同组合的短语的候选人。 它总结了分解候选人的可靠性,包括每个短语的短语,以找出它们的可能性。 然后,发现组合中最小化被认为出现在频率上的短语的信息熵,以匹配组合中的短语的可能性,该程度可以通过使用包含在组合中的组合来表达文本 短语至少包括在一个候选中,并将其作为包括新短语的短语的组合输出。 版权所有(C)2008,JPO&INPIT

    Apparatus and method for estimating word boundary probability, apparatus and method for constructing probabilistic language model, apparatus and method for kana-kanji conversion, and method for constructing unknown word model
    6.
    发明专利
    Apparatus and method for estimating word boundary probability, apparatus and method for constructing probabilistic language model, apparatus and method for kana-kanji conversion, and method for constructing unknown word model 有权
    用于估计边界概率的装置和方法,用于构建概念语言模型的装置和方法,用于加纳 - 康佳转换的装置和方法以及用于构造未知字模型的方法

    公开(公告)号:JP2006031295A

    公开(公告)日:2006-02-02

    申请号:JP2004207864

    申请日:2004-07-14

    CPC classification number: G06F17/2863 G06F17/2715

    Abstract: PROBLEM TO BE SOLVED: To provide an apparatus and a technique for increasing the accuracy of recognition in natural language processing by calculating the n-gram probability of words with high precision while making effective use of a first corpus where words are separated from one another and a second corpus where words are not separated. SOLUTION: In a method for using the corpus where words are separated from one another, the first corpus (words separated) is used in the calculation of n-gram and the probability (division probability) with which a space between two adjacent characters becomes a word boundary; the second corpus (words unseparated) is assigned with probabilistic word boundaries based upon information in the first corpus (words separated) and used in the calculation of word n-gram. For the calculation of the probabilistic word boundaries, the second corpus (words unseparated) assigns the division probabilities calculated via the first corpus (words separated) to every space between characters. An unknown-word model based on character units models the correspondence between each character and how it is read in character units. In this way, a model of kana-kanji conversion for unknown words is proposed. COPYRIGHT: (C)2006,JPO&NCIPI

    Abstract translation: 要解决的问题:提供一种用于通过以高精度计算单词的n-gram概率来提高自然语言处理中的识别精度的装置和技术,同时有效地使用第一语料库,其中单词被分离 另一个和第二个语料库,其中单词不分开。

    解决方案:在使用语言彼此分离的语料库的方法中,第一语料库(分离词)用于计算n-gram和两个相邻空间之间的空间的概率(分割概率) 字符变成字边界; 第二语料库(未分离的单词)基于第一语料库中的信息(分开的单词)被分配有概率词边界,并用于计算单词n-gram。 对于概率词边界的计算,第二语料库(单词未分离)将通过第一语料库计算的分割概率(单词分离)分配给字符之间的每个空格。 基于字符单元的未知词模型模拟每个字符之间的对应关系以及如何以字符单位读取。 以这种方式,提出了一种用于未知词的假名汉字转换模型。 版权所有(C)2006,JPO&NCIPI

    System, program and control method
    7.
    发明专利
    System, program and control method 审中-公开
    系统,程序和控制方法

    公开(公告)号:JP2007024960A

    公开(公告)日:2007-02-01

    申请号:JP2005203160

    申请日:2005-07-12

    CPC classification number: G10L13/08 G10L13/04 G10L13/086 G10L13/10

    Abstract: PROBLEM TO BE SOLVED: To provide a system capable of giving natural reading and accents of a text.
    SOLUTION: The system for outputting the reading and the accent of the text, includes a storage section for storing a first corpus in which notation, the reading and the accent which are input beforehand, are recorded for each separation of a phrase contained in the text. Then, an object text which is an object for outputting the reading and the accent is acquired, and at least one group of the notation which matches the notation of the object text from groups of consecutive notation in the first corpus, is searched. In combined groups of the reading and the accent, corresponding to the group of the notation, which is searched, the combined group of the reading and the accent where the appearance probability for appearing in the first corpus is higher than a reference probability, which has been defined beforehand, is selected as the reading and the accent of the object text.
    COPYRIGHT: (C)2007,JPO&INPIT

    Abstract translation: 要解决的问题:提供能够给出自然阅读和文字重音的系统。

    解决方案:用于输出文本的读数和重音的系统包括存储部分,用于存储第一语料库,其中记录了每个分离所包含的短语的符号,预先输入的阅读和口音 在文中。 然后,获取作为用于输出读取和重音的对象的对象文本,并且搜索与第一语料库中的连续符号的组中的对象文本的符号匹配的符号中的至少一组。 在与搜索的符号组相对应的阅读和口音的组合组中,出现在第一语料库中的出现概率高于参考概率的阅读和口音的组合组,其具有 被预先定义,被选为对象文本的读数和重音。 版权所有(C)2007,JPO&INPIT

    Speech recognition system, data processor, and its data processing method and program
    8.
    发明专利
    Speech recognition system, data processor, and its data processing method and program 有权
    语音识别系统,数据处理器及其数据处理方法和程序

    公开(公告)号:JP2005165066A

    公开(公告)日:2005-06-23

    申请号:JP2003405223

    申请日:2003-12-03

    CPC classification number: G10L15/26

    Abstract: PROBLEM TO BE SOLVED: To provide a data processing method suitable for transcribing speeches obtained in a special situation such as a trial and a meeting into a text by establishing proper correspondence between a text having been corrected and an original speech even if the text written down through speech recognition is corrected, and a system using the same. SOLUTION: The system is equipped with: a speech recognition processing part 32 which specifies utterance sections in speech data, performing speech recognition of respective utterance sections, and correlates the obtained character strings of recognition data of each utterance section and the speech data according to information on utterance time; and an output control part 34 which displays a text created by sorting recognition data for each utterance section. The system is further equipped with: a text editing part 35 which edits the created text; and a speech correspondence estimation part 36 which correlates character strings in the edited text to the speech data by using a dynamic programming technique. COPYRIGHT: (C)2005,JPO&NCIPI

    Abstract translation: 要解决的问题:为了提供一种数据处理方法,适用于将特殊情况(例如审判和会议)中获得的演讲转录成文本,通过建立正确的文本与原始语音之间的正确对应关系,即使 通过语音识别记录的文本被更正,并且使用相同的系统。 解决方案:该系统配备有:语音识别处理部分32,其指定语音数据中的话语部分,执行各个发音部分的语音识别,并且将获得的每个发音部分的识别数据的字符串与语音数据相关联 根据说话时间的信息; 以及输出控制部34,其显示通过对每个发音部分分类识别数据而创建的文本。 该系统还配备有:文本编辑部分35,其编辑所创建的文本; 以及通过使用动态编程技术将编辑的文本中的字符串与语音数据相关联的语音对应估计部分36。 版权所有(C)2005,JPO&NCIPI

    Word estimating method, voice recognition method, voice recognition device using this method, and program
    9.
    发明专利
    Word estimating method, voice recognition method, voice recognition device using this method, and program 有权
    词汇估计方法,语音识别方法,使用该方法的语音识别装置和程序

    公开(公告)号:JP2003076392A

    公开(公告)日:2003-03-14

    申请号:JP2001254502

    申请日:2001-08-24

    CPC classification number: G10L15/193

    Abstract: PROBLEM TO BE SOLVED: To simultaneously estimate a word and a syntactic structure with a high precision by providing a probability model allowing selection of a range of a history used for estimation and using this probability model as a structural language model with respect to processing for estimating the next data element on the basis of the history having a tree structure. SOLUTION: With respect to a word estimating method for voice recognition using a computer, the tree structure of the history of words preceding a word as the estimation object is specified, and a context tree which is stored in a tree-like context tree storage part 40 and has information related to structures allowed for a sentence and appearance probabilities of words for these structures as nodes is referred to, and a word is estimated on the basis of the context tree and the specified sentence structure of the history.

    Abstract translation: 要解决的问题:通过提供允许选择用于估计的历史的范围的概率模型并且使用该概率模型作为用于估计的处理的结构语言模型来同时高精度地估计单词和句法结构 基于具有树结构的历史的下一个数据元素。 解决方案:关于使用计算机的语音识别的词估计方法,指定在词之前的词的历史的树结构作为估计对象,以及存储在树状上下文树存储部分中的上下文树 并且具有与允许用于句子的结构相关的信息以及作为节点的这些结构的单词的出现概率,并且基于上下文树和历史的指定句子结构来估计单词。

    Technique for acquiring character string or the like to be newly recognized as phrase
    10.
    发明专利
    Technique for acquiring character string or the like to be newly recognized as phrase 有权
    获取字符串的技术或类似于新闻识别的技术

    公开(公告)号:JP2008216756A

    公开(公告)日:2008-09-18

    申请号:JP2007055522

    申请日:2007-03-06

    CPC classification number: G10L15/063

    Abstract: PROBLEM TO BE SOLVED: To acquire a characteristic to be recognized as a phrase and its pronunciation more accurately than before. SOLUTION: A system selects a plurality of candidate character strings as candidates to be recognized as phrases from an input text, combines predetermined pronunciations with respective characters included in each of the selected candidate character strings to generate a plurality of candidates for pronunciations of the candidate character string, combines data wherein the respective generated candidates for the pronunciations are made to correspond to respective candidate character strings with language model data wherein numerals indicative of frequencies of appearance of the respective phrases in the text are recorded to generate frequency data indicative of frequencies of appearance by pairs of character strings representing the phrases and pronunciations, speech-recognizes an input speech based upon the generated frequency data to generate recognition data wherein character strings indicative of a plurality of phrases included in the input speech are made to correspond to pronunciations, and selects and outputs a combination included in the recognition data among combinations of candidate character strings and candidates for pronunciations. COPYRIGHT: (C)2008,JPO&INPIT

    Abstract translation: 要解决的问题:比以前更准确地获取被识别为短语及其发音的特征。 解决方案:系统选择多个候选字符串作为从输入文本中识别为短语的候选,将预定发音与包括在每个所选择的候选字符串中的各个字符相结合,以产生多个候选字符,用于发音的发音 候选字符串组合数据,其中使得发音的各个生成的候选对应于具有语言模型数据的各个候选字符串,其中记录了表示文本中各个短语的出现频率的数字,以产生指示 表示短语和发音的字符串对的出现频率,语音 - 基于生成的频率数据识别输入语音,以产生识别数据,其中指示包括在输入语音中的多个短语的字符串被做成对应于pronu 并且选择并输出包括在候选字符串的组合中的识别数据中的组合和用于发音的候选。 版权所有(C)2008,JPO&INPIT

Patent Agency Ranking