System, program and control method
    61.
    Invention patent
    System, program and control method (Status: under examination, published)

    Publication No.: JP2007024960A

    Publication date: 2007-02-01

    Application No.: JP2005203160

    Filing date: 2005-07-12

    CPC classification number: G10L13/08 G10L13/04 G10L13/086 G10L13/10

    Abstract: PROBLEM TO BE SOLVED: To provide a system capable of assigning natural readings and accents to a text.
    SOLUTION: The system, which outputs the reading and accent of a text, includes a storage section that stores a first corpus in which a notation, a reading, and an accent, all input beforehand, are recorded for each phrase segment contained in a text. The system acquires a target text for which the reading and accent are to be output, and searches the sequences of consecutive notations in the first corpus for at least one sequence that matches the notation of the target text. Among the reading-and-accent combinations corresponding to the retrieved notation sequence, a combination whose probability of appearing in the first corpus is higher than a predefined reference probability is selected as the reading and accent of the target text.
    COPYRIGHT: (C)2007,JPO&INPIT
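
    The selection step described in this abstract can be sketched roughly as below. This is a minimal illustration, not the patented implementation: the toy corpus, the accent labels, the n-gram length limit, and the names `build_index` and `select_reading` are all assumptions introduced here.

```python
from collections import Counter, defaultdict

# Hypothetical toy corpus: each sentence is a list of
# (notation, reading, accent) triples. Accent labels are placeholders.
CORPUS = [
    [("今日", "kyou", "A1"), ("は", "wa", "A0"), ("晴れ", "hare", "A2")],
    [("今日", "kyou", "A1"), ("は", "wa", "A0"), ("雨", "ame", "A1")],
    [("今日", "konnichi", "A0"), ("は", "wa", "A0")],
]

def build_index(corpus, max_n=3):
    """Map each run of consecutive notations to counts of the
    reading-and-accent sequences observed with it."""
    index = defaultdict(Counter)
    for sentence in corpus:
        for n in range(1, max_n + 1):
            for i in range(len(sentence) - n + 1):
                gram = sentence[i:i + n]
                notations = tuple(t[0] for t in gram)
                reading_accent = tuple((t[1], t[2]) for t in gram)
                index[notations][reading_accent] += 1
    return index

def select_reading(index, notations, reference_prob):
    """Return the most frequent reading-and-accent sequence for the
    notations if its relative frequency exceeds reference_prob,
    otherwise None."""
    counts = index.get(tuple(notations))
    if not counts:
        return None
    best, best_count = counts.most_common(1)[0]
    prob = best_count / sum(counts.values())
    return (best, prob) if prob > reference_prob else None

index = build_index(CORPUS)
print(select_reading(index, ["今日", "は"], reference_prob=0.5))
```

    In this toy data the sequence 今日 + は appears three times, twice with the reading "kyou wa", so that reading passes a reference probability of 0.5 but not one of 0.7.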


    Speech recognition system, data processor, and its data processing method and program
    62.
    Invention patent
    Speech recognition system, data processor, and its data processing method and program (Status: granted)

    Publication No.: JP2005165066A

    Publication date: 2005-06-23

    Application No.: JP2003405223

    Filing date: 2003-12-03

    CPC classification number: G10L15/26

    Abstract: PROBLEM TO BE SOLVED: To provide a data processing method, and a system using it, suited to transcribing speech recorded in special settings such as trials and meetings, by establishing a proper correspondence between the corrected text and the original speech even when text produced by speech recognition has been corrected.
    SOLUTION: The system includes a speech recognition processing part 32, which identifies utterance sections in the speech data, performs speech recognition on each utterance section, and associates the character strings of the recognition data obtained for each utterance section with the speech data according to utterance-time information; and an output control part 34, which displays a text created by sorting the recognition data of each utterance section. The system further includes a text editing part 35, which edits the created text, and a speech correspondence estimation part 36, which associates character strings in the edited text with the speech data by dynamic programming.
    COPYRIGHT: (C)2005,JPO&NCIPI
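
    One way to picture the dynamic-programming step is an edit-distance alignment between the recognized characters, which carry timestamps, and the edited text. The sketch below is an assumption about how such an alignment could work, not the method claimed in the patent; the function name `align_edited_text` and the per-character timestamps are hypothetical.

```python
def align_edited_text(recognized, times, edited):
    """Align edited text to recognized text by edit distance and
    carry each recognized character's timestamp over to the edited
    character aligned with it. Inserted characters get None."""
    n, m = len(recognized), len(edited)
    # cost[i][j] = edit distance between recognized[:i] and edited[:j]
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(n + 1):
        cost[i][0] = i
    for j in range(m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            mismatch = recognized[i - 1] != edited[j - 1]
            cost[i][j] = min(
                cost[i - 1][j - 1] + mismatch,  # match or substitution
                cost[i - 1][j] + 1,             # character deleted by the editor
                cost[i][j - 1] + 1,             # character inserted by the editor
            )
    # Trace back from the end to recover one optimal alignment.
    result = [None] * m
    i, j = n, m
    while i > 0 and j > 0:
        mismatch = recognized[i - 1] != edited[j - 1]
        if cost[i][j] == cost[i - 1][j - 1] + mismatch:
            result[j - 1] = times[i - 1]
            i, j = i - 1, j - 1
        elif cost[i][j] == cost[i - 1][j] + 1:
            i -= 1
        else:
            j -= 1
    return result

# The editor inserted "c"; it has no matching speech position.
print(align_edited_text("abd", [0.0, 0.1, 0.2], "abcd"))
```

    Substituted characters keep the timestamp of the character they replaced, on the assumption that a correction refers to the same stretch of speech.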


    Word estimating method, voice recognition method, voice recognition device using this method, and program
    63.
    Invention patent
    Word estimating method, voice recognition method, voice recognition device using this method, and program (Status: granted)

    Publication No.: JP2003076392A

    Publication date: 2003-03-14

    Application No.: JP2001254502

    Filing date: 2001-08-24

    CPC classification number: G10L15/193

    Abstract: PROBLEM TO BE SOLVED: To estimate a word and a syntactic structure simultaneously and with high precision, in processing that estimates the next data element from a tree-structured history, by providing a probability model that allows the range of history used for estimation to be selected and using that model as a structural language model.
    SOLUTION: In a word estimating method for computer-based voice recognition, the tree structure of the history of words preceding the word to be estimated is specified. A context tree stored in a context tree storage part 40 is then consulted; its nodes hold information on the structures a sentence may take and the appearance probabilities of words given those structures. The word is estimated from the context tree and the specified structure of the history.
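
    The idea of a context tree that selects how much history to use can be illustrated with the simplified sketch below. It flattens the tree-structured history into a plain sequence of preceding items, which is a simplifying assumption made here, and the `Node` class, the probabilities, and the function names are hypothetical.

```python
from dataclasses import dataclass, field

@dataclass
class Node:
    """One context-tree node: a word distribution plus children
    keyed by the next-older history item."""
    probs: dict
    children: dict = field(default_factory=dict)

def deepest_node(root, history):
    """Walk from the most recent history item backwards and stop at
    the deepest node the tree has for this history."""
    node = root
    for item in reversed(history):
        child = node.children.get(item)
        if child is None:
            break
        node = child
    return node

def predict(root, history):
    """Return the most probable next word under the selected context."""
    node = deepest_node(root, history)
    return max(node.probs, key=node.probs.get)

# Hypothetical tree: the context "saw the" is modelled separately,
# other contexts ending in "the" share one node.
root = Node(
    {"the": 0.5, "a": 0.3, "dog": 0.2},
    {"the": Node(
        {"dog": 0.6, "cat": 0.4},
        {"saw": Node({"cat": 0.7, "dog": 0.3})},
    )},
)
print(predict(root, ["saw", "the"]))
print(predict(root, ["ate", "the"]))
```

    Because the tree is deeper only where longer contexts are modelled, the range of history used varies from one prediction to the next.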


    DEVICE AND METHOD FOR VOICE RECOGNITION, COMPUTER SYSTEM, AND STORAGE MEDIUM

    Publication No.: JP2001188558A

    Publication date: 2001-07-10

    Application No.: JP37041399

    Filing date: 1999-12-27

    Applicant: IBM

    Abstract: PROBLEM TO BE SOLVED: To provide a device and a method for voice recognition with a higher recognition rate than conventional ones.
    SOLUTION: Words are divided into redundant words and other, normal words, and prediction distinguishes between the two kinds both for the word being predicted and for the preceding words used as the condition, which improves the precision of word prediction around redundant words. To this end, the voice recognition device has an acoustic processing means that converts an analog voice input signal into a digital signal; a storage means that stores acoustic models that have learned the features of sounds; a storage means holding a dictionary that contains both first language models, trained in advance on a document containing redundant words as well as normal words, and second language models, trained on a document of normal words only with redundant words ignored; and a means that computes probabilities for the digital signal using the acoustic models and the dictionary and recognizes the word with the highest probability as the input voice.
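
    The two kinds of language model can be illustrated with bigram counts, as in the sketch below. The rule for choosing between the models in `prob` is an assumption made for illustration only, since the abstract does not spell it out; the filler list, the training sentences, and all function names are hypothetical.

```python
from collections import Counter

FILLERS = {"uh", "um"}  # hypothetical list of redundant words

def bigram_counts(sentences):
    """Count (previous word, word) pairs and previous-word totals."""
    pairs, totals = Counter(), Counter()
    for words in sentences:
        prev = "<s>"
        for word in words:
            pairs[(prev, word)] += 1
            totals[prev] += 1
            prev = word
    return pairs, totals

def train(sentences):
    """First model: text as spoken. Second model: fillers stripped."""
    stripped = [[w for w in s if w not in FILLERS] for s in sentences]
    return bigram_counts(sentences), bigram_counts(stripped)

def prob(models, history, word):
    """Assumed rule: when a normal word follows a filler, skip the
    fillers in the history and use the normal-words-only model;
    otherwise use the model trained on the text as spoken."""
    with_fillers, normal_only = models
    prev = history[-1] if history else "<s>"
    if word not in FILLERS and prev in FILLERS:
        normal_history = [w for w in history if w not in FILLERS]
        prev = normal_history[-1] if normal_history else "<s>"
        pairs, totals = normal_only
    else:
        pairs, totals = with_fillers
    return pairs[(prev, word)] / totals[prev] if totals[prev] else 0.0

models = train([["i", "uh", "want", "tea"], ["i", "want", "coffee"]])
print(prob(models, ["i", "uh"], "want"))
print(prob(models, ["i"], "uh"))
```

    With the filler skipped, "want" after "i uh" is scored like "want" after "i", which is the kind of gain around redundant words that the abstract describes.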

    VOICE RECOGNIZING DEVICE
    65.
    Invention patent

    Publication No.: JPH02238496A

    Publication date: 1990-09-20

    Application No.: JP5776089

    Filing date: 1989-03-13

    Applicant: IBM JAPAN

    Abstract: PURPOSE: To adapt a vector quantization codebook simply and with high accuracy by providing a prototype adaptation means that corrects the prototype vector of each label in the codebook's label group using each displacement vector, according to the degree of relation between the label and that displacement vector.
    CONSTITUTION: The utterance of a word for adaptation learning is frequency-analyzed at every prescribed period to derive a sequence of feature vectors. The feature vector sequence is then divided into two sections, section 1 and section 2, along the time axis, and the word base form is likewise divided into two sections, L1 and L2, which gives the correspondence between the parts. From this correspondence, the differences between the representative feature values S1, S2 and B1, B2 of the respective sections are derived. Separately, the strength of the correspondence between each label and each section is derived as the appearance probability of the section conditioned on the label. Using this conditional probability as a weight, the movement vectors of the feature quantities of the sections are combined, and the code vectors F1, F2 corresponding to each label are adapted. In this way, a voice recognition system can be adapted simply and with a small amount of data.
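
    The weighted update described in this abstract can be sketched as below, assuming that each section's movement vector is S_i minus B_i and that each code vector is shifted by the sum of those vectors weighted by P(section | label). The function name `adapt_codebook` and all numeric values are hypothetical.

```python
def adapt_codebook(codebook, section_given_label, speaker_means, baseform_means):
    """Shift each label's code vector by the probability-weighted sum
    of the per-section movement vectors (S_i - B_i)."""
    moves = [
        [s - b for s, b in zip(s_vec, b_vec)]
        for s_vec, b_vec in zip(speaker_means, baseform_means)
    ]
    adapted = {}
    for label, prototype in codebook.items():
        weights = section_given_label[label]  # P(section i | label)
        shift = [
            sum(w * move[d] for w, move in zip(weights, moves))
            for d in range(len(prototype))
        ]
        adapted[label] = [p + s for p, s in zip(prototype, shift)]
    return adapted

# Two sections, two labels, two-dimensional features (toy values).
codebook = {"F1": [0.0, 0.0], "F2": [1.0, 1.0]}
section_given_label = {"F1": [1.0, 0.0], "F2": [0.5, 0.5]}
speaker_means = [[1.0, 2.0], [3.0, 2.0]]    # S1, S2
baseform_means = [[0.0, 1.0], [1.0, 2.0]]   # B1, B2
print(adapt_codebook(codebook, section_given_label, speaker_means, baseform_means))
```

    Here F1 occurs only in section 1, so it moves by that section's full movement vector, while F2 is split evenly and moves by the average of the two.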
