MARK INSERTION DEVICE AND ITS METHOD

    公开(公告)号:JP2001083987A

    公开(公告)日:2001-03-30

    申请号:JP24331199

    申请日:1999-08-30

    Applicant: IBM

    Abstract: PROBLEM TO BE SOLVED: To insert punctuation marks on suitable positions in a sentence. SOLUTION: An acoustic processing part 20 processes inputted voice data and converts the data into characteristic vectors. When punctuation mark automatic insertion is not executed, a language mark-reproduction part 22 processes the characteristic vectors by using only a versatile language model 320, and inserts a punctuation mark on a part where insertion of a punctuation mark is shown clearly, for example, 'a comma' or the like, by voice data. When the punctuation mark automatic insertion is executed, the language mark- reproduction part 22 discriminates a pause part having no voice as a comma ',' or the like by using the versatile language model 320 and a punctuation language model 322.

    Voice recognition system and method
    2.
    发明专利
    Voice recognition system and method 有权
    语音识别系统和方法

    公开(公告)号:JP2010139963A

    公开(公告)日:2010-06-24

    申请号:JP2008318403

    申请日:2008-12-15

    Abstract: PROBLEM TO BE SOLVED: To provide a practical system etc. for voice recognition, in which recognition performance is improved by considering utterance variation.
    SOLUTION: The system includes a voice recognition device 200 and a pre-processor 100 for creating a recognition graph used for voice recognition processing by the voice recognition device 200. The pre-processor 100 comprises: a language model estimation section 110 for estimating a language model; a recognition word dictionary section 130 holding corresponding information to a word, a phoneme string just in the same description as in the word, and to information on the phoneme string in which utterance variation is described; and a recognition graph creating section 140 for creating a recognition graph on the basis of a language model estimated by a language model estimation section 110, and the correspondence information held by the recognition word dictionary section 130 regarding the word included in the language model. The recognition graph creating section 140 creates the recognition graph by applying the phoneme string considering utterance variation regarding the word with respect to the word included in a word string composed of more than a fixed number of words.
    COPYRIGHT: (C)2010,JPO&INPIT

    Abstract translation: 要解决的问题:提供语音识别的实用系统等,其中通过考虑话语变化来提高识别性能。 解决方案:该系统包括用于创建用于由语音识别装置200进行语音识别处理的识别图形的语音识别装置200和预处理器100.预处理器100包括:语言模型估计部分110,用于 估计语言模型; 将对应的信息保存到单词的识别词典部分130,与该单词相同的描述中的音素串,以及描述话音变化的音素串的信息; 以及用于基于由语言模型估计部分110估计的语言模型创建识别图形的识别图形创建部分140,以及由识别词典词典部分130保持的关于语言模型中包含的词语的对应信息。 识别图形创建部分140通过应用音素串来考虑与包含在由多于固定数量的单词组成的单词串中的单词相关的单词的发音变化来应用音素串来创建识别图。 版权所有(C)2010,JPO&INPIT

    Technique to search new phrase to be registered in voice processing dictionary
    3.
    发明专利
    Technique to search new phrase to be registered in voice processing dictionary 有权
    在语音处理词典中注册新技术的技巧

    公开(公告)号:JP2008151926A

    公开(公告)日:2008-07-03

    申请号:JP2006338454

    申请日:2006-12-15

    CPC classification number: G06F17/2735 G06F17/277

    Abstract: PROBLEM TO BE SOLVED: To search a new phrase to be registered in a dictionary of a dividing means which breakes down a text into phrases.
    SOLUTION: This system inputs a text for learning into a dividing means to break down into phrases to produce break down candidates including the phrases different in combination according to the obtained break down reliability. It sums up the reliability of the break down candidates including those phrases for each phrase to find out their likelihood. Then, it finds out the combination minimizing the information entropy of the phrase considered to appear at the frequency matching the likelihood of the phrases in the combination within the extent that the text can be expressed by using the phrases included in a combination among the combinations of phrases included at least in one candidate, and to outputs it as a combination of phrases including the new phrase.
    COPYRIGHT: (C)2008,JPO&INPIT

    Abstract translation: 要解决的问题:搜索要将文本破解成短语的分割装置的字典中注册的新短语。

    解决方案:该系统将用于学习的文本输入到分割装置中以分解成短语,以根据获得的分解可靠性产生包括不同组合的短语的候选人。 它总结了分解候选人的可靠性,包括每个短语的短语,以找出它们的可能性。 然后,发现组合中最小化被认为出现在频率上的短语的信息熵,以匹配组合中的短语的可能性,该程度可以通过使用包含在组合中的组合来表达文本 短语至少包括在一个候选中,并将其作为包括新短语的短语的组合输出。 版权所有(C)2008,JPO&INPIT

    Speech recognition system, data processor, and its data processing method and program
    4.
    发明专利
    Speech recognition system, data processor, and its data processing method and program 有权
    语音识别系统,数据处理器及其数据处理方法和程序

    公开(公告)号:JP2005165066A

    公开(公告)日:2005-06-23

    申请号:JP2003405223

    申请日:2003-12-03

    CPC classification number: G10L15/26

    Abstract: PROBLEM TO BE SOLVED: To provide a data processing method suitable for transcribing speeches obtained in a special situation such as a trial and a meeting into a text by establishing proper correspondence between a text having been corrected and an original speech even if the text written down through speech recognition is corrected, and a system using the same. SOLUTION: The system is equipped with: a speech recognition processing part 32 which specifies utterance sections in speech data, performing speech recognition of respective utterance sections, and correlates the obtained character strings of recognition data of each utterance section and the speech data according to information on utterance time; and an output control part 34 which displays a text created by sorting recognition data for each utterance section. The system is further equipped with: a text editing part 35 which edits the created text; and a speech correspondence estimation part 36 which correlates character strings in the edited text to the speech data by using a dynamic programming technique. COPYRIGHT: (C)2005,JPO&NCIPI

    Abstract translation: 要解决的问题:为了提供一种数据处理方法,适用于将特殊情况(例如审判和会议)中获得的演讲转录成文本,通过建立正确的文本与原始语音之间的正确对应关系,即使 通过语音识别记录的文本被更正,并且使用相同的系统。 解决方案:该系统配备有:语音识别处理部分32,其指定语音数据中的话语部分,执行各个发音部分的语音识别,并且将获得的每个发音部分的识别数据的字符串与语音数据相关联 根据说话时间的信息; 以及输出控制部34,其显示通过对每个发音部分分类识别数据而创建的文本。 该系统还配备有:文本编辑部分35,其编辑所创建的文本; 以及通过使用动态编程技术将编辑的文本中的字符串与语音数据相关联的语音对应估计部分36。 版权所有(C)2005,JPO&NCIPI

    Word estimating method, voice recognition method, voice recognition device using this method, and program
    5.
    发明专利
    Word estimating method, voice recognition method, voice recognition device using this method, and program 有权
    词汇估计方法,语音识别方法,使用该方法的语音识别装置和程序

    公开(公告)号:JP2003076392A

    公开(公告)日:2003-03-14

    申请号:JP2001254502

    申请日:2001-08-24

    CPC classification number: G10L15/193

    Abstract: PROBLEM TO BE SOLVED: To simultaneously estimate a word and a syntactic structure with a high precision by providing a probability model allowing selection of a range of a history used for estimation and using this probability model as a structural language model with respect to processing for estimating the next data element on the basis of the history having a tree structure. SOLUTION: With respect to a word estimating method for voice recognition using a computer, the tree structure of the history of words preceding a word as the estimation object is specified, and a context tree which is stored in a tree-like context tree storage part 40 and has information related to structures allowed for a sentence and appearance probabilities of words for these structures as nodes is referred to, and a word is estimated on the basis of the context tree and the specified sentence structure of the history.

    Abstract translation: 要解决的问题:通过提供允许选择用于估计的历史的范围的概率模型并且使用该概率模型作为用于估计的处理的结构语言模型来同时高精度地估计单词和句法结构 基于具有树结构的历史的下一个数据元素。 解决方案:关于使用计算机的语音识别的词估计方法,指定在词之前的词的历史的树结构作为估计对象,以及存储在树状上下文树存储部分中的上下文树 并且具有与允许用于句子的结构相关的信息以及作为节点的这些结构的单词的出现概率,并且基于上下文树和历史的指定句子结构来估计单词。

    DEVICE AND METHOD FOR VOICE RECOGNITION, COMPUTER SYSTEM, AND STORAGE MEDIUM

    公开(公告)号:JP2001188558A

    公开(公告)日:2001-07-10

    申请号:JP37041399

    申请日:1999-12-27

    Applicant: IBM

    Abstract: PROBLEM TO BE SOLVED: To provide a device and a method for voice recognition which have a higher recognition rate than conventionally. SOLUTION: Words are divided into redundant words and other normal words and any of a predicted word and a precedent word as a condition are predicted discriminatingly between those two to improve the precision of the word prediction at a redundant word peripheral part. To this end, the voice recognition device has an acoustic processing means which converts an analog voice input signal into a digital signal, a storage means which stores acoustic models having learnt features of sounds, a storage means having a dictionary which has both 1st language models learnt on the basis of a document containing redundant words and normal words other than the redundant words in advance and 2nd language models learnt on the basis of a document of only normal words, while ignoring redundant words, and a means which recognizes as an inputted voice the word having the highest probability by calculating probability, by using the acoustic models and dictionary for the digital signal.

    CHARACTER RECOGNITION AND CHARACTER COMPLEMENTING METHOD AND COMPUTER SYSTEM

    公开(公告)号:JPH0896085A

    公开(公告)日:1996-04-12

    申请号:JP22755994

    申请日:1994-09-22

    Applicant: IBM JAPAN

    Abstract: PURPOSE: To balance the limit of the number which can be presented and a request to perform a prediction as far as possible by effectively performing a switch as to whether performing only a one-character prediction or performing a word prediction. CONSTITUTION: A reading candidate character string is recognized from the reading information inputted from a coordinate input/display device 11 via a character input part 7. For every recognized reading candidate character string, the character which can be continuous to the reading candidate character string and the incidence probability (branch probability) are acquired by retrieving a dictionary. The probability L of a predicted character string is determined. At this time, whether the number of the predicted character string is more than the maximum number N to be presented to a user as candidate character strings or not is judged. The words are sorted in order of larger L. The difference of the sum total Lc of the L of the words up to an N number and the sum total Ld of the L of the words up to an N+1st number or after is a prescribed number or more, the word predicted by performing an extension is presented to the user.

    METHOD AND DEVICE FOR POST-CHARACTER RECOGNITION PROCESSING

    公开(公告)号:JPH06162274A

    公开(公告)日:1994-06-10

    申请号:JP30700692

    申请日:1992-11-17

    Applicant: IBM JAPAN

    Inventor: ITO NOBUYASU

    Abstract: PURPOSE:To enable a post-processing based on a transition probability in a language provided with a lot of character sets such as Japanese by adding attributes such as the parts of speech or the like to candidate characters obtained as the result of character recognition and evaluating the transition probability. CONSTITUTION:In a post-processing device for selecting the optimum combination of the candidate characters from the view point of a character transition probability from the strings (character lattices) of candidate character groups obtained as the result of recognizing Japanese character strings, a character/part of speech correspondence storage means 10 stores the parts of speech possibly adopted by the respective characters in the character strings for the respective characters and a part of speech corresponding means 11 makes the parts of speech correspond to the respective candidate characters based on stored contents. Also a character transition probability storage means 12 stores the transition probabilities that the respective characters corresponding to the parts of speech are connected to each other and a connection relation evaluation means 13 evaluates the connection relation of the candidate characters with the candidate characters in front in the character lattices for the respective candidate characters corresponding to the parts of speech based on the stored contents of the transition probability storage means 12. Then, an optimum pass selecting means 15 selects the candidate character whose connection relation is optimum.

    POST-PROCESSING METHOD FOR OCR-INPUTTED JAPANESE SENTENCE

    公开(公告)号:JPH05108891A

    公开(公告)日:1993-04-30

    申请号:JP25719491

    申请日:1991-09-10

    Applicant: IBM

    Abstract: PURPOSE: To execute the post processing of a Japanese sentence inputted from an OCR at sufficiently high accuracy and speed. CONSTITUTION: After searching grammatically formed passes based upon a recognition result and the restriction of Japanese, the cost of each available pass is calculated and a plurality of candidate passes having suitable cost values are selected. Then the conviction Cf of each character candidate (or plural candidates) of each column is calculated from the cost g (1) of a candidate pass passing the character candidate itself (or the candidates themselves) and the cost g (2) of a candidate pass passing another character candidate (or other character candidates). Thus the substitution of candidates or warning to an operator is executed, based upon the calculated value.

    Idle talk extraction system, method and program for extracting idle talk parts from conversation
    10.
    发明专利
    Idle talk extraction system, method and program for extracting idle talk parts from conversation 有权
    空闲提拉系统,从对话中提取空闲零件的方法和程序

    公开(公告)号:JP2013145429A

    公开(公告)日:2013-07-25

    申请号:JP2012004802

    申请日:2012-01-13

    CPC classification number: G06F17/3053 G06F17/2785 Y10S707/99933

    Abstract: PROBLEM TO BE SOLVED: To provide a technique for extracting idle talk parts from a conversation.SOLUTION: An idle talk extraction system for extracting idle talks from a conversation comprises: a first corpus including documents in a plurality of fields; a second corpus including only documents in a field to which the conversation belongs; a determination part to determine as a lower limit subject word a word for which an idf value for the first corpus and an idf value for the second corpus are each below a first prescribed threshold value, for words included in the second corpus; a score calculation part to calculate as a score a tf-idf value for each word included in the second corpus and, for the lower limit subject word, use a constant set as a lower limit instead of the tf-idf value; a clipping part to sequentially cut out intervals to be processed, from text data of contents of the conversation; and an extraction part to extract as an idle talk part an interval where an average value of the score of words included in the interval is larger than a second prescribed threshold value.

    Abstract translation: 要解决的问题:提供从会话中提取空闲谈话部分的技术。解决方案:一种用于从会话中提取空闲会话的空闲谈话提取系统包括:包括多个字段中的文档的第一语料库; 第二语料库,仅包括会话所属领域的文件; 确定部分,用于将包含在第二语料库中的单词确定为第一语料库的idf值和第二语料库的idf值的单词低于第一规定阈值的下限主题词语的单词; 分数计算部分,用于计算包括在第二语料库中的每个单词的tf-idf值作为分数,并且对于下限主题词,使用常数集作为下限而不是tf-idf值; 剪切部分,从会话的内容的文本数据中顺序地切出待处理的间隔; 以及提取部分,作为空闲谈话部分提取包括在所述间隔中的词的分数的平均值大于第二规定阈值的间隔。

Patent Agency Ranking