-
Publication No.: KR1020130005160A
Publication date: 2013-01-15
Application No.: KR1020110066574
Filing date: 2011-07-05
Applicant: 한국전자통신연구원
CPC classification number: H04M1/72552, G10L15/083, G10L15/30, H04M2250/74, H04W4/12
Abstract: PURPOSE: A message service method using a voice recognition function is provided that delivers a message combining the voice recognition result with the user's actual voice. CONSTITUTION: A message server (20) recognizes a voice transmitted from a transmission terminal (10) (S14). The message server generates a recognized result from the voice and an N-best result based on a confusion network (S16). The message server transmits the generated results to the transmission terminal (S20). The message server receives from the transmission terminal the selected message and an evaluation result for the accuracy of that message (S26). The message server transmits the message and the evaluation result to a reception terminal (30) (S32). [Reference numerals] (10) Transmission terminal; (20) Message server; (30) Reception terminal; (S10) Inputting voice; (S12, S40) Transmitting the voice; (S14) Recognizing the voice; (S16) Generating a recognized result and an N-best result; (S18) Storing log data; (S20) Transmitting the recognized result and the N-best result; (S22) Displaying the recognized result and the N-best result; (S24) Determining a message and an evaluation result; (S26, S32) Transmitting the message and the evaluation result; (S28) Storing additional log data; (S30) Correcting errors in the recognized result; (S34) Displaying the message and the evaluation result; (S36) Requesting the voice; (S38) Extracting the voice; (S42) Outputting the voice
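One way to picture step S16 is to read an N-best list off a confusion network. The following is a minimal sketch, not the method claimed in the patent: it assumes the network is a list of slots, each holding `(word, posterior)` alternatives, and it scores a path by the product of its posteriors. The function name `n_best` and the example data are hypothetical.

```python
import heapq
import itertools
import math


def n_best(confusion_network, n):
    """Return the n highest-scoring word sequences from a confusion network.

    confusion_network: list of slots; each slot is a list of (word, posterior)
    pairs. An empty word "" stands for "no word in this slot" (assumption).
    """
    scored = (
        (math.prod(p for _, p in path), " ".join(w for w, _ in path if w))
        for path in itertools.product(*confusion_network)
    )
    # Exhaustive enumeration is exponential in the number of slots; it is
    # only meant for small illustrative networks.
    return [text for _, text in heapq.nlargest(n, scored)]


network = [
    [("send", 0.9), ("sent", 0.1)],
    [("the", 0.6), ("a", 0.4)],
    [("file", 0.7), ("pile", 0.3)],
]
candidates = n_best(network, 3)
```

In the flow described by the abstract, a list such as `candidates` would be what the transmission terminal displays (S22) so that the user can pick the message to send.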
-
Publication No.: KR1020120066530A
Publication date: 2012-06-22
Application No.: KR1020100127907
Filing date: 2010-12-14
Applicant: 한국전자통신연구원
CPC classification number: G10L15/065, G10L15/187
Abstract: PURPOSE: An apparatus for estimating a language model weight is provided to improve the secondary search and thereby the performance of a voice recognition system. CONSTITUTION: The apparatus comprises: a first search unit (101) that performs a primary search by applying a first language model; a phoneme recognition unit (102) that outputs a second acoustic score by applying an acoustic model to an acoustic feature vector; a weight estimation unit (103) that outputs a first language model weight when the acoustic score of the voice recognition result is higher than the acoustic score of the phoneme recognition result; and a second search unit (104) that applies the second language model weight to a word lattice.
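The weight selection and second-pass search can be illustrated roughly as follows. This is only a sketch under stated assumptions: the abstract gives just one branch of the comparison, so the `fallback_weight` branch is a guess; the combined score `acoustic + weight * lm` is the conventional log-domain form rather than something the abstract specifies; and the lattice is flattened to a short list of hypotheses. All names are hypothetical.

```python
def choose_lm_weight(recognition_acoustic, phoneme_acoustic,
                     first_weight, fallback_weight):
    """Pick a language model weight by comparing two acoustic scores."""
    if recognition_acoustic > phoneme_acoustic:
        return first_weight
    # Assumption: the abstract does not say what happens in this case.
    return fallback_weight


def rescore(hypotheses, lm_weight):
    """Return the text of the best hypothesis under the given weight.

    hypotheses: list of (text, acoustic_log_score, lm_log_prob) tuples,
    standing in for paths through a word lattice.
    """
    return max(hypotheses, key=lambda h: h[1] + lm_weight * h[2])[0]


hypotheses = [
    ("recognize speech", -100.0, -8.0),
    ("wreck a nice beach", -98.0, -14.0),
]
weight = choose_lm_weight(-90.0, -95.0, first_weight=1.0, fallback_weight=0.1)
best = rescore(hypotheses, weight)
```

With the larger weight the language model dominates and the first hypothesis wins; with the smaller weight the acoustically better second hypothesis wins, which is why the choice of weight matters for the second pass.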
-
Publication No.: KR1020120056661A
Publication date: 2012-06-04
Application No.: KR1020100118310
Filing date: 2010-11-25
Applicant: 한국전자통신연구원
Abstract: PURPOSE: A voice signal pre-processing device and a method thereof are provided to interpolate and restore a voice signal whose amplitude is abnormal in mobile environments, thereby improving voice recognition performance. CONSTITUTION: A voiced sound section detecting unit (120) detects, within a voice section, a voiced sound section that contains a voiced sound signal. A pre-processing method determining unit (140) detects a clipped signal generated during the voiced sound section. A clipping signal processing unit (160) extracts signal samples adjacent to the clipped signal and interpolates the clipped signal by using those samples.
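The interpolation step can be sketched as follows. This is an illustration under assumptions, not the patented procedure: clipping is detected by a simple amplitude limit, voiced-section detection is skipped, and the clipped run is rebuilt with a cubic Hermite curve whose end slopes come from the two unclipped samples on each side. The abstract does not name an interpolation method, and the function names and the `limit` parameter are hypothetical.

```python
def find_clipped_runs(samples, limit):
    """Return (first, last) index pairs of consecutive samples at or above limit."""
    runs, start = [], None
    for i, s in enumerate(samples):
        if abs(s) >= limit:
            if start is None:
                start = i
        elif start is not None:
            runs.append((start, i - 1))
            start = None
    if start is not None:
        runs.append((start, len(samples) - 1))
    return runs


def repair_clipping(samples, limit):
    """Replace each clipped run with a cubic Hermite curve through its neighbors."""
    out = list(samples)
    for first, last in find_clipped_runs(samples, limit):
        left, right = first - 1, last + 1
        # Two unclipped samples are needed on each side to estimate slopes.
        if left < 1 or right > len(samples) - 2:
            continue
        if abs(samples[left - 1]) >= limit or abs(samples[right + 1]) >= limit:
            continue
        m0 = samples[left] - samples[left - 1]    # slope entering the run
        m1 = samples[right + 1] - samples[right]  # slope leaving the run
        span = right - left
        for i in range(first, last + 1):
            t = (i - left) / span
            h00 = 2 * t**3 - 3 * t**2 + 1
            h10 = t**3 - 2 * t**2 + t
            h01 = -2 * t**3 + 3 * t**2
            h11 = t**3 - t**2
            out[i] = (h00 * samples[left] + h10 * span * m0
                      + h01 * samples[right] + h11 * span * m1)
    return out


clipped = [0.0, 0.4, 0.8, 1.0, 1.0, 1.0, 0.8, 0.4, 0.0]
restored = repair_clipping(clipped, limit=1.0)
```

In this example the flat top at 1.0 is replaced by a rounded peak that rises above the clipping limit, while the unclipped samples are left unchanged.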
-
Publication No.: KR1020120026357A
Publication date: 2012-03-19
Application No.: KR1020100088526
Filing date: 2010-09-09
Applicant: 한국전자통신연구원
Abstract: PURPOSE: A device for driving a voice recognition system is provided that starts voice recognition when the user utters a pre-stored keyword, without any additional key operation, thereby improving user convenience. CONSTITUTION: When a user speaks a keyword to be registered, a user registration unit (100) calculates a threshold value from the keyword and stores the threshold value in a storage unit (114). When utterance data is input, a voice recognition and driving unit (150) calculates a likelihood ratio for that data and drives the system by comparing the likelihood ratio with the threshold value.
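The registration and wake-up decision can be sketched as below. The abstract does not say how the threshold or the likelihood ratio is computed, so everything here is an assumption: the ratio is taken in the log domain as keyword-model log-likelihood minus background-model log-likelihood, and the threshold is the mean of the registration ratios minus a margin times their standard deviation. The function names and the `margin` parameter are hypothetical.

```python
import statistics


def log_likelihood_ratio(keyword_loglik, background_loglik):
    """Log-domain likelihood ratio of keyword model versus background model."""
    return keyword_loglik - background_loglik


def estimate_threshold(enrollment_llrs, margin=1.0):
    """Derive a per-user threshold from ratios observed during registration."""
    mean = statistics.fmean(enrollment_llrs)
    spread = statistics.pstdev(enrollment_llrs)
    return mean - margin * spread


def should_wake(keyword_loglik, background_loglik, threshold):
    """Start the recognizer only if the ratio reaches the stored threshold."""
    return log_likelihood_ratio(keyword_loglik, background_loglik) >= threshold


threshold = estimate_threshold([4.0, 6.0])
accepted = should_wake(-10.0, -15.0, threshold)
rejected = should_wake(-10.0, -12.0, threshold)
```

Subtracting a margin from the mean is one simple way to tolerate natural variation between repetitions of the keyword; a real system would tune this trade-off between false accepts and false rejects.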