-
公开(公告)号:US20190139535A1
公开(公告)日:2019-05-09
申请号:US16239891
申请日:2019-01-04
Applicant: Yamaha Corporation
Inventor: Hiraku KAYAMA , Hiroaki MATSUBARA
IPC: G10L13/033 , G10L21/0364 , G10L25/90
Abstract: This invention is an improvement of technology for automatically generating response voice to voice uttered by a speaker (user), and is characterized by controlling a pitch of the response voice in accordance with a pitch of the speaker's utterance. A voice signal of the speaker's utterance (e.g., question) is received, and a pitch (e.g., highest pitch) of a representative portion of the utterance is detected. Voice data of a responsive to the utterance is acquired, and a pitch (e.g., average pitch) based on the acquired response voice data is acquired. A pitch shift amount for shifting the acquired pitch to a target pitch having a particular relationship to the pitch of the representative portion is determined. When response voice is to be synthesized on the basis of the response voice data, the pitch of the response voice to be synthesized is shifted in accordance with the pitch shift amount.
-
公开(公告)号:US20170116978A1
公开(公告)日:2017-04-27
申请号:US15316850
申请日:2015-07-02
Applicant: Yamaha Corporation
Inventor: Hiroaki MATSUBARA
IPC: G10L13/027 , G10L13/08 , G10L13/047 , G10L15/22 , G10L15/18
CPC classification number: G10L13/027 , G10L13/047 , G10L13/08 , G10L15/1815 , G10L15/22
Abstract: A voice synthesizing apparatus includes: a voice inputter (102) configured to input a voice; an obtainer (22) configured to obtain a primary response to the voice inputted by the voice inputter (102); an analyzer (112) configured to analyze whether the primary response includes a repetition target; and a voice synthesizer (24) configured to, in a case where the analyzed primary response is determined to include the repetition target, synthesize a voice from a secondary response that includes the repetition target repeated at least twice to output the voice.
-
公开(公告)号:US20170221470A1
公开(公告)日:2017-08-03
申请号:US15491414
申请日:2017-04-19
Applicant: Yamaha Corporation
Inventor: Hiraku KAYAMA , Hiroaki MATSUBARA
IPC: G10L13/033 , G10L21/0364 , G10L25/90
CPC classification number: G10L13/0335 , G10L15/22 , G10L21/0364 , G10L25/90
Abstract: This invention is an improvement of technology for automatically generating response voice to voice uttered by a speaker (user), and is characterized by controlling a pitch of the response voice in accordance with a pitch of the speaker's utterance. A voice signal of the speaker's utterance (e.g., question) is received, and a pitch (e.g., highest pitch) of a representative portion of the utterance is detected. Voice data of a responsive to the utterance is acquired, and a pitch (e.g., average pitch) based on the acquired response voice data is acquired. A pitch shift amount for shifting the acquired pitch to a target pitch having a particular relationship to the pitch of the representative portion is determined. When response voice is to be synthesized on the basis of the response voice data, the pitch of the response voice to be synthesized is shifted in accordance with the pitch shift amount.
-
公开(公告)号:US20190392814A1
公开(公告)日:2019-12-26
申请号:US16561348
申请日:2019-09-05
Applicant: YAMAHA CORPORATION
Inventor: Hiraku KAYAMA , Hiroaki MATSUBARA , Junya URA
Abstract: A voice dialogue apparatus includes a pitch adjusting unit configured to shift pitches of an entire period of a preceding voice, which is reproduced before a dialogue voice for a dialogue, according to a pitch of the dialogue voice, a first reproduction instructing unit configured to instruct reproduction of the preceding voice having been adjusted with the pitch adjusting unit, and a second reproduction instructing unit configured to instruct reproduction of the dialogue voice after the reproduction of the preceding voice with the first reproduction instructing unit.
-
公开(公告)号:US20180130462A1
公开(公告)日:2018-05-10
申请号:US15862096
申请日:2018-01-04
Applicant: YAMAHA CORPORATION
Inventor: Hiraku KAYAMA , Hiroaki MATSUBARA
Abstract: A voice interaction method includes acquiring a voice utterance signal representative of an uttered voice, and acquiring a response signal representative of a response voice responsive to a content of the uttered voice identified by voice recognition of the voice utterance signal. The voice interaction method also includes supplying the response signal to a voice player, to have the response voice played by the voice player, and supplying an interjection signal representative of interjection voice to the voice player, to have the interjection voice played by the voice player during a wait period that starts from an end point of the uttered voice and ends at a start of playback of the response voice.
-
6.
公开(公告)号:US20160086597A1
公开(公告)日:2016-03-24
申请号:US14892624
申请日:2014-06-02
Applicant: YAMAHA CORPORATION
Inventor: Hiroaki MATSUBARA , Junya URA , Takehiko KAWAHARA , Yuji HISAMINATO , Katsuji YOSHIMURA
IPC: G10L13/033 , G10L15/18 , G10L25/90
CPC classification number: G10L13/0335 , G10L13/027 , G10L13/033 , G10L13/06 , G10L13/10 , G10L15/18 , G10L25/90 , H04M2201/39
Abstract: The present invention is provided with: a voice input section that receives a remark (a question) via a voice signal; a reply creation section that creates a voice sequence of a reply (response) to the remark; a pitch analysis section that analyzes the pitch of a first segment (e.g., word ending) of the remark; and a voice generation section (a voice synthesis section, etc.) that generates a reply, in the form of voice, represented by the voice sequence. The voice generation section controls the pitch of the entire reply in such a manner that the pitch of a second segment (e.g., word ending) of the reply assumes a predetermined pitch (e.g., five degrees down) with respect to the pitch of the first segment of the remark. Such arrangements can realize synthesis of replying voice capable of giving a natural feel to the user.
Abstract translation: 本发明提供有:语音输入部分,经由语音信号接收注释(问题); 回复创建部分,其创建对该评论的回复(响应)的语音序列; 音调分析部分,用于分析语句的第一段(例如,结尾)的音调; 以及语音生成部(声音合成部等),以语音的形式产生由语音序列表示的应答。 语音产生部分以这样一种方式来控制整个答复的音调,使得答复的第二片段(例如,结尾的字节)的音调相对于第一节目的节距呈现预定的音调(例如,五度下降) 段的段落。 这样的配置可以实现能够给予用户自然感觉的回复语音的综合。
-
公开(公告)号:US20170110111A1
公开(公告)日:2017-04-20
申请号:US15375984
申请日:2016-12-12
Applicant: YAMAHA CORPORATION
Inventor: Hiroaki MATSUBARA , Junya URA , Takehiko KAWAHARA , Yuji HISAMINATO , Katsuji YOSHIMURA
IPC: G10L13/033 , G10L13/027 , G10L15/18 , G10L25/90
CPC classification number: G10L13/0335 , G10L13/027 , G10L13/033 , G10L13/06 , G10L13/10 , G10L15/18 , G10L25/90 , H04M2201/39
Abstract: The present invention is provided with: a voice input section that receives a remark (a question) via a voice signal; a reply creation section that creates a voice sequence of a reply (response) to the remark; a pitch analysis section that analyzes the pitch of a first segment (e.g., word ending) of the remark; and a voice generation section (a voice synthesis section, etc.) that generates a reply, in the form of voice, represented by the voice sequence. The voice generation section controls the pitch of the entire reply in such a manner that the pitch of a second segment (e.g., word ending) of the reply assumes a predetermined pitch (e.g., five degrees down) with respect to the pitch of the first segment of the remark. Such arrangements can realize synthesis of replying voice capable of giving a natural feel to the user.
-
-
-
-
-
-