-
公开(公告)号:US11217230B2
公开(公告)日:2022-01-04
申请号:US16472544
申请日:2018-11-01
Applicant: SONY CORPORATION
Inventor: Hiro Iwase , Shinichi Kawano , Yuhei Taki , Kunihito Sawai
Abstract: There is provided an information processing device and an information processing method that enable speeding up of a responsivity of a system response to a speech of a user. The information processing device includes a processing unit configured to determine, on the basis of a result of semantic analysis that is to be obtained from an interim result of speech recognition of a speech of a user, presence or absence of a response to the speech of the user. It thereby becomes possible to speed up a responsivity of a system response to the speech of the user. The present technology can be applied to a speech dialogue system, for example.
-
公开(公告)号:US11869499B2
公开(公告)日:2024-01-09
申请号:US17268421
申请日:2019-07-01
Applicant: Sony Corporation
Inventor: Yuhei Taki , Hiro Iwase , Kunihito Sawai , Masaki Takase , Akira Miyashita
CPC classification number: G10L15/22 , G10L15/02 , G10L15/28 , G10L2015/223
Abstract: An information processing apparatus includes an extracting unit (133) that extracts a changing message related to a change in macro data (M), the changing message including at least one piece of first information indicating a function to be executed, and second information linked to the first information, from a user speech; a presuming unit (134) that presumes an element to be changed in the macro data (M) based on the changing message extracted by the extracting unit (133); and a changing unit (135) that changes the element to be changed in the macro data (M) presumed by the presuming unit (134), based on the changing message.
-
公开(公告)号:US11335334B2
公开(公告)日:2022-05-17
申请号:US16464494
申请日:2018-10-19
Applicant: SONY CORPORATION
Inventor: Hiro Iwase , Shinichi Kawano , Yuhei Taki , Kunihito Sawai
Abstract: There is provided an information processing device and an information processing method that enable the intention of a speech of a user to be estimated more accurately. The information processing device includes: a detection unit configured to detect a breakpoint of a speech of a user on the basis of a result of recognition that is to be obtained during the speech of the user; and an estimation unit configured to estimate an intention of the speech of the user on the basis of a result of semantic analysis of a divided speech sentence obtained by dividing a speech sentence at the detected breakpoint of the speech. The present technology can be applied, for example, to a speech dialogue system.
-
公开(公告)号:US11335322B2
公开(公告)日:2022-05-17
申请号:US16478602
申请日:2018-02-27
Applicant: SONY CORPORATION
Inventor: Hiro Iwase , Mari Saito , Shinichi Kawano
IPC: G10L13/02 , G10L13/047 , G10L15/16 , G10L15/22 , G10L15/25 , G10L25/84 , G06T1/00 , G10L21/00 , G10L25/63 , G10L15/00
Abstract: The present technology relates to a learning device, a learning method, a voice synthesis device, and a voice synthesis method configured so that information can be provided via voice allowing easy understanding of contents by a user as a speech destination. A learning device according to one embodiment of the present technology performs voice recognition of speech voice of a plurality of users, estimates statuses when a speech is made, and learns, on the basis of speech voice data, a voice recognition result, and the statuses when the speech is made, voice synthesis data to be used for generation of synthesized voice according to statuses upon voice synthesis. Moreover, a voice synthesis device estimates statuses, and uses the voice synthesis data to generate synthesized voice indicating the contents of predetermined text data and obtained according to the estimated statuses. The present technology can be applied to an agent device.
-
公开(公告)号:US11250873B2
公开(公告)日:2022-02-15
申请号:US16633161
申请日:2018-04-24
Applicant: Sony Corporation
Inventor: Hiro Iwase , Shinichi Kawano , Mari Saito , Yuhei Taki
IPC: G10L25/54 , G10L15/22 , G10L21/028 , G10L25/84
Abstract: Provided is an information processing device including an output control unit that controls presentation of content to a user, and when a non-viewing/listening period is detected in a viewing and listening behavior of the user corresponding to the content, causes a summary of the content to be output. The output control unit determines an amount of information in the summary of the content, based on the length of the non-viewing/listening period. Moreover, provided is an information processing method including: by a processor, controlling presentation of content to a user; and when a non-viewing/listening period is detected in a viewing and listening behavior of the user corresponding to the content, causing a summary of the content to be output. The causing the summary of the content to be output further includes determining an amount of information in the summary of the content, based on the length of the non-viewing/listening period.
-
公开(公告)号:US11183170B2
公开(公告)日:2021-11-23
申请号:US16321328
申请日:2017-08-03
Applicant: SONY CORPORATION
Inventor: Hiro Iwase , Mari Saito , Shinichi Kawano
Abstract: The present technology relates to an interaction control apparatus and a method that enable more appropriate interaction control to be performed. The interaction control apparatus includes an interaction progress controller that causes an utterance to be made in one or a plurality of understanding action request positions on the basis of utterance text that has been divided in the one or the plurality of understanding action request positions, the utterance inducing a user to perform an understanding action, and that controls a next utterance on the basis of a result of detecting the understanding action and the utterance text. The present technology is applicable to a speech interaction system.
-
公开(公告)号:US12147808B2
公开(公告)日:2024-11-19
申请号:US15733885
申请日:2019-03-08
Applicant: SONY CORPORATION
Inventor: Hiro Iwase , Yuhei Taki , Kunihito Sawai
Abstract: To automatically determine a more memorable macro name. Provided is an information processing device that comprises an utterance learning adaptation unit that executes clustering pertaining to a plurality of function execution instructions by a user and estimates, as a macro, a cluster that includes the plurality of function execution instructions and a response control unit that controls the presentation of information pertaining to the macro, wherein the utterance learning adaptation unit determines a name for the estimated macro on the basis of a context acquired at the time of issuing the plurality of function execution instructions included in the cluster, the response control unit controls a notification of the macro name to the user, and the plurality of function execution instructions include at least one function execution instruction issued via an utterance.
-
公开(公告)号:US12062360B2
公开(公告)日:2024-08-13
申请号:US16972420
申请日:2019-03-12
Applicant: SONY CORPORATION
Inventor: Hiro Iwase , Yuhei Taki , Kunihito Sawai
IPC: G10L15/065 , G10L15/08 , G10L15/18 , G10L15/22 , G10L15/28
CPC classification number: G10L15/065 , G10L15/1815 , G10L15/22 , G10L15/28 , G10L2015/088
Abstract: The present invention has an issue of effectively reducing the input load related to a voice trigger. There is provided an information processing device comprising a registration control unit that dynamically controls registration of startup phrases used as start triggers of a voice interaction session, in which the registration control unit temporarily additionally registers at least one of the startup phrases based on input voice. There is also provided an information processing method comprising dynamically controlling, by a processor, registration of startup phrases used as start triggers of a voice interaction session, in which the controlling further includes temporarily additionally registering at least one of the startup phrases based on input voice.
-
公开(公告)号:US12033624B2
公开(公告)日:2024-07-09
申请号:US17250627
申请日:2019-07-01
Applicant: SONY CORPORATION
Inventor: Hiro Iwase , Kunihito Sawai , Yuhei Taki , Masaki Takase , Akira Miyashita
CPC classification number: G10L15/22 , G06F21/554 , G10L15/10 , G06F2221/033
Abstract: An information processing apparatus includes: a determination unit (250) that, in a case where the determination unit (250) has recognized a user's phrase related to execution of a macro including at least one function execution instruction, determines a degree of security risk of the macro based on at least one of a matching rate, at the time of execution of the macro, of a context indicating a status of the user, or frequency of occurrence of the phrase; and a response control unit (270) that changes control of the execution of the macro based on a determination result of the determination unit (250).
-
公开(公告)号:US11803352B2
公开(公告)日:2023-10-31
申请号:US16970080
申请日:2018-12-06
Applicant: SONY CORPORATION
Inventor: Shinichi Kawano , Yuhei Taki , Hiro Iwase
CPC classification number: G06F3/167 , G06F3/017 , G06F3/165 , G10L15/22 , G10L25/63 , G06F2203/011 , G10L2015/223
Abstract: Provided is an information processing apparatus and an information processing method which can adaptively switch a user interface to be used by a user to environmental information. The information processing apparatus includes an interface control unit that switches a user interface to be used by a first user at least between a first user interface using voice and a second user interface different from the first user interface on the basis of environmental information.
-
-
-
-
-
-
-
-
-