Electronic device and control method thereof

    公开(公告)号:US11238871B2

    公开(公告)日:2022-02-01

    申请号:US16665532

    申请日:2019-10-28

    Abstract: An electronic apparatus and a control method are provided, including an input interface, a communication interface, a memory including at least one command, and at least one processor configured to control the electronic device and execute the at least one command to receive a user speech through the input interface, determine whether or not the user speech is a speech related to a task requiring user confirmation by analyzing the user speech, generate a question for the user confirmation when it is determined that the user speech is the speech related to the task requiring the user confirmation, and perform a task corresponding to the user speech when a user response corresponding to the question is input through the input interface. Embodiments may use an artificial intelligence model learned according to at least one of machine learning, a neural network, and a deep learning algorithm.

    System and method for recognizing user's speech

    公开(公告)号:US11532310B2

    公开(公告)日:2022-12-20

    申请号:US16988929

    申请日:2020-08-10

    Abstract: Provided is a system and method for recognizing a user's speech. A method, performed by a server, of providing a text string for a speech signal input to a device includes: receiving, from the device, an encoder output value derived from an encoder of an end-to-end automatic speech recognition (ASR) model included in the device; identifying a domain corresponding to the received encoder output value; selecting a decoder corresponding to the identified domain from among a plurality of decoders of an end-to-end ASR model included in the server; obtaining a text string from the received encoder output value using the selected decoder; and providing the obtained text string to the device.

    System and method for recognizing user's speech

    公开(公告)号:US11475896B2

    公开(公告)日:2022-10-18

    申请号:US16988929

    申请日:2020-08-10

    Abstract: Provided is a system and method for recognizing a user's speech. A method, performed by a server, of providing a text string for a speech signal input to a device includes: receiving, from the device, an encoder output value derived from an encoder of an end-to-end automatic speech recognition (ASR) model included in the device; identifying a domain corresponding to the received encoder output value; selecting a decoder corresponding to the identified domain from among a plurality of decoders of an end-to-end ASR model included in the server; obtaining a text string from the received encoder output value using the selected decoder; and providing the obtained text string to the device.

    Electronic device and method of controlling thereof

    公开(公告)号:US11551671B2

    公开(公告)日:2023-01-10

    申请号:US16872559

    申请日:2020-05-12

    Abstract: An electronic device and a method for controlling the electronic device are disclosed. The electronic device of the disclosure includes a microphone, a memory storing at least one instruction, and a processor configured to execute the at least one instruction. The processor, by executing the at least one instruction, is configured to: obtain second voice data by inputting first voice data input via the microphone to a first model trained to enhance sound quality, obtain a weight by inputting the first voice data and the second voice data to a second model, and identify input data to be input to a third model using the weight.

    Method and device for speech recognition

    公开(公告)号:US11302331B2

    公开(公告)日:2022-04-12

    申请号:US16750274

    申请日:2020-01-23

    Abstract: Provided are an electronic device for recognizing speech of a user, and a method, performed by the electronic device, of recognizing speech. The method includes obtaining an audio signal based on a speech input based on the audio signal being input, obtaining an output value of a first automatic speech recognition (ASR) model that outputs a character string at a first level; obtaining an output value of a second ASR model that outputs a character string at a second level corresponding to the audio signal based on the output value of the first ASR model based on the audio signal being input; and recognizing the speech from the output value of the second ASR model.

Patent Agency Ranking