Abstract:
Systems and processes for providing user-specific acoustic models are provided. In accordance with one example, a method includes, at an electronic device having one or more processors, receiving a plurality of speech inputs, each of the speech inputs associated with a same user of the electronic device; providing each of the plurality of speech inputs to a user-independent acoustic model, the user-independent acoustic model providing a plurality of speech results; initiating a user-specific acoustic model on the electronic device; and adjusting the user-specific acoustic model based on the plurality of speech inputs and the plurality of speech results.
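The adaptation loop described above can be sketched as follows. This is a minimal illustration, not the claimed implementation: the abstract does not specify a training algorithm, so `UserSpecificModel`, its `update` method, and the use of the user-independent model's results as supervision targets are all assumptions.

```python
class UserSpecificModel:
    """Toy stand-in for a user-specific acoustic model (illustrative only)."""

    def __init__(self):
        self.examples = []

    def update(self, utterance, result):
        # Record the supervised pair produced by the user-independent model.
        self.examples.append((utterance, result))


def adapt_user_model(user_model, speech_inputs, independent_model):
    # The user-independent model supplies a recognition result for each
    # of the user's utterances; those (input, result) pairs are then used
    # to adjust the user-specific model.
    results = [independent_model(utterance) for utterance in speech_inputs]
    for utterance, result in zip(speech_inputs, results):
        user_model.update(utterance, result)
    return results
```

In this sketch the user-independent model is any callable mapping speech input to a recognition result; a real system would extract acoustic features and run a trained recognizer.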
Abstract:
Speech recognition is performed on an utterance received from a user to determine a plurality of candidate text representations of the utterance, including a primary text representation and one or more alternative text representations. Natural language processing is performed on the primary text representation to determine a plurality of candidate actionable intents, including a primary actionable intent and one or more alternative actionable intents. A result is determined based on the primary actionable intent. The result is provided to the user. A recognition correction trigger is detected. In response to detecting the recognition correction trigger, a set of alternative intent affordances and a set of alternative text affordances are concurrently displayed.
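The correction flow above can be sketched as a small function that, on a correction trigger, builds affordances for both alternative sets so they can be displayed concurrently. The affordance representation (plain dicts) and the convention that index 0 holds the primary result are assumptions for illustration; the abstract does not specify either.

```python
def on_correction_trigger(candidate_texts, candidate_intents):
    # candidate_texts[0] and candidate_intents[0] are the primary results;
    # the remaining entries are alternatives. Both alternative sets are
    # turned into affordances and returned together for concurrent display.
    alt_intent_affordances = [{"kind": "intent", "label": i}
                              for i in candidate_intents[1:]]
    alt_text_affordances = [{"kind": "text", "label": t}
                            for t in candidate_texts[1:]]
    return alt_intent_affordances + alt_text_affordances
```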
Abstract:
Systems and processes for automatic accent detection are provided. In accordance with one example, a method includes, at an electronic device with one or more processors and memory, receiving a user input, determining a first similarity between a representation of the user input and a first acoustic model of a plurality of acoustic models, and determining a second similarity between the representation of the user input and a second acoustic model of the plurality of acoustic models. The method further includes determining whether the first similarity is greater than the second similarity. In accordance with a determination that the first similarity is greater than the second similarity, the first acoustic model may be selected; and in accordance with a determination that the first similarity is not greater than the second similarity, the second acoustic model may be selected.
Abstract:
Systems and processes for performing a task with a digital assistant are provided. In accordance with one example, a method includes, at an electronic device having one or more processors, receiving a natural-language input; determining, based on the natural-language input, a first task and a first usefulness score associated with the first task; receiving, from another electronic device, a second task and a second usefulness score associated with the second task; determining whether the first usefulness score is higher than the second usefulness score; in accordance with a determination that the first usefulness score is higher than the second usefulness score: performing the first task determined by the electronic device; and providing an output indicating whether the first task has been performed; and in accordance with a determination that the second usefulness score is higher than the first usefulness score: performing the second task received from the other electronic device; and providing an output indicating whether the second task has been performed.
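The arbitration between the locally determined task and the remotely received task can be sketched as below. Tasks are modeled as callables purely for illustration, and the tie-breaking behavior (deferring to the remote task when scores are equal) is an assumption the abstract leaves open.

```python
def arbitrate(first_task, first_score, second_task, second_score):
    # first_task/first_score come from the local device; second_task/
    # second_score were received from another electronic device. Perform
    # whichever task has the higher usefulness score and return an output
    # indicating which task was performed.
    if first_score > second_score:
        first_task()
        return "performed first task"
    second_task()
    return "performed second task"
```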
Abstract:
The present disclosure generally relates to context-based endpoint detection in user speech input. A method for identifying an endpoint of a spoken request by a user may include receiving user input of natural language speech including one or more words; identifying at least one context associated with the user input; generating a probability, based on the at least one context associated with the user input, that a location in the user input is an endpoint; determining whether the probability is greater than a threshold; and in accordance with a determination that the probability is greater than the threshold, identifying the location in the user input as the endpoint.
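The threshold test at the heart of this method can be sketched as follows. The candidate locations, the context-conditioned probability function, and the threshold value are all illustrative stand-ins; the abstract specifies only the comparison itself.

```python
def identify_endpoint(locations, context_prob, threshold=0.5):
    # locations: candidate positions in the user input (e.g. word
    # boundaries). context_prob(loc) returns the probability, based on
    # the context associated with the user input, that loc is the
    # endpoint. The first location whose probability exceeds the
    # threshold is identified as the endpoint.
    for loc in locations:
        if context_prob(loc) > threshold:
            return loc
    return None  # no location cleared the threshold
```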
Abstract:
The present disclosure generally relates to methods and user interfaces for managing visual content at a computer system. In some embodiments, methods and user interfaces for managing visual content in media are described. In some embodiments, methods and user interfaces for managing visual indicators for visual content in media are described. In some embodiments, methods and user interfaces for inserting visual content in media are described. In some embodiments, methods and user interfaces for identifying visual content in media are described. In some embodiments, methods and user interfaces for translating visual content in media are described. In some embodiments, methods and user interfaces for managing user interface objects for visual content in media are described.
Abstract:
Systems and processes for processing speech in a digital assistant are provided. In one example process, a first speech input can be received from a user. The first speech input can be processed using a first automatic speech recognition system to produce a first recognition result. An input indicative of a potential error in the first recognition result can be received. The input can be used to improve the first recognition result. For example, the input can include a second speech input that is a repetition of the first speech input. The second speech input can be processed using a second automatic speech recognition system to produce a second recognition result.
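The two-pass flow in this example can be sketched as below. The recognizers are modeled as plain callables, and the rule of simply preferring the second recognizer's result for the repeated utterance is an assumption; the abstract says only that the input is used to improve the first result.

```python
def correct_recognition(first_input, first_asr, second_asr, repeated_input=None):
    # First pass: run the speech input through the first automatic speech
    # recognition system.
    first_result = first_asr(first_input)
    if repeated_input is None:
        # No input indicative of a potential error was received.
        return first_result
    # Second pass: the repetition is processed by a second (e.g. more
    # accurate or differently tuned) recognition system.
    return second_asr(repeated_input)
```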
Abstract:
Systems and processes for providing user-specific acoustic models are provided. In accordance with one example, a method includes, at an electronic device having one or more processors, initiating a user-specific acoustic model on the electronic device; receiving a plurality of speech inputs, each of the speech inputs associated with a user of the electronic device; adjusting the user-specific acoustic model based on the plurality of speech inputs; and providing the adjusted user-specific acoustic model to another electronic device. The method further includes, at the other electronic device: receiving the adjusted user-specific acoustic model; receiving a second speech input from a speaker; and identifying, with the adjusted user-specific acoustic model, the speaker of the second speech input as the user.
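The identification step on the receiving device can be sketched as a score-and-threshold check. The abstract does not describe how the adjusted model scores an input, so here the model is a callable returning a match score, and the threshold is an illustrative value.

```python
def identify_speaker(adjusted_model, speech_input, threshold=0.7):
    # adjusted_model(speech_input) returns a match score between the
    # incoming speech and the enrolled user's adjusted acoustic model
    # (illustrative API). A score above the threshold identifies the
    # speaker as the user.
    score = adjusted_model(speech_input)
    return "user" if score > threshold else "unknown speaker"
```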