Invention Grant
- Patent Title: System and method for multi-modal podcast summarization
-
Application No.: US17036136Application Date: 2020-09-29
-
Publication No.: US11295746B2Publication Date: 2022-04-05
- Inventor: Amanmeet Garg , Aneesh Vartakavi , Joshua Ernest Morris
- Applicant: Gracenote, Inc.
- Applicant Address: US CA Emeryville
- Assignee: Gracenote, Inc.
- Current Assignee: Gracenote, Inc.
- Current Assignee Address: US CA Emeryville
- Agency: McDonnell Boehnen Hulbert & Berghoff LLP
- Main IPC: G10L19/00
- IPC: G10L19/00 ; G10L21/00 ; G10L15/26 ; G10L15/02

Abstract:
In one aspect, a method includes receiving podcast content, generating a transcript of at least a portion of the podcast content, and parsing the podcast content to (i) identify audio segments within the podcast content, (ii) determine classifications for the audio segments, (iii) identify audio segment offsets, and (iv) identify sentence offsets. The method also includes based on the audio segments, the classifications, the audio segment offsets, and the sentence offsets, dividing the generated transcript into text sentences and, from among the text sentences of the divided transcript, selecting a group of text sentences for use in generating an audio summary of the podcast content. The method also includes based on timestamps at which the group of text sentences begin in the podcast content, combining portions of audio in the podcast content that correspond to the group of text sentences to generate an audio file representing the audio summary.
Public/Granted literature
- US20220020376A1 System And Method For Multi-Modal Podcast Summarization Public/Granted day:2022-01-20
Information query