Invention Grant
- Patent Title: System and method for multi-modal podcast summarization
-
Application No.: US17677186Application Date: 2022-02-22
-
Publication No.: US12300243B2Publication Date: 2025-05-13
- Inventor: Amanmeet Garg , Aneesh Vartakavi , Joshua Ernest Morris
- Applicant: Gracenote, Inc.
- Applicant Address: US CA Emeryville
- Assignee: Gracenote, Inc.
- Current Assignee: Gracenote, Inc.
- Current Assignee Address: US CA Emeryville
- Agency: McDonnell Boehnen Hulbert & Berghoff LLP
- Main IPC: G10L15/26
- IPC: G10L15/26 ; G10L15/02

Abstract:
In one aspect, a method includes receiving podcast content, generating a transcript of at least a portion of the podcast content, and parsing the podcast content to (i) identify audio segments within the podcast content, (ii) determine classifications for the audio segments, (iii) identify audio segment offsets, and (iv) identify sentence offsets. The method also includes based on the audio segments, the classifications, the audio segment offsets, and the sentence offsets, dividing the generated transcript into text sentences and, from among the text sentences of the divided transcript, selecting a group of text sentences for use in generating an audio summary of the podcast content. The method also includes based on timestamps at which the group of text sentences begin in the podcast content, combining portions of audio in the podcast content that correspond to the group of text sentences to generate an audio file representing the audio summary.
Public/Granted literature
- US20220172726A1 System And Method For Multi-Modal Podcast Summarization Public/Granted day:2022-06-02
Information query