Abstract:
Systems and processes are disclosed for operating a digital assistant for media search and playback. In an exemplary process, an audio input containing a media search request can be received. A primary user intent corresponding to the media search request can be determined, and one or more secondary user intents can be determined based on one or more previous user intents. A primary set of media items corresponding to the primary user intent can be displayed, along with one or more secondary sets of media items corresponding to the one or more secondary user intents.
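The exemplary process above can be illustrated with a minimal sketch. It assumes that an intent can be modeled as a dictionary of attribute constraints and that a secondary intent is formed by merging a previous intent with the primary one. The abstract does not specify either choice, and the names `MediaItem`, `matches`, `secondary_intents`, and `search` are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class MediaItem:
    title: str
    genre: str
    actor: str


def matches(item, intent):
    """True when every constraint in the intent equals the item's attribute."""
    return all(getattr(item, key) == value for key, value in intent.items())


def secondary_intents(primary, previous):
    """Assumed rule: merge each previous intent with the primary intent.

    Constraints from the primary intent win on conflict. Merges that add
    nothing to the primary intent, or that repeat an earlier merge, are skipped.
    """
    combined = []
    for earlier in previous:
        merged = {**earlier, **primary}
        if merged != primary and merged not in combined:
            combined.append(merged)
    return combined


def search(catalog, primary, previous):
    """Return the primary result set and one result set per secondary intent."""
    primary_set = [item for item in catalog if matches(item, primary)]
    secondary_sets = [
        [item for item in catalog if matches(item, intent)]
        for intent in secondary_intents(primary, previous)
    ]
    return primary_set, secondary_sets
```

For instance, if the previous request constrained the actor and the current request constrains the genre, the primary set holds every item of that genre, while the secondary set narrows it to items that also feature that actor.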
Abstract:
Systems and processes are disclosed for real-time updating of virtual assistant media knowledge. Virtual assistant knowledge can be updated with timely information associated with playing media (e.g., a sporting event, a television show, or the like). A data feed can be received that includes data relating events to particular times in a media stream. A user request can be received based on speech input, and the user request can be associated with an event in a media stream or show. In response to receiving the request, the media stream can be cued to commence playback at a time in the media stream associated with the event referred to in the request. In another example, a response to the user request can be generated based on the data relating to the events. The response can then be delivered to the user (e.g., spoken aloud or displayed).
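The cueing behavior described above might be sketched as follows. This is only an illustration: it assumes the data feed is a list of timestamped event descriptions and that a request is matched to an event by simple word overlap, neither of which is stated in the abstract. The names `FeedEvent`, `find_event`, and `cue_time`, and the `lead_in_s` parameter, are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FeedEvent:
    time_s: float      # offset of the event within the media stream, in seconds
    description: str   # short text describing the event


def find_event(feed, request_text):
    """Return the feed event sharing the most words with the request, or None.

    Word overlap is an assumed stand-in for real intent determination.
    Ties are resolved in favor of the earlier entry in the feed.
    """
    request_words = set(request_text.lower().split())
    best, best_score = None, 0
    for event in feed:
        score = len(request_words & set(event.description.lower().split()))
        if score > best_score:
            best, best_score = event, score
    return best


def cue_time(feed, request_text, lead_in_s=5.0):
    """Return the playback position for the requested event, or None if no match.

    Playback starts a few seconds before the event (assumed lead-in) and is
    clamped so it never falls before the start of the stream.
    """
    event = find_event(feed, request_text)
    if event is None:
        return None
    return max(0.0, event.time_s - lead_in_s)
```

The same `find_event` lookup could also feed the other example in the abstract, where the matched event's data is used to generate a spoken or displayed response instead of cueing playback.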
Abstract:
Systems and processes are disclosed for controlling television user interactions using a virtual assistant. A virtual assistant can interact with a television set-top box to control content shown on a television. Speech input for the virtual assistant can be received from a device with a microphone. User intent can be determined from the speech input, and the virtual assistant can execute tasks according to the user's intent, including causing playback of media on the television. Virtual assistant interactions can be shown on the television in interfaces that expand or contract to occupy a minimal amount of space while conveying desired information. Multiple devices associated with multiple displays can be used to determine user intent from speech input as well as to convey information to users. In some examples, virtual assistant query suggestions can be provided to the user based on media content shown on a display.
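As a rough illustration of the final point, query suggestions could be derived from metadata about the content currently on screen. The sketch below assumes the displayed content is described by a dictionary with `type` and `title` keys and that suggestions come from fixed templates per content type. The abstract does not describe this mechanism, and `SUGGESTION_TEMPLATES` and `suggest_queries` are hypothetical names.

```python
# Hypothetical suggestion templates keyed by the type of content on screen.
SUGGESTION_TEMPLATES = {
    "movie": ["Who stars in {title}?", "Play {title}", "Show me more like {title}"],
    "sports": ["What's the score?", "Show me the last goal"],
}


def suggest_queries(displayed, limit=3):
    """Return up to `limit` suggested queries for the displayed content.

    Unknown content types yield no suggestions. A missing title falls back
    to the generic word "this".
    """
    templates = SUGGESTION_TEMPLATES.get(displayed.get("type"), [])
    title = displayed.get("title", "this")
    return [template.format(title=title) for template in templates][:limit]
```

Capping the list with `limit` reflects the abstract's emphasis on interfaces that occupy a minimal amount of screen space.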