DISTRIBUTED MULTI-DEVICE AUDIO CAPTURE IN A SHARED ACOUSTIC ENVIRONMENT

    公开(公告)号:US20240357285A1

    公开(公告)日:2024-10-24

    申请号:US18757676

    申请日:2024-06-28

    CPC classification number: H04R3/005 H04R29/005 H04M3/568

    Abstract: Techniques are provided herein for auto-muting procedures that result in efficient high-quality audio capture in a multi-device environment. In particular, when there are multiple computing devices in a shared meeting room, the microphone with the highest rated audio input is selected for the teleconference audio input from the shared environment. Each computing device connected to the teleconference from the meeting room determines a score for its microphone signal. The score is shared with the other devices in the room, and the microphone signal with the highest score is transmitted to the conference. Host-based systems include a host device receiving and reviewing the scores and determining which microphones to auto-mute. Other distributed systems include each computing device transmitting its score to the other devices and receiving the scores from the other devices, and each device determining whether to auto-mute.

    ENVIRONMENT CLASSIFIER FOR DETECTION OF LASER-BASED AUDIO INJECTION ATTACKS

    公开(公告)号:US20200243067A1

    公开(公告)日:2020-07-30

    申请号:US16849525

    申请日:2020-04-15

    Abstract: Techniques are provided for detection of laser-based audio injection attacks through classification of the acoustic environment. A methodology implementing the techniques according to an embodiment includes broadcasting a reference signal over a loudspeaker into a local environment, and generating a reference model of the local environment based on analysis of a transformed version of that reference signal received through a microphone of the device. The method further includes generating an estimate model based on analysis of a segment of speech in an audio signal received through the microphone. The estimate model is associated with an environment in which the speech was generated. The method further includes calculating a similarity metric (e.g., mathematical distance) between the reference model and the estimate model, and providing warning of a laser-based audio attack if the similarity metric exceeds a threshold value associated with an attack.

    Detection of laser-based audio injection attacks using channel cross correlation

    公开(公告)号:US11961535B2

    公开(公告)日:2024-04-16

    申请号:US16941191

    申请日:2020-07-28

    CPC classification number: G10L25/69 G10L15/22 G10L2015/221 H04R3/005

    Abstract: Techniques are provided for detection of laser-based audio injection attacks. A methodology implementing the techniques according to an embodiment includes calculating cross correlations between signals received from microphones of an array of two or more microphones. The method also includes identifying time delays associated with peaks of the cross correlations, and magnitudes associated with the peaks of the cross correlations. The method further includes calculating a time alignment metric based on the time delays and calculating a similarity metric based on the magnitudes. The method further includes generating a first attack indicator based on a comparison of the time alignment metric to a first threshold and generating a second attack indicator based on a comparison of the similarity metric to a second threshold. The method further includes providing warning of a laser-based audio attack based on the first attack indicator and/or the second attack indicator.

    Ultrasonic attack prevention for speech enabled devices

    公开(公告)号:US10565978B2

    公开(公告)日:2020-02-18

    申请号:US16118719

    申请日:2018-08-31

    Abstract: Techniques are provided for defending against an ultrasonic attack on a speech enabled device. A methodology implementing the techniques according to an embodiment includes detecting voice activity in an audio signal received by the device and generating an ultrasonic jamming signal in response to the detection. The jamming signal is broadcast over a loudspeaker for up to the duration of the detected voice activity to defend against the ultrasonic attack. According to another embodiment, the ultrasonic jamming signal is generated in response to detection of a wake-on-voice key phrase in the received audio signal, and the jamming signal is broadcast over the loudspeaker for a time duration selected to be less than or equal to a time window during which spoken commands are accepted by the device following the wake-on-voice key phrase detection. The jamming signal may include white or colored noise, combinations of tones, and/or a periodic sweep frequency.

    Ultrasonic attack detection employing deep learning

    公开(公告)号:US10957341B2

    公开(公告)日:2021-03-23

    申请号:US16235903

    申请日:2018-12-28

    Abstract: A mechanism, method, and computer readable medium to enhance speech enabled devices. The method comprising receiving, by an ultrasonic attack detector of a speech enabled device, an audio stream from one or more microphones and a segmentation signal from a keyword detector indicating a location of a detected keyword within the audio stream, preprocessing, by the ultrasonic attack detector, a segmented portion of the audio stream including the detected keyword to obtain a spectrogram, and executing, by the ultrasonic attack detector, a neural network classifier using the spectrogram as input, the neural network classifier to discern real human speech from intermodulation distortion products resulting from ultrasonic attacks on the speech enabled device.

Patent Agency Ranking