-
Publication No.: US11575990B2
Publication Date: 2023-02-07
Application No.: US15583189
Filing Date: 2017-05-01
Applicant: Cerence Operating Company
Inventor: Tobias Herbig , Markus Buck , Meik Pfeffinger
IPC: H04R3/12 , H04M9/08 , H04R1/40 , H04R3/02 , H04R3/00 , G10L21/0208 , G10L25/84 , H04R27/00 , H04R5/02
Abstract: A communication system supports communication paths within an environment by receiving speech signals of a speaker and playing them back for one or more listeners. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in an environment having multiple acoustic zones includes a plurality of microphone communication instances coupled and a plurality of loudspeaker instances.
-
Publication No.: US11798576B2
Publication Date: 2023-10-24
Application No.: US16671830
Filing Date: 2019-11-01
Applicant: Cerence Operating Company
Inventor: Tobias Herbig , Meik Pfeffinger , Bernd Iser
IPC: G10L21/02 , G10L21/034 , H03G3/32 , G10L21/0232 , H04R3/00 , H03G3/30 , G10L25/81 , H04R3/12 , G10L25/78 , G10L21/0208 , G10L21/0324 , H04R27/00
CPC classification number: G10L21/034 , G10L21/02 , G10L21/0232 , G10L25/81 , H03G3/3089 , H03G3/32 , H04R3/00 , H04R3/005 , H04R3/12 , G10L21/0208 , G10L21/0324 , G10L25/78 , H04R27/00 , H04R2430/01 , H04R2499/13
Abstract: Methods and apparatus for a communication system having microphones and loudspeakers to determine a noise and speech level estimate for a transformed signal, determine a SNR from the noise and speech level estimates, and determine a gain for the transformed signal to achieve a selected SNR range at a given position. In one embodiment, the gain is determined by adapting an actual gain to follow a target gain, wherein the target gain is adjusted to achieve the selected SNR range.
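The gain adaptation described in this abstract can be illustrated with a minimal sketch. It is not the patented method: the function name `update_gain_db`, the dB level model (played-back speech level rises with the gain, noise at the listener position does not), the SNR range of 10 to 20 dB, and the 0.5 dB step limit are all assumptions made for illustration.

```python
def update_gain_db(gain_db, speech_db, noise_db,
                   snr_range_db=(10.0, 20.0), step_db=0.5):
    """One adaptation step: move the actual gain toward a target gain.

    Hypothetical model: the SNR at the listener position is
    speech level + gain - noise level, all in dB.
    """
    low, high = snr_range_db
    snr_db = speech_db + gain_db - noise_db

    # Adjust the target gain only when the SNR leaves the selected range.
    if snr_db < low:
        target_db = gain_db + (low - snr_db)
    elif snr_db > high:
        target_db = gain_db - (snr_db - high)
    else:
        target_db = gain_db

    # The actual gain follows the target with a bounded step per update.
    delta = max(-step_db, min(step_db, target_db - gain_db))
    return gain_db + delta
```

Calling the function once per frame with fresh level estimates lets the gain settle at the edge of the selected range instead of jumping there in one update.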
-
Publication No.: US11176957B2
Publication Date: 2021-11-16
Application No.: US16638866
Filing Date: 2017-08-17
Applicant: Cerence Operating Company
Inventor: Simon Graf , Tobias Herbig , Markus Buck
IPC: G10L21/02 , G10L21/013 , G10L21/034 , G10L25/18 , G10L25/84 , G10L25/90
Abstract: A low-complexity method and apparatus for detection of voiced speech and pitch estimation is disclosed that is capable of dealing with special constraints given by applications where low latency is required, such as in-car communication (ICC) systems. An example embodiment employs very short frames that may capture only a single excitation impulse of voiced speech in an audio signal. A distance between multiple such impulses, corresponding to a pitch period, may be determined by evaluating phase differences between low-resolution spectra of the very short frames. An example embodiment may perform pitch estimation directly in a frequency domain based on the phase differences and reduce computational complexity by obviating transformation to a time domain to perform the pitch estimation. In an event the phase differences are determined to be substantially linear, an example embodiment enhances voice quality of the voiced speech by applying speech enhancement to the audio signal.
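The idea of reading a pitch period from phase differences between spectra of very short frames can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not the disclosed implementation: the helper names, the naive DFT, the linearity measure, and the 0.9 threshold are hypothetical. It relies only on the fact that a time shift of `d` samples between two frames appears as a phase difference that is linear in frequency, with a slope of `-2*pi*d/n` per bin.

```python
import cmath
import math


def half_spectrum(frame):
    """Naive DFT of a short frame, bins 0..n/2 (low resolution by design)."""
    n = len(frame)
    return [sum(x * cmath.exp(-2j * math.pi * k * t / n)
                for t, x in enumerate(frame))
            for k in range(n // 2 + 1)]


def phase_slope_offset(frame_a, frame_b):
    """Estimate the impulse offset (in samples) of frame_b relative to frame_a.

    Returns (offset, linearity); linearity is close to 1 when the phase
    difference is nearly linear over frequency. Unambiguous for |offset| < n/2.
    """
    n = len(frame_a)
    spec_a, spec_b = half_spectrum(frame_a), half_spectrum(frame_b)
    cross = [b * a.conjugate() for a, b in zip(spec_a, spec_b)]
    # Bin-to-bin phase increments of the cross-spectrum; no unwrapping needed.
    steps = [cross[k + 1] * cross[k].conjugate() for k in range(len(cross) - 1)]
    acc = sum(steps)
    norm = sum(abs(s) for s in steps)
    offset = -cmath.phase(acc) * n / (2 * math.pi)
    linearity = abs(acc) / norm if norm > 0 else 0.0
    return offset, linearity


def pitch_period(frame_a, frame_b, hop, min_linearity=0.9):
    """Pitch period in samples, or None if the phase is not linear enough.

    hop is the distance in samples between the two frame starts (assumed known).
    """
    offset, linearity = phase_slope_offset(frame_a, frame_b)
    return hop + offset if linearity >= min_linearity else None
```

The estimate stays in the frequency domain throughout, which mirrors the abstract's point about avoiding a transformation back to the time domain. The linearity value could also serve as a simple voicing indicator, though that use is likewise an assumption here.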
-
Publication No.: US11950067B2
Publication Date: 2024-04-02
Application No.: US18105979
Filing Date: 2023-02-06
Applicant: Cerence Operating Company
Inventor: Tobias Herbig , Markus Buck , Meik Pfeffinger
IPC: H04R3/12 , G10L21/0208 , G10L25/84 , H04M9/08 , H04R1/40 , H04R3/00 , H04R3/02 , H04R27/00 , H04R5/02
CPC classification number: H04R3/12 , G10L21/0208 , G10L25/84 , H04M9/082 , H04R1/406 , H04R3/005 , H04R3/02 , H04R27/00 , G10L2021/02082 , H04R5/02 , H04R2201/403 , H04R2227/001 , H04R2227/009 , H04R2410/05 , H04R2420/01 , H04R2499/13
Abstract: An In-Car Communication (ICC) system supports the communication paths within a car by receiving the speech signals of a speaking passenger and playing them back for one or more listening passengers. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in a vehicle having multiple acoustic zones includes a plurality of microphone In-Car Communication (Mic-ICC) instances coupled and a plurality of loudspeaker In-Car Communication (Ls-ICC) instances. The system further includes a dynamic audio routing matrix with a controller and coupled to the Mic-ICC instances, a mixer coupled to the plurality of Mic-ICC instances and a distributor coupled to the Ls-ICC instances.
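A routing matrix that mixes Mic-ICC outputs and distributes the result to Ls-ICC instances can be pictured with a small sketch. The function name `route_zones`, the per-sample list representation, and the example three-zone layout are hypothetical; the abstract does not specify how the matrix is stored or updated.

```python
def route_zones(mic_outputs, routing):
    """Mix talker-zone signals into one feed per listener zone.

    mic_outputs: one sample per Mic-ICC instance (talker zone).
    routing: routing[listener][talker] is the gain from talker to listener;
             a controller would update these entries at run time (assumed).
    """
    return [sum(gain * sample for gain, sample in zip(row, mic_outputs))
            for row in routing]


# Hypothetical layout: zone 0 = driver, zone 1 = second row, zone 2 = third row.
# The driver is played back in both rear rows; the third row also hears itself
# at no other zone, and nothing is routed back to the driver.
ROUTING = [
    [0.0, 0.0, 0.0],
    [1.0, 0.0, 0.0],
    [1.0, 0.0, 1.0],
]
```

Each row acts as the mixer for one listener zone, and the list of rows as a whole plays the role of the distributor feeding the Ls-ICC instances.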
-
Publication No.: US20230209260A1
Publication Date: 2023-06-29
Application No.: US18105979
Filing Date: 2023-02-06
Applicant: Cerence Operating Company
Inventor: Tobias Herbig , Markus Buck , Meik Pfeffinger
CPC classification number: H04R3/12 , H04M9/082 , H04R1/406 , H04R3/02 , H04R3/005 , G10L21/0208 , G10L25/84 , H04R27/00 , H04R2201/403 , H04R2227/001 , H04R2227/009 , H04R2410/05 , H04R2420/01 , H04R2499/13 , H04R5/02
Abstract: An In-Car Communication (ICC) system supports the communication paths within a car by receiving the speech signals of a speaking passenger and playing them back for one or more listening passengers. Signal processing tasks are split into a microphone related part and into a loudspeaker related part. A sound processing system suitable for use in a vehicle having multiple acoustic zones includes a plurality of microphone In-Car Communication (Mic-ICC) instances coupled and a plurality of loudspeaker In-Car Communication (Ls-ICC) instances. The system further includes a dynamic audio routing matrix with a controller and coupled to the Mic-ICC instances, a mixer coupled to the plurality of Mic-ICC instances and a distributor coupled to the Ls-ICC instances.
-
Publication No.: US20210134311A1
Publication Date: 2021-05-06
Application No.: US16638866
Filing Date: 2017-08-17
Applicant: Cerence Operating Company
Inventor: Simon Graf , Tobias Herbig , Markus Buck
IPC: G10L21/013 , G10L25/84 , G10L25/90 , G10L25/18 , G10L21/034
Abstract: A low-complexity method and apparatus for detection of voiced speech and pitch estimation is disclosed that is capable of dealing with special constraints given by applications where low latency is required, such as in-car communication (ICC) systems. An example embodiment employs very short frames that may capture only a single excitation impulse of voiced speech in an audio signal. A distance between multiple such impulses, corresponding to a pitch period, may be determined by evaluating phase differences between low-resolution spectra of the very short frames. An example embodiment may perform pitch estimation directly in a frequency domain based on the phase differences and reduce computational complexity by obviating transformation to a time domain to perform the pitch estimation. In an event the phase differences are determined to be substantially linear, an example embodiment enhances voice quality of the voiced speech by applying speech enhancement to the audio signal.
-
Publication No.: US20240062770A1
Publication Date: 2024-02-22
Application No.: US18386825
Filing Date: 2023-11-03
Applicant: Cerence Operating Company
Inventor: Tobias Herbig , Stefan Richardt
IPC: G10L21/0364 , G10L25/18 , H03G9/02
CPC classification number: G10L21/0364 , G10L25/18 , H03G9/025
Abstract: Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications.
-
Publication No.: US11817115B2
Publication Date: 2023-11-14
Application No.: US16099941
Filing Date: 2016-09-01
Applicant: Cerence Operating Company
Inventor: Tobias Herbig , Stefan Richardt
IPC: G10L21/0364 , G10L25/18 , H03G9/02
CPC classification number: G10L21/0364 , G10L25/18 , H03G9/025
Abstract: Methods and systems for deessing of speech signals are described. A deesser of a speech processing system includes an analyzer configured to receive a full spectral envelope for each time frame of a speech signal presented to the speech processing system, and to analyze the full spectral envelope to identify frequency content for deessing. The deesser also includes a compressor configured to receive results from the analyzer and to spectrally weight the speech signal as a function of results of the analyzer. The analyzer can be configured to calculate a psychoacoustic measure from the full spectral envelope, and may be further configured to detect sibilant sounds of the speech signal using the psychoacoustic measure. The psychoacoustic measure can include, for example, a measure of sharpness, and the analyzer may be further configured to calculate deesser weights based on the measure of sharpness. An example application includes in-car communications.
-
Publication No.: US11600269B2
Publication Date: 2023-03-07
Application No.: US16308849
Filing Date: 2016-06-15
Applicant: Cerence Operating Company
Inventor: Meik Pfeffinger , Timo Matheja , Tobias Herbig , Tim Haulick
Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.
-
Publication No.: US10783899B2
Publication Date: 2020-09-22
Application No.: US16073740
Filing Date: 2016-11-18
Applicant: Cerence Operating Company
Inventor: Simon Graf , Tobias Herbig , Markus Buck
IPC: G10L21/0232 , G10L25/84 , G10L21/0208 , G10L21/0216 , G10L15/22 , G10L25/21 , G10L25/93
Abstract: Systems and methods are introduced to perform noise suppression of an audio signal. The audio signal includes foreground speech components and background noise. The foreground speech components correspond to speech from a user speaking into an audio receiving device. The background noise includes babble noise that includes speech from one or more interfering speakers. A soft speech detector determines, dynamically, a speech detection result indicating a likelihood of a presence of the foreground speech components in the audio signal. The speech detection result is employed to control, dynamically, an amount of attenuation of the noise suppression to reduce the babble noise in the audio signal. Further processing achieves a more stationary background and reduction of musical tones in the audio signal.
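One way to picture a soft speech detection result steering the amount of attenuation is a suppression gain whose floor depends on the speech likelihood. The sketch below is an assumption-laden illustration, not the claimed method: the Wiener-style gain rule, the function name `suppression_gain`, and the floor values of -6 dB and -18 dB are all hypothetical.

```python
def suppression_gain(snr_linear, speech_prob,
                     floor_speech_db=-6.0, floor_noise_db=-18.0):
    """Per-band suppression gain with a likelihood-controlled floor.

    snr_linear: estimated signal-to-noise power ratio for the band.
    speech_prob: soft detector output in [0, 1] for foreground speech.
    """
    # Interpolate the maximum attenuation: mild when foreground speech is
    # likely, strong when only background (for example babble) is expected.
    floor_db = speech_prob * floor_speech_db + (1.0 - speech_prob) * floor_noise_db
    floor = 10.0 ** (floor_db / 20.0)

    # Wiener-style gain, limited from below by the floor.
    wiener = snr_linear / (1.0 + snr_linear)
    return max(wiener, floor)
```

Because the floor moves smoothly with `speech_prob` rather than switching on a hard decision, the attenuation changes gradually as the detector's confidence changes.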
-