-
公开(公告)号:EP1309964A4
公开(公告)日:2007-04-18
申请号:EP01951885
申请日:2001-07-12
Applicant: IBM
Inventor: CHAZAN DAN , ZIBULSKI MEIR , HOORY RON
CPC classification number: G10L25/90
Abstract: A method for estimating a pitch frequency of an audio signal includes computing a first transform of the signal to a frequency domain over a first time interval, and computing a second transform of the signal to the frequency domain over a second time interval, which contains the first time interval. A line spectrum of the signal is found, based on the first and second transforms, the spectrum including spectral lines having respective line amplitudes and line frequencies. A utility function (130) that is periodic in the frequencies of the lines in the spectrum is then computed. This function is indicative, for each candidate pitch frequency in a given pitch frequency range, of a compatibility of the spectrum with the candidate pitch frequency. The pitch frequency of the speech signal is estimated responsive to the utility function.
-
公开(公告)号:WO0207363A3
公开(公告)日:2002-05-16
申请号:PCT/IL0100644
申请日:2001-07-12
Applicant: IBM , CHAZAN DAN , ZIBULSKI MEIR , HOORY RON
Inventor: CHAZAN DAN , ZIBULSKI MEIR , HOORY RON
CPC classification number: G10L25/90
Abstract: A method for estimating a pitch frequency of an audio signal includes computing a first transform of the signal to a frequency domain over a first time interval (42), and computing a second transform of the signal of the frequency domain over a second time interval (44), which contains the first time interval. A line spectrum of the signal is found, based on the first and second transforms, the spectrum including spectral lines having respective line amplitudes and line frequencies. A utility function (130) that is periodic in the frequencies of the lines in the spectrum is then computed. This function is indicative (158), for each candidate pitch frequency in a given pitch frequency range, of a compatibility of the spectrum with the candidate pitch frequency. The pitch frequency of the speech signal is estimated responsive to the utility function (176, 178).
Abstract translation: 一种用于估计音频信号的音调频率的方法包括:在第一时间间隔(42)上计算信号到频域的第一变换(42),以及在第二时间间隔上计算频域信号的第二变换( 44),其包含第一时间间隔。 基于第一和第二变换,发现包括具有各自线路幅度和线路频率的谱线的频谱的信号线谱。 然后计算在频谱中的线的频率中周期性的效用函数(130)。 对于给定音调频率范围内的每个候选音调频率,该功能指示(158)频谱与候选音调频率的兼容性。 响应于效用函数来估计语音信号的音调频率(176,178)。
-
公开(公告)号:GB2498042A
公开(公告)日:2013-07-03
申请号:GB201220270
申请日:2012-11-12
Applicant: IBM
Inventor: HOORY RON , NAHAMOO DAVID , SICCONI ROBERTO , CONNELL JONATHAN HUDSON II , BEN-DAVID SHAY
Abstract: A facilities access system comprises a mobile device to request access; the gathering multifactor biometric data (such as voiceprint, fingerprint, face, iris, multibiometrics etc.) to authenticate the user; fixed sensor/s located in/near the facility; the validating the mobile and fixed-sensor data; and the granting of access if successful. The fixed-sensor data may also validate that the request was made in the vicinity of the facility, possibly by content of the access request, or by use of an outgoing challenge followed by receipt of challenge confirmation. The position of the mobile device may be determined in order to select the closest fixed sensor device. The authentication process may be conducted on the mobile device or at a remote server; whilst the fixed-sensors may be used to determine whether a user is present or absent at the facility. The fixed-sensor may act to confirm the same biometric factor as the mobile device.
-
公开(公告)号:CA2613154A1
公开(公告)日:2007-01-18
申请号:CA2613154
申请日:2006-05-12
Applicant: IBM
Inventor: AZULAI OPHIR , HOORY RON , SIVAN ZOHAR
IPC: G10L15/18
Abstract: A method for querying an electronic dictionary using letters of an alphabet enunciated by a user includes accepting a speech input from the user. The speech input includes a sequence of spelled letters enunciated by the user that spell a query word. The speech input is analyzed to determine one or mo re sequences of the letters that approximate the sequence of spelled letters. T he one or more sequences of the letters are post-processed so as to produce a plurality of recognized words approximating the query word. The electronic dictionary is queried with the plurality of recognized words so as to retrie ve a respective plurality of dictionary entries. A list of results including th e plurality of recognized words and the respective plurality of dictionary entries is presented to the user.
-
公开(公告)号:GB2506278A
公开(公告)日:2014-03-26
申请号:GB201316988
申请日:2012-03-13
Applicant: IBM
Inventor: KONS ZVI , HOORY RON , NAHAMOO DAVID , BEN-DAVID SHAY
IPC: G10L21/003 , G10L19/018
Abstract: Method, system, and computer program product for voice transformation are provided. The method includes transforming a source speech using transformation parameters, and encoding information on the transformation parameters in an output speech using steganography, wherein the source speech can be reconstructed using the output speech and the information on the transformation parameters. A method for reconstructing voice transformation is also provided including: receiving an output speech of a voice transformation system wherein the output speech is transformed speech which has encoded information on the transformation parameters using steganography; extracting the information on the transformation parameters; and carrying out an inverse transformation of the output speech to obtain an approximation of an original source speech.
-
公开(公告)号:DE102012220130A1
公开(公告)日:2013-05-23
申请号:DE102012220130
申请日:2012-11-06
Applicant: IBM
Inventor: BEN-DAVID SHAY , CONNELL JONATHAN HUDSON , HOORY RON , NAHAMOO DAVID , SICCONI ROBERTO
IPC: G06F21/32
Abstract: Ein Verfahren, ein System und ein Computerprogrammprodukt zum Zugang zu sicheren Einrichtungen werden bereitgestellt. Das Verfahren kann beinhalten: Empfangen einer Zugangsanfrage zu einer sicheren Einrichtung von einer Mobileinheit; Authentifizieren eines Benutzers mittels biometrischer Authentifizierung mit mehreren Faktoren mit Daten von der Mobileinheit; Erhalten von Daten von einer oder mehreren ortsfesten Sensoreinheiten an einem Standort in räumlicher Nähe der sicheren Einrichtung; Querprüfen der Daten von der Mobileinheit mit Daten von der einen oder den mehreren ortsfesten Sensoreinheiten; und Gewähren des Zugangs zur sicheren Einrichtung, wenn die Authentifizierung des Benutzers und die Querprüfung erfolgreich sind. Beim Querprüfen kann mithilfe von Daten von der einen oder den mehreren ortsfesten Sensoreinheiten ermittelt werden, ob die Zugangsanfrage von der Mobileinheit in der Nähe der sicheren Einrichtung erfolgt. Das Verfahren kann beinhalten: Erhalten von Daten von einer oder mehreren ortsfesten Sensoreinheiten und Verwenden der Daten, um Authentifizierungsdaten bereitzustellen; und Querprüfen einiger der Authentifizierungsdaten von der Mobileinheit mit einigen der Authentifizierungsdaten von der einen oder den mehreren ortsfesten Sensoreinheiten.
-
公开(公告)号:DE60136716D1
公开(公告)日:2009-01-08
申请号:DE60136716
申请日:2001-07-12
Applicant: IBM
Inventor: CHAZAN DAN , ZIBULSKI MEIR , HOORY RON
IPC: G10L25/90
Abstract: A method for estimating a pitch frequency of an audio signal includes computing a first transform of the signal to a frequency domain over a first time interval, and computing a second transform of the signal to the frequency domain over a second time interval, which contains the first time interval. A line spectrum of the signal is found, based on the first and second transforms, the spectrum including spectral lines having respective line amplitudes and line frequencies. A utility function that is periodic in the frequencies of the lines in the spectrum is then computed. This function is indicative, for each candidate pitch frequency in a given pitch frequency range, of a compatibility of the spectrum with the candidate pitch frequency. The pitch frequency of the speech signal is estimated responsive to the utility function.
-
公开(公告)号:GB2357231A
公开(公告)日:2001-06-13
申请号:GB0023864
申请日:2000-09-29
Applicant: IBM
Inventor: HOORY RON , CHAZAN DAN , SILVERA EZRA , ZILBULSKI MEIR
Abstract: In a method for encoding a digitized speech signal so as to generate data capable of being decoded as speech, a digitized speech signal is first converted to a series of feature vectors by deriving at successive instances of time, e.g. using ABS and Mel-Binning unit 32, an estimate of the spectral envelope of the digitized speech signal and multiplying each estimate of the spectral envelope by a predetermined set of frequency domain window functions, wherein each window occupies a narrow range of frequencies. The integrals thereof are computed and they or a set of predetermined functions thereof are assigned to respective components of a corresponding feature vector in the series of feature vectors. For each instance of time a respective pitch value of the digitized speech signal is computed at 34,35, and successive acoustic vectors each containing the respective pitch value and feature vector are compressed so as to derive therefrom a bit stream. A suitable decoder reverses the operation so as to extract the features vectors and pitch values, thus allowing speech reproduction and playback. In addition, speech recognition is possible using the decompressed feature vectors, with no impairment of the recognition accuracy and no computational overhead.
-
公开(公告)号:DE102012220130B4
公开(公告)日:2019-04-04
申请号:DE102012220130
申请日:2012-11-06
Applicant: IBM
Inventor: BEN-DAVID SHAY , CONNELL JONATHAN HUDSON , HOORY RON , NAHAMOO DAVID , SICCONI ROBERTO
IPC: G06F21/32
Abstract: Ein Verfahren, ein System und ein Computerprogrammprodukt zum Zugang zu sicheren Einrichtungen werden bereitgestellt. Das Verfahren kann beinhalten: Empfangen einer Zugangsanfrage zu einer sicheren Einrichtung von einer Mobileinheit; Authentifizieren eines Benutzers mittels biometrischer Authentifizierung mit mehreren Faktoren mit Daten von der Mobileinheit; Erhalten von Daten von einer oder mehreren ortsfesten Sensoreinheiten an einem Standort in räumlicher Nähe der sicheren Einrichtung; Querprüfen der Daten von der Mobileinheit mit Daten von der einen oder den mehreren ortsfesten Sensoreinheiten; und Gewähren des Zugangs zur sicheren Einrichtung, wenn die Authentifizierung des Benutzers und die Querprüfung erfolgreich sind. Beim Querprüfen kann mithilfe von Daten von der einen oder den mehreren ortsfesten Sensoreinheiten ermittelt werden, ob die Zugangsanfrage von der Mobileinheit in der Nähe der sicheren Einrichtung erfolgt. Das Verfahren kann beinhalten: Erhalten von Daten von einer oder mehreren ortsfesten Sensoreinheiten und Verwenden der Daten, um Authentifizierungsdaten bereitzustellen; und Querprüfen einiger der Authentifizierungsdaten von der Mobileinheit mit einigen der Authentifizierungsdaten von der einen oder den mehreren ortsfesten Sensoreinheiten.
-
公开(公告)号:IL135192A
公开(公告)日:2004-06-20
申请号:IL13519200
申请日:2000-03-21
Applicant: IBM
Inventor: CHAZAN DAN , COHEN GILAD , HOORY RON
Abstract: A method for speech synthesis includes receiving an input speech signal containing a set of speech segments, and estimating spectral envelopes of the input speech signal in a succession of time intervals during each of the speech segments. The spectral envelopes are integrated over a plurality of window functions in a frequency domain so as to determine elements of feature vectors corresponding to the speech segments. An output speech signal is reconstructed by concatenating the feature vectors corresponding to a sequence of the speech segments.
-
-
-
-
-
-
-
-
-