-
公开(公告)号:DE1472004A1
公开(公告)日:1969-10-09
申请号:DE1472004
申请日:1965-06-03
Applicant: IBM
Inventor: KENNEDY FRENCH WALTER , WOODROW JOHNSON JUN OLIVER
Abstract: 1,068,282. Speech waveform modification. INTERNATIONAL BUSINESS MACHINES CORPORATION. May 14, 1965 [June 9, 1964], No. 20363/65. Heading H4R. The time duration of an audio signal is modified, e.g. to make speech samples from different sources sound as if from the same source, by adjusting the lengths of the pitch periods of the speech samples to a common length, discontinuities due to amplitude differences between the end of an adjusted pitch period and the beginning of the following period being eliminated by adding to the adjusted pitch period signal a " ramp " signal having an amplitude of zero at the commencement of the pitch period and an amplitude equal to the amplitude difference at the end of the adjusted pitch period. The actual pitch period of the samples is determined by measuring the time of occurrence of the maximum peak to peak excursions of the speech waveform during time intervals assessed by a rough determination of the pitch period. Figs. 3A, 3B and 3C show an embodiment in which a sample of speech from a source 2 is applied via a sampling switch 4 to a store 8 in which the speech sample circulates, together with a synchronizing pulse from single shot circuit 6 marking the start of the speech sample. On each repetition of the speech sample the synchronizing pulse is applied to reset the counter 22 which during the repeat of the sample provides a time scale by counting the output of oscillator 24. The speech sample is applied to a voicing detector 26, which produces a pulse at the beginning of a voiced sound, and to a conventional form of pitch extractor 10 to 16, which produces a count in counter 18 corresponding to the approximate pitch period. The pulse from the voicing detector is applied to gate 28 to gate a count, corresponding to the start of the voiced speech, from counter 22 into the register 30. In addition, this count is fed from gate 28 to an " ADD " circuit 34 which is also fed with the count from counter 18, corresponding to the pitch period, and the resulting count is fed into register 36. During the following cycles the counts in registers 30 and 36 are compared in comparators 38 and 40 with the count from counter 22 and signals are produced to trigger the bi-stable 42 to produce an output on lead Q which is positive during a period from the commencement of voiced signal to a time approximately one pitch period later. During this time speech is fed via gate 44 to the positive and negative peak detectors 64 and 66 which feed the values of the respective peaks to the gates 56, 58, 60 and 62. Initially, the synch. pulse sets bi-stable 72 so that the output 1a is energized and therefore the first positive and negative peak values are fed respectively via gates 56 and 58 to hold circuits 46 and 48, the outputs from which are fed to a differential amplifier 90 to obtain a signal representative of the first peak to peak excursion of the waveform, which is applied via an inverter 92 to adder 94, in addition the time of occurrence of the positive peak is fed via gate 74 into the register 68. Since no input has yet been applied to gates 60 and 62 the output of differential amplifier 96 is zero and the output of adder 94 is therefore negative and passes via gate 98 and gate 100, operated by the delayed negative peak, to trigger bi-stable 72 so that output 1a is removed and 2a is energized so that the following positive and negative peak values are fed via gates 60 and 62 to hold circuits 50 and 52 and differential amplifier 96, while the time of occurrence of the positive peak is fed into register 70 via gate 104. The outputs of amplifiers 90 and 96 are then compared and depending on the relative values either a positive or negative output results from adder 94 which is fed via gates 98 or 108 and gate 100 to trigger bi-stable 72 into such a condition that the following pair of positive and negative peaks is fed in via gates 56 and 58 or gates 60 and 62 to replace the values in the hold circuits corresponding to the smaller peak to peak swing. The process is repeated during the remaining duration of the Q signal discarding always the smaller of the two peak to peak swings being compared until at the end of the Q period the negative going signal detector 54 is energized to apply an output which is gated through the appropriate one of gates 110 and 112 to feed the output of the register 68 or 70, holding the position of the maximum peak to peak swing, into the computer 3. The computer takes the count corresponding to the maximum peak to peak value and adds to that a count corresponding to half a pitch period as stored in counter 18, and one and a half pitch periods, and the resulting values are fed-in to replace the counts stored in registers 30 and 36 respectively. The determination of the maximum peak to peak swing is then carried out, as before, for the interval between the counts now stored in registers 30 and 36 to determine the position of the next pitch pulse. In a similar fashion the positions of the maximum peak to peak swings of the speech waveform is determined for the remainder of the speech sample stored in the circulating store 8 and these values are stored in the computer 3. In order to adjust the pitch cycles to the required length the speech is fed to gates 122 and 140. Each pitch period is adjusted in length during a cycle of operations which entails two repeats of the speech sample from store 8. During the first repeat a pitch pulse from computer 3 on line 126 triggers bi-stable 124 to allow speech to pass through gate 122 to gate 134. At the end of the delay time produced by delay 128, which is equal to the desired pitch period and is equal to or shorter than any actual pitch period in the sample, bi-stable 124 is reset to inhibit gate 122 and gate 134 is operated to apply the voltage value existing at the end of the modified pitch period to the hold circuit 136 where it is stored. The following pitch pulse on line 138 operates gate 140 to feed to hold circuit 142 the voltage value of the speech signal at the beginning of the next pitch period. The two signals from stores 136 and 142 are applied to a differential amplifier 144 to obtain a signal representing the error between the amplitude of the signal at the end of the length modified pitch period and the amplitude of the signal at the commencement of the following pitch period, this signal being applied to the input of the integrating amplifier circuit 146. During the second repetition of the period being modified the gate 122 gates through the speech signal for the modified length period to one input of " add " circuit 130, in addition, the output of bi-stable 124 opens switch 154 on the output of integrating amplifier 146 for the duration of the modified pitch period so that the output of this amplifier consists of a ramp waveform which is zero at the beginning of the pitch period and has a value equal to the output of differential amplifier 144 at the end of the modified pitch period, this signal is applied to the other input of " add " circuit 130 to be added to the modified length speech waveform sample so that the resulting signal will be continuous in amplitude with the following sample starting at the following pitch pulse. The output of adder 130 is converted to digital form in the analogue to digital converter 132 so that it may be stored in computer 3 to await the following length modified pitch periods of the speech sample which will be processed in a similar way on subsequent cycles of the circulating store 8.
-
公开(公告)号:DE1623193A1
公开(公告)日:1970-11-05
申请号:DE1623193
申请日:1967-03-28
Applicant: IBM
Inventor: KENNEDY FRENCH WALTER
-
公开(公告)号:DE1591229B1
公开(公告)日:1971-02-25
申请号:DE1591229
申请日:1967-12-15
Applicant: IBM
Inventor: KENNEDY FRENCH WALTER
IPC: H04L27/14 , H04L27/144 , G01R23/02
Abstract: 1,142,426. Decoders. INTERNATIONAL BUSINESS MACHINES CORP. 5 Oct., 1967 [6 Jan., 1967], No.45429/67. Heading G4H. The frequency of an input signal f(t) which may have any one of n predetermined frequencies is determined by mixing the input signal with n-1 reference frequencies and feeding the resultant n-1 beat signals to associated low pass filters which each generate an output signal when the input signal f(t) is one of the pair of frequencies adjacent the associated reference signal. The combination of outputs from the filters represents the frequency of the input signal. In the embodiment described in which the input signal may have one of three frequencies the outputs from the filters 25, 26 are fed to detectors 27, 28 which generate a " 1 " signal in response to an output signal from the associated filter, the outputs from the detectors being fed to AND gates 31, 32, 33 and inverters 29, 30 so that a signal is generated on one of three lines 1T, 2T, 3T to control a device 34.
-
公开(公告)号:DE1298322B
公开(公告)日:1969-06-26
申请号:DEI0032500
申请日:1966-12-14
Applicant: IBM
Inventor: KENNEDY FRENCH WALTER
IPC: G06K9/00
Abstract: 1,117,069. Pattern recognition. INTERNATIONAL BUSINESS MACHINES CORPORATION. 15 Dec., 1966 [29 Dec., 1965], No. 56105/66. Heading G4R. The average values of the co-ordinates of a plurality of points related in a predetermined manner to a line pattern are obtained. During a first scan of a fingerprint 40 by a curve follower 30 (Fig. 2A), the X and Y coordinates of successive points on the ridges of the fingerprint are obtained as continuous waveforms (though digitization is possible as an alternative) and used by differentiators 46, 48 and divider 50 to calculate dY/dX. This is used by the circuitry in the top quarter of Fig. 2B to calculate the radius of curvature R, and also by sin A and cos A generators 76, 78 (tan A#dY/dX), the outputs of which are multiplied 72, 74 by R, the resulting products being added 84, 86 to X and Y respectively to give the co-ordinates X 0 , Y 0 of the centre of curvature. Whenever R is less than a preset maximum, as determined by comparator 70, X 0 and Y 0 are gated 88, 90 to respective integrators 104, 106. At the end of the scan the integrator contents, constituting the coordinates of the centre of the fingerprint, are stored at 116, 118. Two or more further scans now follow to obtain improved values for the centre co-ordinates, since the fingerprint may be multilated. Each further scan is like the first except that a further requirement for X 0 , Y 0 to be gated to the integrators 104, 106 is imposed, viz. that X and Y must be within a rectangular (or square) area centred on the centre obtained by the previous scan, the circuitry in the bottom three quarters of Fig. 2B being provided for this purpose. #X and #Y generators 134, 136 are preset with half the dimensions of the rectangle required. In the case of a square they could be combined. A circular area may be used instead, it is mentioned. The centre co-ordinates obtained in the last scan are used in recognition means. It is mentioned that tangent line intersections may be used instead of centres of curvature.
-
公开(公告)号:DE1623203A1
公开(公告)日:1970-12-17
申请号:DE1623203
申请日:1967-09-20
Applicant: IBM
Inventor: KENNEDY FRENCH WALTER
-
公开(公告)号:DE1522128A1
公开(公告)日:1969-08-07
申请号:DE1522128
申请日:1967-01-07
Applicant: IBM
Inventor: KENNEDY FRENCH WALTER , WILLIAM BAUER EDWIN
-
公开(公告)号:DE1791072A1
公开(公告)日:1971-10-21
申请号:DE1791072
申请日:1968-09-06
Applicant: IBM
Inventor: ABRAMSON PAUL , KENNEDY FRENCH WALTER
IPC: G01R19/22
Abstract: 1,175,948. Detector circuits. INTERNATIONAL BUSINESS MACHINES CORP. 27 Aug., 1968 [14 Sept., 1967], No. 40820/68. Heading H3T. A signal detection circuit comprises means for chopping an input signal to produce a sequence of discrete pulses, means for differentiating the pulses to produce a sequence of spike pulses and means for smoothing the latter to produce a unipolar output signal having an amplitude level representative of the amplitude of the input signal. A signal A, Fig. 2, is applied over capacitor 10, Fig. 1, to the source 14 of field effect transistor, the gate 18 of which is driven with a square waveform B, to produce a chopped waveform C at the drain. This is differentiated in differentiating circuit comprising capacitor 24 and resistor 26 to produce the spike waveform D. This is applied to the base of transistor 28, the negative-going spikes causing conduction in the transistor, the collector potential of which is smoothed by means of R, C circuit 30, 32 to produce at terminal 34 a relatively steady signal E representative of the amplitude of the input signal.
-
公开(公告)号:DE1774540A1
公开(公告)日:1971-10-21
申请号:DE1774540
申请日:1968-07-12
Applicant: IBM
Inventor: KENNEDY FRENCH WALTER
Abstract: 1,209,534. Pattern recognition. INTERNATIONAL BUSINESS MACHINES CORP. 17 June, 1968 [14 July, 1967], No. 28709/68. Heading G4R. [Also in Division F2] An optical scanning device comprises a member movable over an aperture, the member moving successively along a plurality of paths, each path having a different angular relationship to the aperture. Portions, selected by a movable mask (not shown in Fig. 2), of a fingerprint (or similar pattern) to be recognized, are imaged by respective lenses 29 via an aperture 30 in the planet carrier 31 of an epicyclic gear on to respective photo-cells. A motor-driven gear 37 meshes with and rotates the planet carrier 31 which also meshes with idler gears 39, 40, 41. The shaft of a planet pinion 43 is supported in the planet carrier 31 (which itself has no shaft). The pinion 43 meshes with a fixed ring of gear teeth 42 and rotates a notched disc 46 to chop the light coming through aperture 30 from progressively changing directions so that the slope of fingerprint ridges can be detected. Comparison of the slopes from different portions of the fingerprint enables ridge ends and bifurcations to be detected. The outputs of the photo-cells can be filtered before use, to remove the chopping frequency. The photo-cell outputs can be averaged (with each other). Ridges may be counted.
-
公开(公告)号:DE1547032A1
公开(公告)日:1969-11-06
申请号:DE1547032
申请日:1967-04-29
Applicant: IBM
Inventor: KENNEDY FRENCH WALTER
IPC: G01R29/033 , G10L17/00 , H03K5/22 , H04M1/24
Abstract: 1,179,029. Speech recognition. INTERNATIONAL BUSINESS MACHINES CORP. 19 April, 1967 [2 May, 1966], No. 17934/67. Heading G4R. Speech . recognition, apparatus determines characteristics of successive pulse periods of a speech signal, accumulates the levels of correspondence between these characteristics and stored characteristics compared with them, and indicates when the accumulation exceeds a predetermined value after the comparison of a predetermined number of characteristics. The identity of a person is verified by comparing characteristics of a particular spoken word (or words) with stored characteristics of the word as spoken by the true person. The characteristics used are the voltage and time of occurrence of the first 3 peaks and 2 troughs of each of a series of pulse periods of the speech, a pulse period being a period of the voiced component of the speech. The voltage characteristics are all positive by choice of the zero of voltage. The characteristics mentioned for representative pulse periods of the word as spoken by the true person are retrieved from a back-up store .and placed in a closed-loop shift register. After detection of the beginning of .a pulse period in the input speech (i.e. that from the person, whose identity is to be verified), detection of the first peak causes the stored voltage and time characteristics for the first peak in each of three adjacent pulse period sections of the shift register to be subtracted from the actual voltage value of the input first peak (converted to digital form) and the time (specified by a clock-driven time counter) respectively, the six subtractions being accomplished by adders since the characteristics are stored in the shift register in negative form. The differences are squared and added into respective ones of three accumulators (via preliminary adders), one accumulator corresponding to each of the three shift register sections used. The next two troughs and two peaks are treated similarly, the shift register being shifted to bring the stored characteristics for these troughs and peaks into position for use, and the squared differences being added into the same accumulators. After the total of five troughs and peaks, a second counter causes the contents of the accumulator holding the smallest value, representing the difference between the input pulse period and the most similar of the three stored pulse periods used, to be gated to a sum accumulator, and also causes the shift register to be so shifted that the centre one of the three stored pulse periods which will be compared with the next input pulse period will be the most similar stored pulse period just determined. The above operations are repeated for each of the following input pulse periods, the sum accumulator accumulating a measure of the total deviation so far between the input and stored characteristics. The time counter, second counter and the accumulators (except the sum accumulator) are reset for each pulse period. After each pulse period, the second counter also gates the contents of the sum accumulator to a divider to be divided by the number of pulse periods which have so far occurred, as specified by a third counter. The quotient is compared with a threshold and if it exceeds it, a "stop non-verify" signal is produced which indicates the identity is false and stops operations. The totality of characteristics in the shift register is followed by a special mark and when all the stored characteristics have been used this is detected in a particular stage of the register and produces an "identity verified" signal if the "stop non- verify" signal is absent. The stored characteristics could be in analogue form and analogue circuitry be used. The input speech could be compared with a plurality of stored sets of characteristics, each set as above, to identify the speaker.
-
-
-
-
-
-
-
-