Improvements in vocoders
    3.
    发明专利

    公开(公告)号:GB1087305A

    公开(公告)日:1967-10-18

    申请号:GB2131465

    申请日:1965-05-20

    Applicant: IBM

    Abstract: 1,087,305. Vocoders. INTERNATIONAL BUSINESS MACHINES CORPORATION. May 20, 1965 [May 26, 1964], No. 21314/65. Heading H4R. In an analysis synthesis telephone system an excitation function is derived comprising a pulse sequence of a repetition frequency equal to the fundamental frequency of the speech input, which pulse sequence is used to control the sampling of the energy in the spectrum analysis bands. During unvoiced sounds, when the speech input is of such a character as to prevent the production of a satisfactory excitation function, an auxiliary pulse generator is switched in to control the sampling in the spectrum analysis bands. Fig. 3 shows an analyser in which a speech signal at 1 is applied to a low pass filter 2 and a zero crossing detector 3 to generate a train of pulses at the fundamental frequency of the speech input. These pulses are fed via an OR gate 17 to a sampling pulse generator 14 to generate pulses to open gates 8 1 to 8 n to feed samples from the outputs of the band pass filters 7 1 to 7 n , arranged to divide the frequency spectrum of the incoming speech into a number, n, of frequency bands, to the analog to digital converters 9 1 to 9 n , the outputs of which define the energy distribution across the frequency spectrum of the speech, i.e. the aggregate function. If the speech input is such that the zero crossing detector fails to produce pulses a single shot circuit, which is maintained in its metastable state by the pulses from the zero crossing detector, reverts to its stable state in 12 m. sees. to cause the pulse generator 16 to supply pulses, via OR gate 17, to the sampling generator 14 at 3 m. sec. intervals until pulses reappear at the output of the zero crossing detector. The pulse output from the zero crossing detector 3 is also applied to an " AND " gate 6 to feed the count of counter 5 to the output circuit 19, the counter being subsequently reset by the same pulse so that the counter output is a measure of the interval between pulses, the count being transmitted as the excitation function. The speech input at 1, via an amplifier 10 and level detector 11, keeps a single shot 12 in its metastable state to maintain gate 13 open so that if the speech level remains at zero for longer than 100 m. sees. the single shot 12 reverts to its stable state to inhibit the feed of sampling pulses from generator 14 to the gates 91 to 9 n . Voiced/unvoiced detector 18 described in detail with reference to Fig. 4 (not shown) comprises a logic circuit which produces an output, either when all the outputs of a number of the lowest frequency analysis filters are zero, or when the outputs of these filters is lower than that of groups of the same number of higher frequency analysis filters. Synthesiser circuit, Fig. 5.-Excitation function signals from the input-distributer circuit 26 are fed to a register 27 which sets counter 28 in accordance with the count corresponding to the interval between the excitation function pulses. The counter then counts, under the control of generator 29, down to zero when it generates a pulse to feed to distributer 35 via OR gate 36 before being reset in accordance with the following count in register 27. Meanwhile the aggregate function signals and voiced/ unvoiced signals are fed to the register 30, the voiced/unvoiced signals being used via a trigger circuit 37 to enable gate 38 to feed pulses from the auxiliary generator 39 to OR gate 36. and hence the distributer 35, in the absence of pulses from the counter 28 during unvoiced sounds. The digital to analog converter 31 converts the received aggregate function signals to analogue signals whose amplitudes vary according to the spectral energy distribution of the original speech, these signals being fed to the gates 32 1 to 32 n where they are gated appropriately by the pulses from the distributer 35. The resulting signals are passed through band-pass filters similar to the analysis filters in the analyser to an adding circuit 34 from which the synthesised speech is obtained. Since the mean frequency of the auxiliary generator 39, about 1000 c./s., is much higher than the fundamental speech frequency from counter 28, about 80 to 300 c./s., during unvoiced sounds the sample pulses from the digital to analog converter are narrowed in order that the energy intensity of the synthesised unvoiced sounds is made commensurate with that of the voiced sounds.

    Improvements relating to methods and apparatus for reducing random variations in periodicity in a pulse signal

    公开(公告)号:GB1093102A

    公开(公告)日:1967-11-29

    申请号:GB1316665

    申请日:1965-03-29

    Applicant: IBM

    Abstract: 1,093,102. Vocoder excitation function generator; pulse modulation systems. INTERNATIONAL BUSINESS MACHINES CORPORATION. March 29, 1965 [April 6, 1964], No. 13166/65. Headings H4L and H4R. In a vocoder the excitation function, which is a series of pulses corresponding to unidirectional zero crossings of the fundamental frequency of the speech, is treated to reduce the effect of noise on the speech by deriving from the pulse train the mean value of the pulse spacing over a predetermined time interval and transmitting a pulse at this pulse spacing. The process is repeated for successive time intervals, each time interval commencing at a time after the commencement of the preceding time interval which is equal to the pulse spacing derived during the preceding time interval. The mean pulse spacing may be obtained by weighting each pulse spacing in the incoming signal in accordance with the time of occurrence of the pulse after the commencement of the interval, adding the weighted pulse spacings, and dividing the sum by a suitable factor. The Figure shows an excitation function generator for a vocoder in which the incoming speech is applied via a non-linear circuit NLG and a filter BP to a zero crossing detector ND and a pulse shaper IF to produce a pulse at every zero crossing in one direction of the fundamental frequency of the speech. The speech is also applied to a voiced-unvoiced detector D the output of which operates gates T1 and T2 so that during unvoiced sounds, when the excitation pulses are randomly spaced, the excitation pulses are fed direct to the output ANK 1 while during voiced sounds the pulses are fed to the units 1 to IV in order to derive the mean pulse spacings in accordance with the invention. The unit I comprises a device which measures the spacing between each pair of excitation pulses and stores these measured pulse spacings. The unit II then sums these pulse spacings over a measured interval, weighted in accordance with the time of occurrence after the start of the time interval if required, and divides them by the number of pulses, or some function thereof, to obtain the mean pulse spacing which is used in unit IV to generate the output excitation function pulse train. It is stated that the arrangement introduces some degradation into speech signals but may be advantageous under noisy conditions.

Patent Agency Ranking