Abstract:
PROBLEM TO BE SOLVED: To automatically and dynamically arrange the layout of video streams in a multi-stream, multi-site telepresence conferencing system. SOLUTION: A videoconference multipoint control unit (MCU) automatically generates display layouts for videoconference endpoints. The display layouts are generated on the basis of attributes associated with the plural video streams received from the endpoints and display configuration information of the endpoints. Each endpoint includes one or more attributes in each outgoing stream. The attributes are assigned on the basis of each video stream's role, content, camera source, etc. Display layouts are regenerated if one or more attributes change. A mixer generates the video streams to be displayed at the endpoints on the basis of the display layouts.
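The attribute-driven layout step can be illustrated with a minimal sketch. The role names, their priority order, and the `Stream` and `generate_layout` names are assumptions made for illustration; the abstract states only that layouts depend on stream attributes and display configuration and are regenerated when attributes change.

```python
from dataclasses import dataclass

# Hypothetical role priorities (lower value = placed first); not from the abstract.
PRIORITY = {"active_speaker": 0, "content": 1, "people": 2}


@dataclass(frozen=True)
class Stream:
    stream_id: str
    role: str  # attribute carried in the outgoing stream


def generate_layout(streams, num_screens):
    """Map screen index -> stream id, filling screens in role-priority order."""
    ranked = sorted(
        streams,
        key=lambda s: (PRIORITY.get(s.role, len(PRIORITY)), s.stream_id),
    )
    return {screen: s.stream_id for screen, s in enumerate(ranked[:num_screens])}


# Initial attributes for three incoming streams, shown on a two-screen endpoint.
before = generate_layout(
    [Stream("a", "people"), Stream("b", "active_speaker"), Stream("c", "content")],
    num_screens=2,
)
# An attribute change (stream "a" becomes the active speaker) triggers regeneration.
after = generate_layout(
    [Stream("a", "active_speaker"), Stream("b", "people"), Stream("c", "content")],
    num_screens=2,
)
```

Calling `generate_layout` again whenever an attribute changes corresponds to the regeneration step; a real MCU would also take each endpoint's display configuration into account.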
Abstract:
PROBLEM TO BE SOLVED: To provide a method and system for cancelling table noise in a speaker used for video or audio conferencing. SOLUTION: Table noise is cancelled by providing a signal or message whenever a key is pressed on a keyboard or a mouse is clicked. When the key-press signal or message is received, the system determines whether speech is occurring. If speech is not occurring, the microphone in the system is muted. If speech is occurring, however, the microphone is not muted for the prescribed time, so that the speech is transmitted to the far end. This allows the conference to continue in the presence of keyboard sounds when speech occurs at the same time, and silences the keyboard sounds when it does not.
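The mute decision described above can be sketched as follows. The `MuteController` name, the 0.25 s mute window, and the use of caller-supplied timestamps are assumptions for illustration; the abstract does not specify how long the microphone stays muted or how speech is detected.

```python
from dataclasses import dataclass


@dataclass
class MuteController:
    mute_seconds: float = 0.25  # hypothetical mute window after a key event
    muted_until: float = 0.0    # time until which the microphone stays muted

    def on_key_event(self, now: float, speech_active: bool) -> None:
        """Handle a key-press or mouse-click notification."""
        if not speech_active:
            # No speech to protect: silence the microphone briefly.
            self.muted_until = now + self.mute_seconds
        # If speech is active, leave the microphone open so the far end hears it.

    def is_muted(self, now: float) -> bool:
        return now < self.muted_until


ctrl = MuteController()
ctrl.on_key_event(now=1.0, speech_active=True)   # typing while talking
open_during_speech = not ctrl.is_muted(1.0)
ctrl.on_key_event(now=2.0, speech_active=False)  # typing in silence
muted_in_silence = ctrl.is_muted(2.1)
```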
Abstract:
PROBLEM TO BE SOLVED: To provide a method for reconstructing video information lost as a result of transmission errors. SOLUTION: A system and method for reconstructing video information lost as a result of transmission errors have four aspects: (1) changing the bit rate and/or packet rate; (2) inserting redundant information into the video bit stream; (3) providing periodic automatic refresh of certain regions of the video; and (4) interleaving coded macroblocks into diversity groups for transmission, to spatially spread the effect of lost packets. By using these aspects, image reconstruction may provide an enhanced result in the presence of transmission losses.
Abstract:
PROBLEM TO BE SOLVED: To provide a system and method for concealing video errors. SOLUTION: The system encodes, reorders, and packetizes video information into video data packets for transmission over a communication network, such that errors caused by lost video data packets are concealed when the system receives, depacketizes, orders (915), and decodes the data packets. The system and method encode and packetize the video information (925) such that adjacent macroblocks are not placed in the same video data packet. Additionally, the system and method may provide information accompanying the video data packets to facilitate the decoding process. An advantage is that errors due to video data packet loss are spatially distributed over a video frame.
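One way to keep adjacent macroblocks out of the same packet is sketched below. The four-group mapping `(col + 2 * row) % 4` and the function names are assumptions chosen for illustration, not the mapping used in the patent; it has the property that no macroblock shares a group with any of its eight neighbours.

```python
def diversity_group(row: int, col: int) -> int:
    """Assign a macroblock to one of four groups (hypothetical mapping)."""
    return (col + 2 * row) % 4


def packetize(rows: int, cols: int):
    """Return four packets, each a list of (row, col) macroblock positions."""
    packets = [[] for _ in range(4)]
    for r in range(rows):
        for c in range(cols):
            packets[diversity_group(r, c)].append((r, c))
    return packets


def has_adjacent_pair(packet) -> bool:
    """True if any two macroblocks in the packet touch, including diagonally."""
    members = set(packet)
    for r, c in packet:
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                if (dr, dc) != (0, 0) and (r + dr, c + dc) in members:
                    return True
    return False


packets = packetize(rows=4, cols=6)
```

If one packet is lost, every missing macroblock is still surrounded by received neighbours, which is what makes spatial concealment at the decoder practical.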
Abstract:
PROBLEM TO BE SOLVED: To dynamically adjust image capture in a video conference depending on the conference environment, the seating arrangement of the participants, and the person who is speaking. SOLUTION: A videoconferencing apparatus automatically tracks speakers in a room and dynamically switches between a controlled, people-view camera and a fixed, room-view camera. When no one is speaking, the apparatus shows the room view to the far end. When there is a dominant speaker in the room, the apparatus directs the people-view camera at the dominant speaker and switches from the room-view camera to the people-view camera. When there is a new speaker in the room, the apparatus switches to the room-view camera first, directs the people-view camera at the new speaker, and then switches to the people-view camera directed at the new speaker. When two near-end speakers are engaged in a conversation, the apparatus tracks them and causes the people-view camera to zoom in so that both speakers are in view.
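The switching rules for a single speaker can be sketched as a small decision function. The function name, the string labels, and the `camera_settled` flag are assumptions for illustration; the two-speaker zoom case is left out.

```python
from typing import Optional, Tuple


def next_view(
    speaker: Optional[str],
    camera_target: Optional[str],
    camera_settled: bool,
) -> Tuple[str, Optional[str]]:
    """Return (view sent to the far end, target for the people-view camera)."""
    if speaker is None:
        # Nobody is speaking: show the fixed room view.
        return "room", camera_target
    if speaker != camera_target:
        # New speaker: show the room while the people-view camera is redirected.
        return "room", speaker
    if not camera_settled:
        # Camera is still moving toward the speaker; keep the room view.
        return "room", speaker
    # Camera is framed on the current speaker: switch to the people view.
    return "people", speaker


silent = next_view(None, None, True)
moving = next_view("alice", None, True)
framed = next_view("alice", "alice", True)
handoff = next_view("bob", "alice", True)
```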
Abstract:
PROBLEM TO BE SOLVED: To provide a multilingual multipoint videoconferencing system that provides real-time translation of conferees' speech into one or more languages. SOLUTION: Audio streams containing speech may be converted into text (220, 250) and inserted as subtitles into video streams (250, 240). Speech may also be translated from one language to another (240), with the translated speech either inserted into video streams as subtitles or used to replace the original audio stream with speech in the other language generated by a text-to-speech engine (220, 240, 250). Different conferees may receive different translations of the same speech based on information provided by the conferees on their desired languages (210).
Abstract:
PROBLEM TO BE SOLVED: To provide audio location perception to an endpoint in a multipoint videoconference by providing a plurality of audio streams to the endpoint. SOLUTION: Audio streams are differentiated so as to emphasize broadcasting of the audio streams through the one or more loudspeakers closest to the position of a speaking endpoint in the videoconferencing layout that is displayed at the endpoint. For example, the audio broadcast at a loudspeaker on the far side of the display might be attenuated or time delayed compared to the audio broadcast at a loudspeaker on the near side of the display. The disclosure also provides a multipoint control unit (MCU) that processes audio signals from two or more endpoints according to the positions of the endpoints in a layout and then transmits the processed audio streams to the endpoints. COPYRIGHT: (C)2009,JPO&INPIT
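As an illustration of the attenuation option, the sketch below derives left and right loudspeaker gains from the horizontal position of the speaking endpoint's window in the layout. Constant-power panning is a swapped-in, well-known technique used here as an assumption; the abstract does not say which attenuation law or delay is applied, and the function name is hypothetical.

```python
import math


def stereo_gains(x: float):
    """Return (left_gain, right_gain) for a window centred at x.

    x is the normalized horizontal position in the displayed layout:
    0.0 is the left edge and 1.0 is the right edge (an assumed convention).
    """
    x = max(0.0, min(1.0, x))
    angle = x * math.pi / 2
    # Constant-power pan: left**2 + right**2 == 1 for every position.
    return math.cos(angle), math.sin(angle)


left_edge = stereo_gains(0.0)
centre = stereo_gains(0.5)
right_edge = stereo_gains(1.0)
```

A window on the left of the screen is played mostly through the left loudspeaker, and the far-side loudspeaker is attenuated accordingly.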
Abstract:
PROBLEM TO BE SOLVED: To provide an ultrasonic camera tracking system and methods. SOLUTION: A camera tracking system includes a controllable camera, an array of microphones, and a controller. The microphones are arranged adjacent to the controllable camera and are responsive to ultrasound emitted from a source. The microphones may additionally be capable of responding to audible sound. The controller receives ultrasound signals communicated from the microphones in response to ultrasound emitted from the source and processes the ultrasound signals to determine an approximate location of the source. The controller then sends command signals to the controllable camera to direct it at the determined location of the source. The camera tracking system tracks the source as it moves and continues to emit ultrasound. The source can be an emitter pack having one or more ultrasonic transducers that produce ultrasonic waves that sweep from about 24 kHz to about 40 kHz. COPYRIGHT: (C)2008,JPO&INPIT
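The abstract does not state how the controller locates the source. One common approach, shown here purely as an assumption, is to estimate a bearing from the time difference of arrival (TDOA) between two microphones under a far-field model. The function name, the 343 m/s speed of sound, and the microphone spacing are illustrative values.

```python
import math

SPEED_OF_SOUND = 343.0  # m/s in air at about 20 degrees C (assumed)


def bearing_from_tdoa(delay_s: float, mic_spacing_m: float) -> float:
    """Bearing of the source in degrees from broadside, far-field model.

    delay_s is the arrival-time difference between the two microphones.
    """
    ratio = SPEED_OF_SOUND * delay_s / mic_spacing_m
    ratio = max(-1.0, min(1.0, ratio))  # guard against measurement noise
    return math.degrees(math.asin(ratio))


straight_ahead = bearing_from_tdoa(0.0, 0.10)
thirty_degrees = bearing_from_tdoa(0.05 / SPEED_OF_SOUND, 0.10)
```

The resulting angle could then be translated into pan commands for the controllable camera; combining several microphone pairs would be needed to obtain tilt and range as well.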
Abstract:
PROBLEM TO BE SOLVED: To improve the efficiency of an audio codec. SOLUTION: Methods, devices, and systems for coding and decoding audio are disclosed. At least two transforms are applied to an audio signal, each with a different transform period, for better resolution at both low and high frequencies. The transform coefficients are selected and combined such that the data rate remains similar to that of a single transform. The transform coefficients may be coded with a fast lattice vector quantizer. The quantizer has a high-rate quantizer and a low-rate quantizer. The high-rate quantizer includes a scheme to truncate the lattice. The low-rate quantizer includes a table-based searching method. The low-rate quantizer may also include a table-based indexing scheme. The high-rate quantizer may further include Huffman coding of the quantization indices of the transform coefficients to improve the quantizing/coding efficiency. COPYRIGHT: (C)2008,JPO&INPIT
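The reason for using two transform periods can be seen from the usual time/frequency trade-off of a block transform: a longer frame gives finer frequency spacing but coarser time resolution. The sketch below assumes DFT-style bin spacing, and the 48 kHz sample rate and frame lengths are hypothetical values, not taken from the abstract.

```python
def transform_resolution(frame_len: int, sample_rate: int):
    """Return (bin spacing in Hz, frame duration in seconds) for one transform."""
    bin_spacing_hz = sample_rate / frame_len
    frame_duration_s = frame_len / sample_rate
    return bin_spacing_hz, frame_duration_s


# Hypothetical long and short transforms at a 48 kHz sample rate.
long_bin, long_dur = transform_resolution(960, 48000)
short_bin, short_dur = transform_resolution(480, 48000)
```

Here the long transform resolves frequencies twice as finely, which suits low-frequency content, while the short transform tracks time twice as finely, which suits high-frequency content.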
Abstract:
PROBLEM TO BE SOLVED: To provide a conferencing solution that overcomes the prior-art defect of usage being limited to restricted areas. SOLUTION: The system, method, and apparatus enable a wireless personal area network, such as Bluetooth (R) and Piconet, to be extended to a distant location beyond its standard area through a conferencing connection. For example, the conferencing connection can have one or more ISDN lines or an IP connection between two or more ISDN lines. A broadband connection can have video, voice, control, and Bluetooth (R) channels. COPYRIGHT: (C)2008,JPO&INPIT