Automatic video layout for multi-stream multi-site telepresence conference system
    41.
    Invention patent
    Automatic video layout for multi-stream multi-site telepresence conference system [under examination, published]

    Publication No.: JP2014161029A

    Publication date: 2014-09-04

    Application No.: JP2014057290

    Application date: 2014-03-19

    CPC classification number: H04N7/152 G06F3/147

    Abstract: PROBLEM TO BE SOLVED: To automatically arrange the dynamic layout of video streams in a multi-stream multi-site telepresence conferencing system. SOLUTION: A videoconference multipoint control unit (MCU) automatically generates display layouts for videoconference endpoints. The display layouts are generated on the basis of attributes associated with plural video streams received from the endpoints and display configuration information of the endpoints. Each endpoint includes one or more attributes in each outgoing stream. The attributes are assigned on the basis of each video stream's role, content, camera source, etc. Display layouts are regenerated if one or more attributes change. A mixer generates video streams to be displayed at the endpoints on the basis of the display layout.

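The attribute-driven layout idea in this abstract can be illustrated with a minimal sketch. The role names, their priority order, and the round-robin placement are assumptions made for illustration only; the abstract does not specify how attributes map to display positions.

```python
from dataclasses import dataclass

# Hypothetical role priorities (lower = more prominent). The abstract only says
# attributes reflect a stream's role, content, camera source, etc.
ROLE_PRIORITY = {"active_speaker": 0, "content": 1, "people": 2}


@dataclass(frozen=True)
class Stream:
    stream_id: str
    site: str
    role: str


def generate_layout(streams, receiving_site, num_displays):
    """Return {display index: [stream ids]} for one receiving endpoint."""
    # An endpoint is not shown its own outgoing streams.
    candidates = [s for s in streams if s.site != receiving_site]
    # Most prominent roles first; stream_id breaks ties deterministically.
    candidates.sort(key=lambda s: (ROLE_PRIORITY.get(s.role, 99), s.stream_id))
    layout = {d: [] for d in range(num_displays)}
    # Deal the ordered streams across the endpoint's displays round-robin.
    for i, s in enumerate(candidates):
        layout[i % num_displays].append(s.stream_id)
    return layout
```

Regenerating the layout after an attribute change amounts to calling `generate_layout` again with the updated stream list; a mixer would then compose each display's streams accordingly.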

    Automatic microphone muting of undesired noises
    42.
    Invention patent
    Automatic microphone muting of undesired noises [under examination, published]

    Publication No.: JP2014053890A

    Publication date: 2014-03-20

    Application No.: JP2013180476

    Application date: 2013-08-30

    Abstract: PROBLEM TO BE SOLVED: To provide a method and system for cancellation of table noise in a speaker used for video or audio conferencing. SOLUTION: Table noise is cancelled by providing a signal or a message whenever a key is depressed on a keyboard or a mouse is clicked. When the key depression signal or message is received, the system determines whether speech is occurring. If speech is not occurring, then the microphone in the system is muted. However, if speech is occurring, the microphone is not muted for a prescribed time, to allow the speech to be transmitted to the far end. This allows the conference to continue in the presence of keyboard sounds if speech is occurring at the same time, but also silences the keyboard sounds if speech is not occurring at the same time.

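The decision logic described in this abstract can be sketched as a small state holder. The class name, the two hold durations, and the use of explicit timestamps are assumptions for illustration; the abstract states only that speech suppresses muting for a prescribed time.

```python
class KeystrokeMuter:
    """Decides whether the microphone is muted after key or mouse events."""

    def __init__(self, mute_hold_s=0.5, speech_hold_s=1.0):
        self.mute_hold_s = mute_hold_s      # assumed mute length per keystroke
        self.speech_hold_s = speech_hold_s  # the "prescribed time" without muting
        self.muted_until = 0.0
        self.no_mute_until = 0.0

    def on_keystroke(self, now, speech_active):
        if speech_active:
            # Speech is occurring: keep the microphone open for a while.
            self.no_mute_until = now + self.speech_hold_s
        elif now >= self.no_mute_until:
            # No speech and no protection window: silence the key noise.
            self.muted_until = now + self.mute_hold_s

    def is_muted(self, now):
        return self.no_mute_until <= now < self.muted_until
```

A caller would feed `on_keystroke` with the current time and the output of whatever speech detector the system uses, then consult `is_muted` before transmitting each audio frame.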

    System and method for video error concealment
    44.
    Invention patent
    System and method for video error concealment [under examination, published]

    Publication No.: JP2012070391A

    Publication date: 2012-04-05

    Application No.: JP2011225808

    Application date: 2011-10-13

    Abstract: PROBLEM TO BE SOLVED: To provide a system and method for concealing video errors. SOLUTION: The system encodes, reorders, and packetizes video information into video data packets for transmission over a communication network such that the system conceals errors caused by lost video data packets when the system receives, depacketizes, orders (915), and decodes the data packets. The system and method encode and packetize video information (925) such that adjacent macroblocks are not placed in the same video data packets. Additionally, the system and method may provide information accompanying the video data packets to facilitate the decoding process. An advantage is that errors due to video data packet loss are spatially distributed over a video frame.

    COPYRIGHT: (C)2012,JPO&INPIT
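One way to keep adjacent macroblocks out of the same packet is a diagonal interleave, sketched below. The mapping `(col + 2 * row) % num_packets` is an illustrative assumption, not the scheme claimed in the patent; it guarantees that horizontal and vertical neighbours land in different packets whenever at least three packets are used.

```python
def packet_for_macroblock(row, col, num_packets):
    """Pick a packet index so horizontal/vertical neighbours never share one."""
    if num_packets < 3:
        raise ValueError("need at least 3 packets to separate all 4-neighbours")
    # Horizontal neighbours differ by 1 and vertical neighbours by 2 (mod n),
    # and neither difference is 0 when n >= 3.
    return (col + 2 * row) % num_packets


def packetize(mb_rows, mb_cols, num_packets):
    """Group macroblock coordinates of one frame into interleaved packets."""
    packets = [[] for _ in range(num_packets)]
    for r in range(mb_rows):
        for c in range(mb_cols):
            packets[packet_for_macroblock(r, c, num_packets)].append((r, c))
    return packets
```

With this arrangement a single lost packet removes macroblocks scattered across the frame, so every missing block still has received neighbours to conceal from.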

    Automatic camera framing for videoconferencing
    45.
    Invention patent
    Automatic camera framing for videoconferencing [under examination, published]

    Publication No.: JP2011244455A

    Publication date: 2011-12-01

    Application No.: JP2011110881

    Application date: 2011-05-17

    CPC classification number: H04N7/142 G06K9/00234 G10L25/78 H04N5/232

    Abstract: PROBLEM TO BE SOLVED: To dynamically adjust image capturing of a video conference depending on the environment of the video conference, the seating arrangement of the participants, and the person who is speaking. SOLUTION: A videoconferencing apparatus automatically tracks speakers in a room and dynamically switches between a controlled, people-view camera and a fixed, room-view camera. When no one is speaking, the apparatus shows the room view to the far end. When there is a dominant speaker in the room, the apparatus directs the people-view camera at the dominant speaker and switches from the room-view camera to the people-view camera. When there is a new speaker in the room, the apparatus switches to the room-view camera first, directs the people-view camera at the new speaker, and then switches to the people-view camera directed at the new speaker. When there are two near-end speakers engaged in a conversation, the apparatus tracks them and causes the people-view camera to zoom in so that both speakers are in view.

    COPYRIGHT: (C)2012,JPO&INPIT
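The switching rules in this abstract can be sketched as a function that plans camera actions from the set of active speakers. The action names and the tuple representation of the framing target are hypothetical; treating a two-speaker conversation like any other new target (room view first, then reframing) is also an assumption, since the abstract does not say which feed is shown while the camera zooms.

```python
def plan_camera_actions(active_speakers, current_target):
    """Return an ordered list of (action, argument) steps.

    active_speakers: names of people currently speaking (zero, one, or two).
    current_target: tuple of names the people-view camera already frames,
                    or None if it has not been aimed yet.
    """
    if not active_speakers:
        # Nobody is speaking: send the fixed room view to the far end.
        return [("show", "room")]
    # One speaker, or two speakers framed together in a single shot.
    target = tuple(sorted(active_speakers))
    if target == current_target:
        return [("show", "people")]
    # New target: cut to the room view so the far end never sees the
    # people-view camera moving, re-aim it, then cut back.
    return [("show", "room"), ("aim_people_camera", target), ("show", "people")]
```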

    Method and system for adding translation in videoconference
    46.
    Invention patent
    Method and system for adding translation in videoconference [granted]

    Publication No.: JP2011209731A

    Publication date: 2011-10-20

    Application No.: JP2011076604

    Application date: 2011-03-30

    CPC classification number: H04N7/152 G06F17/289

    Abstract: PROBLEM TO BE SOLVED: To provide a multilingual multipoint videoconferencing system which provides real-time translation of speech by conferees to one or more languages. SOLUTION: Audio streams containing speech may be converted into text (220, 250) and inserted as subtitles into video streams (250, 240). Speech may also be translated from one language to another (240), with the translated speech either inserted into video streams as subtitles or used to replace the original audio stream with speech in the other language generated by a text-to-speech engine (220, 240, 250). Different conferees may receive different translations of the same speech based on information provided by the conferees on desired languages (210).

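The per-conferee routing of translations can be sketched as follows. The function name and the `translate` callable are hypothetical stand-ins for whatever speech-to-text and translation engines a real system would use; only the idea that each conferee receives text in the language they requested comes from the abstract.

```python
def subtitles_for_conferees(text, source_lang, desired_langs, translate):
    """Return {conferee: subtitle text} in each conferee's requested language.

    desired_langs: {conferee: language code} as supplied by the conferees.
    translate: callable (text, source_lang, target_lang) -> translated text.
    """
    by_lang = {}  # translate once per language, not once per conferee
    result = {}
    for conferee, lang in desired_langs.items():
        if lang not in by_lang:
            by_lang[lang] = (
                text if lang == source_lang else translate(text, source_lang, lang)
            )
        result[conferee] = by_lang[lang]
    return result
```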

    Spatially correlated audio in multipoint videoconferencing
    47.
    Invention patent
    Spatially correlated audio in multipoint videoconferencing [under examination, published]

    Publication No.: JP2009177827A

    Publication date: 2009-08-06

    Application No.: JP2009066601

    Application date: 2009-03-18

    CPC classification number: H04N7/152 H04M3/567 H04M3/568 H04M2201/50 H04N7/142

    Abstract: PROBLEM TO BE SOLVED: To provide audio location perception to an endpoint in multipoint videoconferencing by providing a plurality of audio streams to the endpoint. SOLUTION: Audio streams are differentiated so as to emphasize broadcasting of the audio streams through one or more loudspeakers closest to the position of a speaking endpoint in a videoconferencing layout that is displayed at the endpoint. For example, the audio broadcast at a loudspeaker that is at the far side of the screen might be attenuated or time delayed compared to the audio broadcast at a loudspeaker that is located at the near side of the display. The disclosure also provides a multipoint control unit (MCU) that processes audio signals from two or more endpoints according to the positions of the endpoints in a layout and then transmits processed audio streams to the endpoints. COPYRIGHT: (C)2009,JPO&INPIT

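The attenuation option mentioned in this abstract can be illustrated for a two-loudspeaker setup. The constant-power pan law and the normalized position `x` are assumptions chosen for the sketch; the abstract says only that the far-side loudspeaker may be attenuated or delayed relative to the near-side one.

```python
import math


def stereo_gains(x):
    """Return (left_gain, right_gain) for a speaking endpoint's window.

    x is the horizontal centre of the window in the displayed layout,
    normalized so that 0.0 is the left edge and 1.0 is the right edge.
    """
    if not 0.0 <= x <= 1.0:
        raise ValueError("x must be within [0, 1]")
    # Constant-power panning: left^2 + right^2 == 1 for every position,
    # so perceived loudness stays steady while the image moves.
    theta = x * math.pi / 2
    return math.cos(theta), math.sin(theta)
```

A window shown on the left quarter of the screen (`x = 0.25`) yields a larger left gain than right gain, which is the emphasis on the nearest loudspeaker that the abstract describes.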

    Ultrasonic camera tracking system and associated methods
    48.
    Invention patent
    Ultrasonic camera tracking system and associated methods [granted]

    Publication No.: JP2008113431A

    Publication date: 2008-05-15

    Application No.: JP2007271854

    Application date: 2007-10-18

    CPC classification number: G01S5/22 G01S3/808 H04N7/15

    Abstract: PROBLEM TO BE SOLVED: To provide an ultrasonic camera tracking system and methods. SOLUTION: A camera tracking system includes a controllable camera, an array of microphones, and a controller. The microphones are arranged adjacent to the controllable camera and are responsive to ultrasound emitted from a source. The microphones may additionally be capable of responding to sound. The controller receives ultrasound signals communicated from the microphones in response to ultrasound emitted from the source and processes the ultrasound signals to determine an approximate location of the source. Then, the controller sends command signals to the controllable camera to direct it at the determined location of the source. The camera tracking system tracks the source as it moves and continues to emit ultrasound. The source can be an emitter pack having one or more ultrasonic transducers that produce ultrasonic waves that sweep from about 24 kHz to about 40 kHz. COPYRIGHT: (C)2008,JPO&INPIT

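The abstract does not say how the controller turns microphone signals into a location. A common approach, assumed here purely for illustration, is to estimate the bearing from the time difference of arrival between two microphones under a far-field model; the function name and the speed-of-sound constant are part of that assumption.

```python
import math

SPEED_OF_SOUND_M_S = 343.0  # assumed value for air at about 20 degrees C


def bearing_from_delay(delay_s, mic_spacing_m):
    """Estimate the source bearing in degrees from a two-microphone delay.

    delay_s: arrival-time difference between the two microphones, in seconds.
    mic_spacing_m: distance between the microphones, in metres.
    0 degrees means the source is straight ahead (broadside to the pair).
    """
    if mic_spacing_m <= 0:
        raise ValueError("mic_spacing_m must be positive")
    # Far-field model: path difference = spacing * sin(bearing).
    ratio = SPEED_OF_SOUND_M_S * delay_s / mic_spacing_m
    # Measurement noise can push the ratio slightly outside [-1, 1].
    ratio = max(-1.0, min(1.0, ratio))
    return math.degrees(math.asin(ratio))
```

A controller could convert the resulting bearing into a pan command for the controllable camera and repeat the estimate as the emitter moves.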

    Dual-transform coding of audio signal
    49.
    Invention patent
    Dual-transform coding of audio signal [granted]

    Publication No.: JP2008102520A

    Publication date: 2008-05-01

    Application No.: JP2007269116

    Application date: 2007-10-16

    CPC classification number: G10L19/0212 G10L19/022

    Abstract: PROBLEM TO BE SOLVED: To improve the efficiency of an audio codec. SOLUTION: Methods, devices, and systems for coding and decoding audio are disclosed. At least two transforms are applied to an audio signal, each with a different transform period, for better resolution at both low and high frequencies. The transform coefficients are selected and combined such that the data rate remains similar to that of a single transform. The transform coefficients may be coded with a fast lattice vector quantizer. The quantizer has a high rate quantizer and a low rate quantizer. The high rate quantizer includes a scheme to truncate the lattice. The low rate quantizer includes a table based searching method. The low rate quantizer may also include a table based indexing scheme. The high rate quantizer may further include Huffman coding for the quantization indices of transform coefficients to improve the quantizing/coding efficiency. COPYRIGHT: (C)2008,JPO&INPIT

