MICROPHONE FOR AUDIO SOURCE TRACKING
    31.
    发明申请
    MICROPHONE FOR AUDIO SOURCE TRACKING 审中-公开
    麦克风用于音频源跟踪

    公开(公告)号:WO2008082308A1

    公开(公告)日:2008-07-10

    申请号:PCT/NO2007/000455

    申请日:2007-12-20

    CPC classification number: H04N7/15 H04N7/142 H04R1/02 H04R1/34

    Abstract: The present invention discloses an arrangement utilizes a certain microphone assembly for facilitating audio source tracking systems in communication systems. It can be applied to both single and array microphones. The principal idea is to enhance the sound level acoustically in the critical high frequency range, thereby increasing the effective signal-to-noise ratio both for sound pickup and localization algorithms. This is done by enclosing the microphone into a channel or a small cavity (a Helmholtz-resonator), thereby introducing a high-frequency response peak (resonance), fairly broad-band.

    Abstract translation: 本发明公开了一种利用某个麦克风组件来促进通信系统中的音频源跟踪系统的装置。 它可以应用于单个和阵列麦克风。 主要思想是在关键的高频范围内增强声级,从而增加了声音拾取和定位算法的有效信噪比。 这通过将麦克风包围到通道或小腔(亥姆霍兹共振器)中,从而引入相当宽带的高频响应峰(谐振)。

    METHOD AND APPARATUS FOR VIDEO CONFERENCING HAVING DYNAMIC LAYOUT BASED ON KEYWORD DETECTION
    32.
    发明申请
    METHOD AND APPARATUS FOR VIDEO CONFERENCING HAVING DYNAMIC LAYOUT BASED ON KEYWORD DETECTION 审中-公开
    基于关键字检测的动态布局视频会议的方法和装置

    公开(公告)号:WO2007142533A1

    公开(公告)日:2007-12-13

    申请号:PCT/NO2007/000180

    申请日:2007-05-25

    CPC classification number: H04N7/147 G10L2015/088

    Abstract: In particular, the present invention provides a method and system for conferencing, including the steps of connecting at least two sites to a conference, receiving at least two video signals and two audio signals from the connected sites, consecutively analyzing the audio data from the at least two sites connected in the conference by converting at least a part of the audio data to acoustical features and extracting keywords and speech parameters from the acoustical features using speech recognition, and comparing said extracted keywords to predefined words, then deciding if said extracted predefined keywords are to be considered a call for attention based on said speech parameters, and further, defining an image layout based on said decision, and processing the received video signals to provide a video signal according to the defined image layout, and transmitting the composite video signal to at least one of the at least two connected sites.

    Abstract translation: 特别地,本发明提供了一种用于会议的方法和系统,包括将至少两个站点连接到会议的步骤,从连接的站点接收至少两个视频信号和两个音频信号,连续地分析来自at 通过将音频数据的至少一部分转换为声学特征并使用语音识别从声学特征提取关键词和语音参数,并将所提取的关键字与预定义的词进行比较,然后判断所述提取的预定义关键词 将被视为基于所述语音参数的关注呼叫,并且还基于所述决定定义图像布局,并且处理所接收的视频信号以根据所定义的图像布局提供视频信号,并且发送复合视频信号 至少至少两个连接位点中的至少一个。

    SEARCHABLE MULTIMEDIA STREAM
    33.
    发明申请
    SEARCHABLE MULTIMEDIA STREAM 审中-公开
    可搜索多媒体流

    公开(公告)号:WO2007078200A1

    公开(公告)日:2007-07-12

    申请号:PCT/NO2006/000423

    申请日:2006-11-22

    CPC classification number: H04N21/278 G10L15/26 H04N21/234309

    Abstract: The present invention provides a system and a method making an archived conference or presentation searchable after being stored in the archive server. According to the invention, one or more media streams coded according to H.323 or SIP are transmitted to a conversion engine for converting multimedia content into a standard streaming format, which may be a cluster of files, each representing a certain medium (audio, video, data) and/or a structure file that synchronizes and associates the different media together. When the conversion is carried out, the structure file is copied and forwarded to a post-processing server. The post-processing server includes i.a. a speech recognition engine generating a text file of alphanumeric characters representing all recognized words in the audio file. The text file is then entered into the cluster of files associating each identified word to a timing tag in the structure file. After this post-processing, finding key words and associated points of time in the media stream could easily be executed by a conventional search engine.

    Abstract translation: 本发明提供一种在归档服务器中存储之后,使归档会议或表示可搜索的系统和方法。 根据本发明,根据H.323或SIP编码的一个或多个媒体流被发送到转换引擎,用于将多媒体内容转换成标准流格式,其可以是一组文件,每个文件集合表示某一媒体(音频, 视频,数据)和/或将不同媒体同步并关联在一起的结构文件。 执行转换时,将结构文件复制并转发到后处理服务器。 后处理服务器包括i.a. 语音识别引擎,生成表示音频文件中所有识别的单词的字母数字字符的文本文件。 然后将文本文件输入到将每个识别的词与结构文件中的定时标签相关联的文件集合中。 在这种后处理之后,在传统的搜索引擎中可以容易地执行在媒体流中查找关键词和相关联的时间点。

    SYSTEM AND METHOD FOR PRESENCE DETETION
    35.
    发明申请
    SYSTEM AND METHOD FOR PRESENCE DETETION 审中-公开
    用于存在检测的系统和方法

    公开(公告)号:WO2005122576A1

    公开(公告)日:2005-12-22

    申请号:PCT/NO2005/000193

    申请日:2005-06-07

    CPC classification number: G01S13/04 G06K9/00228 G06K9/00771

    Abstract: The present invention discloses a system and method for automatically detecting the presence of a user in a presence application. The presence detection is provided by active detection mechanisms monitoring the localities near the endpoint or terminal connected to the application. The presence information is centrally stored in a presence server collecting the information directly from the respective user terminals. According to preferred embodiments of the present invention, presence is determined i.a. by means of radar detection, infrared light detection, video processing and face detection/recognition.

    Abstract translation: 本发明公开了一种用于在存在应用中自动检测用户的存在的系统和方法。 通过主动检测机制来监视存在检测,监视连接到应用的端点或终端附近的地点。 存在信息被集中存储在直接从各个用户终端收集信息的存在服务器中。 根据本发明的优选实施例, 通过雷达检测​​,红外光检测,视频处理和人脸检测/识别。

    SYSTEM AND METHOD FOR ENHANCED STEREO AUDIO
    36.
    发明申请
    SYSTEM AND METHOD FOR ENHANCED STEREO AUDIO 审中-公开
    用于增强立体声的系统和方法

    公开(公告)号:WO2005062595A1

    公开(公告)日:2005-07-07

    申请号:PCT/NO2004/000398

    申请日:2004-12-22

    CPC classification number: H04M9/082 H04M9/08

    Abstract: The present invention relates to an audio communication system and method with improved acoustic characteristics. A stereo detector is introduced in the echo cancellator of the system. When stereo in far-end audio is detected, converging of the adaptive model of the cancellator is suspended. According to alternative embodiments of the invention, the system is extended with a second echo cancellator removing the stereo image of the echo signal, in addition to a miscellaneous processing unit configured to attenuate the signal at certain events implying a large stereo echo contribution. A stereo collapsing unit is also introduced on the channels of the far-end audio to remove the stereo image at certain events to further suppress the echo contribution.

    Abstract translation: 本发明涉及具有改善的声学特性的音频通信系统和方法。 在系统的回波消除器中引入立体声检测器。 当检测到远端音频中的立体声时,取消器的自适应模型的收敛被暂停。 根据本发明的替代实施例,除了配置成在意味着大的立体声回波贡献的某些事件处衰减信号的杂项处理单元之外,系统还延长了除去回波信号的立体图像的第二回波消除器。 在远端音频的声道上还引入了立体声折叠单元,以在某些事件中去除立体声图像,以进一步抑制回波的贡献。

    SYSTEM AND METHOD FOR SIMPLIFIED CONFERENCE INITIATION
    37.
    发明申请
    SYSTEM AND METHOD FOR SIMPLIFIED CONFERENCE INITIATION 审中-公开
    用于简化会议启动的系统和方法

    公开(公告)号:WO2005057924A1

    公开(公告)日:2005-06-23

    申请号:PCT/NO2004/000329

    申请日:2004-10-29

    Inventor: SCHRADER, Thies

    CPC classification number: H04M7/003 H04M7/006

    Abstract: The present invention discloses a method and a system for initiating, routing and scheduling conferences. A dial URL is introduced with a prefix unique for calling purposes. When a user activates such an URL in his web browser, a content handler associated with the browser will recognize the type of URL and send a request to a managing tool to determine an available calling route between the user's preferred end-point and the end-point being addressed in the URL with the required resources. The managing tool then schedule the resources and initiate the call between the end-points. The invention allows for a one-click initiation of ad-hoc calls and conferences.

    Abstract translation: 本发明公开了一种用于发起,路由和调度会议的方法和系统。 引入了一个拨号网址,带有专用于呼叫目的的前缀。 当用户在其浏览器中激活这样的URL时,与浏览器相关联的内容处理器将识别URL的类型并向管理工具发送请求以确定用户的优选终点和终端之间的可用呼叫路由, 在URL中处理所需资源。 然后,管理工具调度资源并启动端点之间的通话。 本发明允许一键启动即席呼叫和会议。

    DISTRIBUTED REAL-TIME MEDIA COMPOSER
    39.
    发明申请
    DISTRIBUTED REAL-TIME MEDIA COMPOSER 审中-公开
    分布式实时媒体组合

    公开(公告)号:WO2005048600A1

    公开(公告)日:2005-05-26

    申请号:PCT/NO2004/000344

    申请日:2004-11-15

    Abstract: A system and a method allowing simultaneous exchange of audio, video and/or data information between a plurality of units over a communication network, supported by a central unit, wherein that the central unit is, based on knowledge regarding one or more of the units, adapted to instruct said one or more units to generate multimedia data streams adjusted to fit into certain restrictions to be presented on other units participating in a same session.

    Abstract translation: 一种允许在通过中央单元支持的通信网络上的多个单元之间同时交换音频,视频和/或数据信息的系统和方法,其中所述中央单元基于关于一个或多个单元的知识 适于指示所述一个或多个单元产生被调整以适应要在其他参与相同会话的单元呈现的某些限制的多媒体数据流。

Patent Agency Ranking