61.
Publication No.: KR1020080052193A
Publication date: 2008-06-11
Application No.: KR1020070046719
Filing date: 2007-05-14
Applicant: 한국전자통신연구원
Abstract: An apparatus and a method for assigning a biological pathway to a gene list by using gene homologue information are provided. By automatically giving a biological pathway name to each gene cluster list produced by clustering chip experiment data, they let a person analyzing the chip data determine the function of a cluster easily. The apparatus includes a homologue/unique gene list generator (110), a vocabulary assigning unit (120), a statistical similarity probability calculator (130), and a pathway assigning unit (140). The homologue/unique gene list generator produces a homologue gene list or a unique gene list by selecting, from the gene list of a target species, the genes that a comparison species also has or does not have, respectively. The vocabulary assigning unit assigns GO (Gene Ontology) and KEGG (Kyoto Encyclopedia of Genes and Genomes) vocabulary terms to the homologue/unique gene list. The statistical similarity probability calculator calculates, for each GO and KEGG vocabulary term, the hypergeometric distribution similarity probability in the homologue/unique gene list. The pathway assigning unit selects, for each gene list, the GO and KEGG vocabulary term having the minimum hypergeometric distribution similarity probability and assigns the selected term as the biological pathway name.
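The selection rule described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the exact formula, so this assumes the "hypergeometric distribution similarity probability" is the usual upper-tail hypergeometric probability of seeing at least the observed number of annotated genes in the list. The function names, the gene identifiers, and the toy terms are all hypothetical and are not taken from the patent.

```python
from math import comb


def split_by_homology(target_genes, comparison_genes):
    """Split target-species genes into those the comparison species also
    has (homologue list) and those it lacks (unique list)."""
    comparison = set(comparison_genes)
    homologue = [g for g in target_genes if g in comparison]
    unique = [g for g in target_genes if g not in comparison]
    return homologue, unique


def hypergeom_upper_tail(population, annotated, sample, hits):
    """P(X >= hits) when `sample` genes are drawn without replacement from
    `population` genes, `annotated` of which carry the term.
    Assumption: this tail probability stands in for the patent's
    'similarity probability'."""
    max_hits = min(annotated, sample)
    favorable = sum(
        comb(annotated, i) * comb(population - annotated, sample - i)
        for i in range(hits, max_hits + 1)
    )
    return favorable / comb(population, sample)


def assign_pathway(gene_list, term_to_genes, background):
    """Return (term, probability) for the term with the minimum
    hypergeometric probability, or (None, 1.0) if no term overlaps."""
    background = set(background)
    genes = set(gene_list) & background
    best_term, best_p = None, 1.0
    for term, members in term_to_genes.items():
        members = set(members) & background
        hits = len(members & genes)
        if hits == 0:
            continue  # a term with no overlap cannot name the list
        p = hypergeom_upper_tail(len(background), len(members), len(genes), hits)
        if p < best_p:
            best_term, best_p = term, p
    return best_term, best_p


# Toy data: ten hypothetical genes and two hypothetical vocabulary terms.
background = [f"g{i}" for i in range(1, 11)]
terms = {
    "toy_term_A": ["g1", "g2", "g3"],
    "toy_term_B": ["g3", "g4", "g5", "g6", "g7", "g8"],
}
homologue, unique = split_by_homology(["g1", "g2", "g3", "g9"], ["g1", "g2", "g3"])
best_term, best_p = assign_pathway(homologue, terms, background)
print(best_term, best_p)
```

In this toy case the homologue list coincides exactly with the genes of `toy_term_A`, so that term receives the smallest probability and would be assigned as the pathway name. A real system would draw term memberships from GO and KEGG annotation files and would likely apply a multiple-testing correction, which the abstract does not discuss.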