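Method and apparatus for compressing and decompressing genetic information obtained using next-generation sequencing
    3.
    Invention publication
    Original title: 차세대 시퀀싱을 이용하여 획득된 유전 정보를 압축 및 압축해제하는 방법 및 장치
    Status: Pending, under substantive examination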

    Publication number: KR1020130069427A

    Publication date: 2013-06-26

    Application number: KR1020120143620

    Application date: 2012-12-11

    CPC classification number: G06F17/30153 G06F19/22 G06F19/28

    Abstract: PURPOSE: A method and device for compressing and decompressing genetic information obtained by next-generation sequencing (NGS) are provided, which compress and store NGS reads aligned to a reference genome while supporting random access to those reads. CONSTITUTION: A data obtaining unit (1901) obtains alignment information about the positions of reads aligned to a reference sequence, and read information about the reads obtained by NGS. A read analyzing unit (1903) groups the reads into blocks corresponding to sections, using an addressing method that divides the reference sequence into sections. A compressing unit (1904) generates a compressed file that includes information about the addresses of the blocks. An addressing unit (1902) determines the addressing method based on the distribution of the reads aligned to the reference sequence. [Reference numerals] (1804) Genetic information compressing and decompressing device (a processing unit); (1901) Data obtaining unit; (1902) Addressing unit; (1903) Read analyzing unit; (1904) Compressing unit; (1905) Decompressing unit; (AA) Alignment information and read information; (BB) Reference sequence; (CC) Gene search information and compressed files; (DD) Compressed files; (EE) Genetic information
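
    The block-addressing idea in this abstract can be illustrated with a minimal Python sketch. It is not the patented method: it assumes fixed-size sections (the abstract says the addressing method is chosen from the read distribution), uses zlib as a stand-in codec, and the names `SECTION_SIZE`, `compress_reads`, and `fetch_reads` are hypothetical. It only shows how a per-block address index lets a query decompress just the blocks it needs.

```python
import json
import zlib
from collections import defaultdict

SECTION_SIZE = 1000  # hypothetical fixed section length, in bases


def compress_reads(reads, section_size=SECTION_SIZE):
    """Group (position, sequence) reads into per-section blocks and compress each.

    Returns (index, payload), where index maps a section number to the
    (offset, length) of its compressed block inside payload.
    """
    blocks = defaultdict(list)
    for pos, seq in reads:
        blocks[pos // section_size].append((pos, seq))

    index = {}
    payload = bytearray()
    for section in sorted(blocks):
        raw = json.dumps(sorted(blocks[section])).encode("ascii")
        packed = zlib.compress(raw)
        index[section] = (len(payload), len(packed))  # block address
        payload += packed
    return index, bytes(payload)


def fetch_reads(index, payload, start, end, section_size=SECTION_SIZE):
    """Return reads starting in [start, end), decompressing only the needed blocks."""
    found = []
    for section in range(start // section_size, (end - 1) // section_size + 1):
        if section not in index:
            continue  # no reads were aligned to this section
        offset, length = index[section]
        block = json.loads(zlib.decompress(payload[offset:offset + length]))
        found.extend((pos, seq) for pos, seq in block if start <= pos < end)
    return found
```

    Calling `fetch_reads(index, payload, 1000, 2000)` touches only the block for section 1, which is the random-access property the abstract describes.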


Method and apparatus for compressing genetic data
    4.
    Invention publication
    Original title: 유전자 데이터를 압축하는 방법 및 장치
    Status: Pending, under substantive examination

    Publication number: KR1020120137235A

    Publication date: 2012-12-20

    Application number: KR1020120056228

    Application date: 2012-05-25

    Abstract: PURPOSE: A genetic data compression method and apparatus are provided to efficiently compress, with high compression gain, text-based sequence data obtained by next-generation sequencing (NGS). CONSTITUTION: A parsing unit (702) parses the text of the sequence data into fields according to the information included in the text. A statistics acquisition unit (703) acquires statistics about the symbols included in the fields. An encoding algorithm discrimination unit (704) identifies, based on the statistics, the coding algorithm with the highest compression gain for each field. A compression unit (705) generates a bit stream that compresses the sequence data by coding it with the coding algorithms corresponding to the parsed fields. [Reference numerals] (700) Sequence data compression device; (701) Data receiving unit; (702) Parsing unit; (703) Statistics acquisition unit; (704) Encoding algorithm discrimination unit; (705) Compression unit; (AA) Sequence data (FASTQ file); (BB) Bit stream
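
    The per-field codec selection described here can be illustrated with a minimal Python sketch. It is not the patented method: the abstract selects algorithms from symbol statistics, whereas this sketch simply trial-compresses each field with three standard-library codecs and keeps the smallest result. The names `CODECS`, `parse_fastq`, and `pick_codecs` are hypothetical, and the parser assumes well-formed four-line FASTQ records.

```python
import bz2
import lzma
import zlib

# Candidate codecs; a stand-in for the coding algorithms the abstract refers to.
CODECS = {"zlib": zlib.compress, "bz2": bz2.compress, "lzma": lzma.compress}


def parse_fastq(text):
    """Split FASTQ text into three field streams: identifiers, bases, qualities."""
    lines = text.strip().split("\n")
    fields = {"id": [], "seq": [], "qual": []}
    for i in range(0, len(lines), 4):
        fields["id"].append(lines[i])
        fields["seq"].append(lines[i + 1])
        fields["qual"].append(lines[i + 3])  # lines[i + 2] is the "+" separator
    return {name: "\n".join(vals).encode("ascii") for name, vals in fields.items()}


def pick_codecs(fields):
    """For each field, return (codec name, compressed bytes) with the smallest output."""
    chosen = {}
    for name, data in fields.items():
        best = min(CODECS, key=lambda codec: len(CODECS[codec](data)))
        chosen[name] = (best, CODECS[best](data))
    return chosen
```

    Separating the fields before coding matters because identifiers, bases, and quality strings have very different symbol distributions, so a single codec is rarely the best choice for all three.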

