-
-
Publication number: KR101922129B1
Publication date: 2018-11-26
Application number: KR1020120143620
Filing date: 2012-12-11
Applicant: Samsung Electronics Co., Ltd. (삼성전자주식회사)
Inventor: 브홀라바이샬 , 카즈이크발나딜 , 보파르디카시암선더아지트 , 말라바라푸라마스리칸쓰 , 나라야난란가비탈 , 안태진
CPC classification number: G06F17/30153 , G06F19/22 , G06F19/28
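Abstract: A method and apparatus for compressing genetic information obtain read information about reads and alignment information about the positions of the reads aligned to a reference sequence, and generate a compressed file that includes information on the addresses of the blocks corresponding to the aligned reads. A method and apparatus for decompressing genetic information obtain a compressed file of genetic information, determine from that file the address of the block corresponding to input gene search information, and selectively decompress the genetic information at the determined address.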
-
3.
Publication number: KR1020130069427A
Publication date: 2013-06-26
Application number: KR1020120143620
Filing date: 2012-12-11
Applicant: Samsung Electronics Co., Ltd. (삼성전자주식회사)
Inventor: 브홀라바이샬 , 카즈이크발나딜 , 보파르디카시암선더아지트 , 말라바라푸라마스리칸쓰 , 나라야난란가비탈 , 안태진
CPC classification number: G06F17/30153 , G06F19/22 , G06F19/28
Abstract: PURPOSE: A method and device for compressing and decompressing genetic information obtained by NGS (Next Generation Sequencing) are provided to compress and store NGS reads aligned to a reference genome and to support random access to reads by position on the reference genome. CONSTITUTION: A data obtaining unit (1901) obtains read information about reads obtained by NGS and alignment information about the positions of the reads aligned to a reference sequence. A read analyzing unit (1903) groups the reads into blocks corresponding to sections, using an addressing method that divides the reference sequence into the sections. A compressing unit (1904) generates a compressed file that includes information about the addresses of the blocks. An addressing unit (1902) identifies the addressing method based on the distribution of the reads aligned to the reference sequence. [Reference numerals] (1804) Genetic information compressing and decompressing device (a processing unit); (1901) Data obtaining unit; (1902) Addressing unit; (1903) Read analyzing unit; (1904) Compressing unit; (1905) Decompressing unit; (AA) Alignment information and read information; (BB) Reference sequence; (CC) Gene search information and compressed files; (DD) Compressed files; (EE) Genetic information
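The block addressing described in this abstract can be illustrated with a minimal sketch. It is not the patented method: it assumes fixed-width sections of the reference sequence (the abstract says the addressing method is chosen from the read distribution), uses zlib as a stand-in compressor, and ignores reads that span two sections. The names `BLOCK_SIZE`, `compress_reads`, and `reads_near` are hypothetical.

```python
import json
import zlib
from collections import defaultdict

BLOCK_SIZE = 1000  # hypothetical section width, in reference bases


def compress_reads(aligned_reads, block_size=BLOCK_SIZE):
    """Group (position, sequence) reads into blocks and compress each block separately."""
    blocks = defaultdict(list)
    for pos, seq in aligned_reads:
        # The block address is the index of the section containing the read start.
        blocks[pos // block_size].append((pos, seq))

    index = {}             # block id -> (byte offset, byte length) within payload
    payload = bytearray()
    for block_id in sorted(blocks):
        raw = json.dumps(sorted(blocks[block_id])).encode("utf-8")
        packed = zlib.compress(raw)
        index[block_id] = (len(payload), len(packed))
        payload += packed
    return {"block_size": block_size, "index": index, "payload": bytes(payload)}


def reads_near(compressed, position):
    """Decompress only the block whose section contains `position`."""
    block_id = position // compressed["block_size"]
    entry = compressed["index"].get(block_id)
    if entry is None:
        return []  # no reads were aligned to this section
    offset, length = entry
    chunk = compressed["payload"][offset:offset + length]
    return [tuple(item) for item in json.loads(zlib.decompress(chunk))]
```

Because each block is compressed independently and the index records its byte range, a query touches one block instead of the whole file, which is the random-access property the abstract claims.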
-
Publication number: KR1020120137235A
Publication date: 2012-12-20
Application number: KR1020120056228
Filing date: 2012-05-25
Applicant: Samsung Electronics Co., Ltd. (삼성전자주식회사)
Inventor: 안태진 , 브홀라바이샬 , 보파르디카시암선더아지트 , 이규상 , 나라야난란가비탈
IPC: G06F19/10
Abstract: PURPOSE: A genetic data compression method and apparatus are provided to compress, with high compression gain, text-based sequence data obtained by NGS (Next Generation Sequencing). CONSTITUTION: A parsing unit (702) parses the text of the sequence data into fields according to the information the text contains. A statistics acquisition unit (703) acquires statistics about the symbols in each field. A coding algorithm identifying unit (704) uses the statistics to identify, for each field, the coding algorithm with the highest compression gain. A compression unit (705) generates a bit stream of compressed sequence data by coding each parsed field with its corresponding coding algorithm. [Reference numerals] (700) Sequence data compression device; (701) Data receiving unit; (702) Parsing unit; (703) Statistics acquisition unit; (704) Coding algorithm identifying unit; (705) Compression unit; (AA) Sequence data (FASTQ file); (BB) Bit stream
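The per-field codec selection in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the patented method: it measures compression gain by trial compression rather than estimating it from symbol statistics, and the candidate codecs (zlib, bz2, lzma from the Python standard library) are stand-ins. The names `CODECS`, `parse_fastq`, and `pick_codecs` are hypothetical.

```python
import bz2
import lzma
import zlib

# Candidate coding algorithms (an assumption; the abstract does not name any).
CODECS = {"zlib": zlib.compress, "bz2": bz2.compress, "lzma": lzma.compress}


def parse_fastq(text):
    """Split FASTQ text into three field streams: identifiers, bases, qualities."""
    lines = text.strip().splitlines()
    fields = {"id": [], "seq": [], "qual": []}
    for i in range(0, len(lines), 4):
        fields["id"].append(lines[i][1:])    # drop the leading '@'
        fields["seq"].append(lines[i + 1])
        fields["qual"].append(lines[i + 3])  # lines[i + 2] is the '+' separator
    return {name: "\n".join(vals).encode("ascii") for name, vals in fields.items()}


def pick_codecs(fields):
    """For each field, keep the codec whose output is smallest."""
    chosen = {}
    for name, data in fields.items():
        outputs = {codec: fn(data) for codec, fn in CODECS.items()}
        best = min(outputs, key=lambda codec: len(outputs[codec]))
        chosen[name] = (best, outputs[best])
    return chosen
```

Separating the fields before coding matters because identifiers, bases, and quality scores have very different symbol distributions, so a single codec is rarely the best choice for all three.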
-
-
-