Invention Grant
- Patent Title: Method and systems for genome sequence compression
-
Application No.: US17445202Application Date: 2021-08-17
-
Publication No.: US11769570B2Publication Date: 2023-09-26
- Inventor: Zhenhao Sun , Meng Wang , Shiqi Wang , Tak Wu Sam Kwong
- Applicant: City University of Hong Kong
- Applicant Address: CN Hong Kong
- Assignee: City University of Hong Kong
- Current Assignee: City University of Hong Kong
- Current Assignee Address: CN Hong Kong
- Agency: S&F/WEHRW
- Main IPC: H03M7/34
- IPC: H03M7/34 ; G16B50/50 ; G16B40/20 ; H03M7/30 ; G06F16/22 ; G06N7/01

Abstract:
Systems and methods for genome sequence compression and decompression are provided. The method for compression encoding of a genome sequence includes partitioning a genome sequence into a plurality of Group of Bases (GoBs) and processing each of the plurality of GoBs independently to encode the genome sequence into a bit stream. Processing each of the plurality of GoBs includes dividing each of the plurality of GOBs into a first part and a second part, the first part including an initial context part and the second part including a learning-based inference part. The processing each of the plurality of GoBs further includes encoding the first part in accordance with a Markov model, encoding the second part in accordance with a learning-based model, and encoding the encoded first part and the encoded second part into the bit stream with an arithmetic encoder. The learning-based model may include Long and Short-Term Memory (LSTM)-based neural networks.
Public/Granted literature
- US20230076603A1 METHOD AND SYSTEMS FOR GENOME SEQUENCE COMPRESSION Public/Granted day:2023-03-09
Information query
IPC分类: