Overlap graphs and de Bruijn graphs: data structures for de novo genome assembly in the big data era

Source: Quantitative Biology (English edition)
Background: De novo genome assembly relies on two kinds of graphs: de Bruijn graphs and overlap graphs. Overlap graphs are the basis for the Celera assembler, while de Bruijn graphs have become the dominant technical device in the last decade. Those two kinds of graphs are collectively called assembly graphs. Results: In this review, we discuss the most recent advances in the problem of constructing, representing and navigating assembly graphs, focusing on very large datasets. We will also explore some computational techniques, such as the Bloom filter, to compactly store graphs while keeping all functionalities intact. Conclusions: We complete our analysis with a discussion on the algorithmic issues of assembling from long reads (e.g., PacBio and Oxford Nanopore). Finally, we present some of the most relevant open problems in this field.
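As a rough illustration of the idea mentioned in the abstract, storing a de Bruijn graph compactly with a Bloom filter, the following minimal sketch inserts every k-mer of a read set into a Bloom filter and navigates the graph by querying the four possible one-base extensions of a k-mer. It is not taken from the review: the names `BloomFilter`, `build_filter` and `successors`, the choice of hash function, and the default sizes are all assumptions made for this example.

```python
import hashlib


class BloomFilter:
    """Bit-array set with no false negatives and a tunable false-positive rate."""

    def __init__(self, num_bits, num_hashes):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, item):
        # Derive num_hashes bit positions by salting one hash function
        # (an illustrative choice, not a tuned one).
        for seed in range(self.num_hashes):
            digest = hashlib.blake2b(
                item.encode(), digest_size=8, salt=seed.to_bytes(8, "little")
            ).digest()
            yield int.from_bytes(digest, "little") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(item)
        )


def kmers(read, k):
    """Yield every length-k substring of a read."""
    return (read[i:i + k] for i in range(len(read) - k + 1))


def build_filter(reads, k, num_bits=1 << 16, num_hashes=3):
    """Insert all k-mers of the reads; the k-mers are the graph's nodes."""
    bf = BloomFilter(num_bits, num_hashes)
    for read in reads:
        for kmer in kmers(read, k):
            bf.add(kmer)
    return bf


def successors(bf, kmer):
    """Out-neighbours of a k-mer: edges are implied by (k-1)-overlaps."""
    suffix = kmer[1:]
    return [suffix + base for base in "ACGT" if suffix + base in bf]


reads = ["ACGTAC"]
bf = build_filter(reads, k=3)
neighbours = successors(bf, "ACG")  # contains "CGT", possibly plus false positives
```

Because a Bloom filter can report k-mers that were never inserted, `successors` may return spurious neighbours. A practical assembler would need some additional mechanism to deal with these false positives, which this sketch leaves out.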