ZCURVE 3.0: identify prokaryotic genes with higher accuracy as well as automatically and accurately

来源 :第七届全国生物信息学与系统生物学学术大会 | 被引量 : 0次 | 上传用户:cerlin
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Background: In 2003,we developed an ab initio program,ZCURVE1.0,to find genes in bacterial and archaeal genomes.In this work,we present the updated version (i.e.ZCURVE 3.0).Using 422 prokaryotic genomes,the averageaccuracy was 93.7% with the updated version,compared with 88.7% with the original version.Such results also demonstrate that ZCURVE 3.0 is comparable with Glimmer 3.02 and may provide complementary predictions to it.In fact,the joint application of the two programs generated better results by correctly finding more annotated genes while also containing fewer false-positive predictions.As the exclusive function,ZCURVE 3.0 contains one post-processing program that can identify essential genes with high accuracy (>90%).
其他文献
Although non-coding DNA sequences do not encode proteins[1,2],more and more studies show that non-coding DNA plays an indispensable role in other aspects[3].In this paper,variant maps[4,5]are applied
Over sufficiently long genomic sequence,strand symmetry is a ubiquitous and explicit phenomenon.Despite being studied over two decades,the exact mechanism involved in strand-symmetry has not yet been
肉苁蓉(学名:Cistanche deserticola Ma)属于肉苁蓉属列当科,素有“沙漠人参”之美誉,具有极高的药用价值。多年来,肉苁蓉的研究多集中在其药用价值的研发、生物活性成分的分离鉴定以及人工栽培等方面,而对其遗传物质及其分子水平的研究鲜有报道。肉苁蓉是多年生专性全寄生性草本植物,专性寄生于藜科小乔木梭梭(Haloxylonammodendron)根部,而梭梭是适于生长在沙漠地区的抗旱
Coding/Non-coding genomic sequences[1]play a central role in modem Bioinformatics and System Biology especially for diagnosis of cancers & diseases base on genomic data sequences acquisition collected
Chor et al found that tetrapods animals (including all mammals) the frequency distribution of k-mer is showing multiple peaks.If the k-mer according to the number it contains CG dinucleotide classific
Bacterial pathogens secret numerous proteins,the effectors,in order to adapt to the new environment or promote virulence by the bacterium-host interactions.The mechanisms of secretion of effectors thr
Data quality and peak alignment efficiency of ChIP-sequencing profiles are directly related to the reliability and reproducibility of NSG experiments.Till now,there is no tool specifically designed fo
会议
SAROTUP (Scanner And Reporter Of Target-Unrelated Peptides) 3.0 is a significant upgrade to the widely used SAROTUP web server for the rapid identification of target-unrelated peptides (TUPs) from bio
Metagenome sequencing is a key technology for studying microbiome.A single metagenome sample usually contains millions of short reads from diverse species,with different genome length and abundance.Th
Pseudo dinucleotide composition (PseDNC) and Z curve are two widely used feature extracted methods from DNA sequences,and both of them show excellent performance in the classification issues for nucle