Genome-wide protein structure prediction and structure-based function annotation

Source: The 5th National Conference on Bioinformatics and Systems Biology
  The biological function of a protein molecule is determined by the shape of its three-dimensional structure. Is it possible to predict protein structure and function from the amino acid sequence? We developed a new algorithm, I-TASSER, which assembles atomic structures of proteins using fragments excised from unrelated experimental structures. Functional insights (e.g., ligand-binding affinity, enzyme classification and gene ontology) are then deduced by matching the predicted structure models with known proteins in protein function libraries. The I-TASSER algorithm was ranked as the best method for automated protein structure prediction in the community-wide CASP experiments of 2006, 2008 and 2010; it was also ranked at the top for protein function annotation in CASP9 in 2010. In this talk, we first review recent progress in computer-based protein structure prediction, including new developments in ab initio folding and atomic structure refinement since the CASP9 experiment, and show that the protein structure prediction problem can in principle be solved using the current PDB library. Next, we discuss the application of the developed methods to the structural and functional modeling of a number of genomes. These include all G-protein coupled receptors (GPCRs) in the human genome, for which 90% of the resulting models are shown to have correct topology, and Marek's disease virus, the first successful computational modeling of a complete viral genome. Finally, we demonstrate how the predicted I-TASSER structure models can be used to annotate the biological function of proteins and to screen drug candidates by matching their global topology and functional sites against existing structure, function and binding databases.
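  The abstract does not say how "correct topology" or a structural match against a function library is scored. One widely used measure for this kind of comparison is the TM-score, and the sketch below shows how it could be computed. It is an illustration under stated assumptions, not the I-TASSER implementation: the function names are hypothetical, the model is assumed to be already superposed on the reference structure (a full TM-score also searches for the best superposition), and the 0.5 cutoff is the commonly quoted threshold for two structures sharing the same fold.

```python
from typing import Sequence


def tm_score_d0(target_length: int) -> float:
    """Length-dependent distance scale d0 (in angstroms) of the TM-score."""
    # Assumption: a 0.5 A floor for short chains, where the formula
    # would otherwise give a very small or negative value.
    if target_length <= 15:
        return 0.5
    return max(0.5, 1.24 * (target_length - 15) ** (1.0 / 3.0) - 1.8)


def tm_score(distances: Sequence[float], target_length: int) -> float:
    """TM-score for one fixed superposition.

    distances: distance in angstroms between each aligned residue pair
               (model vs. reference) after superposition.
    target_length: number of residues in the reference protein.
    """
    if target_length <= 0:
        raise ValueError("target_length must be positive")
    d0 = tm_score_d0(target_length)
    # Each aligned pair contributes a value in (0, 1]; unaligned residues
    # contribute nothing, so the sum is normalized by the full target length.
    total = sum(1.0 / (1.0 + (d / d0) ** 2) for d in distances)
    return total / target_length


def has_similar_fold(score: float, threshold: float = 0.5) -> bool:
    """Hypothetical helper: apply the commonly used same-fold cutoff."""
    return score > threshold
```

  Under these assumptions, a model whose aligned residues all coincide with the reference scores 1.0, and the score falls toward 0 as the distances grow. Ranking library entries by such a score is one plausible way to pick the known proteins from which function is transferred.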