An algorithm for pathway expansion with protein-protein interaction data and gene ontology

来源 :第五届全国生物信息学与系统生物学学术大会 | 被引量 : 0次 | 上传用户:glx19891006
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  Background: Pathways provide important information about how genes interact with each other in a concerted way.Nevertheless, our knowledge about them is fragmented.Although biological experiments offer a reliable way for pathway expansion, they are expensive and time consuming.Several previously developed computational methods for pathway expansion focused on analysis and modeling of individual experimental datasets, and they seldom fully utilized the rapidly accumulated prior knowledge in their inferences.Hence, their pathway expansion results were often not satisfactory and also lacked sounding biological interpretations.Methods: In this study, we developed a knowledge-mining based algorithm for pathway expansion with protein-protein interaction data and gene ontology.First, we used proteinprotein interaction data documented in HPRD database to identify the interacting neighbor genes for a target gene, and used GO (Gene Ontology) structure to define the distances for measuring functional similarities between the target gene and its neighbors.Then, we found the nearest neighbor genes for the target gene.Since two genes that are very similar in GO functions are very likely to take part in the same pathways, we predicted the target genes pathways as its nearest neighbor genes.Results: Totally, we analyzed 3937 genes.On average, 64.51% of the known pathways were correctly predicted by our algorithm.Furthermore, we also evaluated the capability of our algorithm to predict novel pathways.Of seven genes whose pathways were unknown in March 15, 2011, four genes turned out to be consistent with our predictions, verified by using the updated knowledge released on March 18, 2012.Conclusions: This study shows that the proposed knowledge-mining based algorithm offers an alternative and improved avenue to expand the current pathways .
其他文献
  Background: Trans-action siRNA (ta-siRNA) is a type of small interfering RNA detected from plant and is reported to play an important role in post transcrip
  Background: Yersinia pestis is a highly pathogenic Gram-negative bacterium.Y.pestis infection causes three deadly diseases: pneumonic plague, septicemic pla
  Background: Given the sequenced fragments from a pair of chromosomes, the goal of the haplotype assembly problem is to reconstruct the two haplotypes for th
  Background: Nasopharyngeal Carcinoma (NPC) is one of the highest mortal malignancies around the world, and its etiology involves a number of sophisticated b
  Background: Bacterial persisters are a tiny fraction of preexisting dormant cells inside bacterial populations.Although isogenic with the rest of the popula
  Background: Scientific nomenclature is a system of words used to name things in a particular discipline.Therefore, accurate translation of scientific nomenc
  Background: Current sequencing technology (Illumina Solexa, Applied Biosystems SoLiD, and Helicos Biosciences Heliscope etc.) allows one to read millions of
  Background: Intrinsically disordered proteins (IDPs) that do not possess stable secondary and tertiary structures are crucial for the function of numerous p
  Background: Our aim is to study novel high expression activated PTHLH feedback network-mediated regulation of cell growth mechanism in HCC.Methods: We studi
会议