,Predicting enhancer-promoter interaction from genomic sequence with deep neural networks

来源 :定量生物学(英文版) | 被引量 : 0次 | 上传用户:zkry123
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Background:In the human genome,distal enhancers are involved in regulating target genes through proximal promoters by forming enhancer-promoter interactions.Although recently developed high-throughput experimental approaches have allowed us to recognize potential enhancer-promoter interactions genome-wide,it is still largely unclear to what extent the sequence-level information encoded in our genome help guide such interactions.Methods:Here we report a new computational method (named "SPEID") using deep leaing models to predict enhancer-promoter interactions based on sequence-based features only,when the locations of putative enhancers and promoters in a particular cell type are given.Results:Our results across six different cell types demonstrate that SPEID is effective in predicting enhancerpromoter interactions as compared to state-of-the-art methods that only use information from a single cell type.As a proof-of-principle,we also applied SPEID to identify somatic non-coding mutations in melanoma samples that may have reduced enhancer-promoter interactions in tumor genomes.Conclusions:This work demonstrates that deep leaing models can help reveal that sequence-based features alone are sufficient to reliably predict enhancer-promoter interactions genome-wide.
其他文献
Background:Next-generation sequencing (NGS) technologies have fostered an unprecedented proliferation of highthroughput sequencing projects and a concomitant de
芝麻(Sesamum indicum L.)隶属于胡麻科胡麻属,在亚热带和温带地区广泛种植,富含优质的蛋白质、维生素、脂肪酸和氨基酸,是最古老的油料作物。茎点枯病是芝麻最严重的真菌病害之一,病原菌寄主范围广、抗逆能力强,再加上抗药性菌株的出现,直接影响到芝麻的产量、品质以及机械化操作。近年来,杀菌剂的广泛使用,造成环境污染、农药残留和生态问题。要求植物病理研究者探索环境友好型的措施防治芝麻茎点枯病
Background: The coronavirus disease 2019 (COVID-19) is rapidly spreading in China and more than 30 countries over last two months.COVID-19 has multiple characte
Deep leaing is making major breakthrough in several areas of bioinformatics.Anticipating that this will occur soon for the single-cell RNA-seq data analysis,we
Background: RNA structure is the crucial basis for RNA function in various cellular processes.Over the last decade,high throughput structure profiling (SP) expe
增密是实现春玉米超高产的基本条件,但增密又加剧了玉米个体的衰老进程,而果穗大小不同的品种对增密的反应不同。试验选用高秆大穗型和中秆中穗型两种穗型品种,在不同种植密度(7.5-12.0万株/hm~2)条件下,从根冠两个方面探索超高产春玉米的衰老特性。试验结果如下:1.不同种植密度处理下,超高产春玉米花粒期不同层位叶片间表现为下位叶衰老早于上位叶和穗位叶,穗位叶衰老速度最慢;不同土层根系衰老均呈现空间
党的十三大提出要通过各种现代化的新闻和宣传工具,增加对政务和党务活动的报道,发挥舆论监督作用。这是我国实现政治民主化的措施之一。我们新闻工作者应当勇敢地承担起这
如何选择题材是新闻写作中首先碰到的一个重要问题。对新闻写作涉足不深的年轻记者、通讯员,往往苦于难以看准题材的新闻价值,漏报错报有之,小题大作和大题小作有之。这里从
Background: With the recent advance of sequencing technology,the collection of RNA expression (RNA-seq) data has been growing rapidly.RNA-seq data are statistic
Background:De novo genome assembly relies on two kinds of graphs:de Bruijn graphs and overlap graphs.Overlap graphs are the basis for the Celera assembler,while