,A censored-Poisson model based approach to the analysis of RNA-seq data

来源 :定量生物学(英文版) | 被引量 : 0次 | 上传用户:mlj1234567890
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Background: With the recent advance of sequencing technology,the collection of RNA expression (RNA-seq) data has been growing rapidly.RNA-seq data are statistically count-type measurements.Poisson distribution is a basic probability distribution for modeling count-type data.With Poisson regression models,various experimental factors,GC content as well as alteative splicing isoforms can be flexibly considered in RNA-seq data analysis.Due to the biochemical and technical limitations of sequencing technology,the biases among RNA-seq data have been recognized.Methods: In this study,an artificial censoring approach has been proposed to an isoform-specific Poisson regression model for analyzing RNA-seq data.Low expression values can be grouped (censored) into one probability category,and high expression values can also be grouped (censored) into another probability category.We have implemented the related Newton-Raphson numeric computing procedure to achieve the maximum likelihood estimation for our censored-Poisson regression model.The related mathematical simplifications have been derived for the consideration of stable and convenient numerical computing.Results: The advantages of our artificial censoring approach have been demonstrated in both simulation studies and application analysis of experimental data.Conclusions: Our proposed artificial censoring approach allows us to focus on the majority of data.As the extreme values (tails) of data are artificially censored,more efficient analysis results can be obtained,even from relatively simple Poisson regression models.Our proposed artificial censoring approach can certainly be considered for other well-developed models or methods for RNA-seq data analysis.
其他文献
2007年在新疆呼图壁县大丰镇一块连作八年的棉田上分别采用直播和春小麦收获后复播两种不同的种植模式种植绿肥,研究不同种植方式绿肥的生物量、植株养分含量及不同绿肥翻压后对土壤的养分供给状况;2008年在已翻压绿肥的土壤上全部种植棉花,研究不同绿肥茬口对棉花的生育性状及棉花产量的影响。主要研究结果如下:1、春播绿肥生物量对土壤肥力的影响:草木樨、毛苕子、沙打旺三种绿肥中均以草木樨的生物量和植株养分含量
Background:Since biological systems are complex and often involve multiple types of genomic relationships,tensor analysis methods can be utilized to elucidate t
Background:Next-generation sequencing (NGS) technologies have fostered an unprecedented proliferation of highthroughput sequencing projects and a concomitant de
芝麻(Sesamum indicum L.)隶属于胡麻科胡麻属,在亚热带和温带地区广泛种植,富含优质的蛋白质、维生素、脂肪酸和氨基酸,是最古老的油料作物。茎点枯病是芝麻最严重的真菌病害之一,病原菌寄主范围广、抗逆能力强,再加上抗药性菌株的出现,直接影响到芝麻的产量、品质以及机械化操作。近年来,杀菌剂的广泛使用,造成环境污染、农药残留和生态问题。要求植物病理研究者探索环境友好型的措施防治芝麻茎点枯病
Background: The coronavirus disease 2019 (COVID-19) is rapidly spreading in China and more than 30 countries over last two months.COVID-19 has multiple characte
Deep leaing is making major breakthrough in several areas of bioinformatics.Anticipating that this will occur soon for the single-cell RNA-seq data analysis,we
Background: RNA structure is the crucial basis for RNA function in various cellular processes.Over the last decade,high throughput structure profiling (SP) expe
增密是实现春玉米超高产的基本条件,但增密又加剧了玉米个体的衰老进程,而果穗大小不同的品种对增密的反应不同。试验选用高秆大穗型和中秆中穗型两种穗型品种,在不同种植密度(7.5-12.0万株/hm~2)条件下,从根冠两个方面探索超高产春玉米的衰老特性。试验结果如下:1.不同种植密度处理下,超高产春玉米花粒期不同层位叶片间表现为下位叶衰老早于上位叶和穗位叶,穗位叶衰老速度最慢;不同土层根系衰老均呈现空间
党的十三大提出要通过各种现代化的新闻和宣传工具,增加对政务和党务活动的报道,发挥舆论监督作用。这是我国实现政治民主化的措施之一。我们新闻工作者应当勇敢地承担起这
如何选择题材是新闻写作中首先碰到的一个重要问题。对新闻写作涉足不深的年轻记者、通讯员,往往苦于难以看准题材的新闻价值,漏报错报有之,小题大作和大题小作有之。这里从