,Imputation of single-cell gene expression with an autoencoder neural network

来源 :定量生物学(英文版) | 被引量 : 0次 | 上传用户:lp999999
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
Background: Single-cell RNA-sequencing (scRNA-seq) is a rapidly evolving technology that enables measurement of gene expression levels at an unprecedented resolution.Despite the explosive growth in the number of cells that can be assayed by a single experiment,scRNA-seq still has several limitations,including high rates of dropouts,which result in a large number of genes having zero read count in the scRNA-seq data,and complicate downstream analyses.Methods: To overcome this problem,we treat zeros as missing values and develop nonparametric deep leaing methods for imputation.Specifically,our LATE (Leaing with AuToEncoder) method trains an autoencoder with random initial values of the parameters,whereas our TRANSLATE (TRANSfer leaing with LATE) method further allows for the use of a reference gene expression data set to provide LATE with an initial set of parameter estimates.Results: On both simulated and real data,LATE and TRANSLATE outperform existing scRNA-seq imputation methods,achieving lower mean squared error in most cases,recovering nonlinear gene-gene relationships,and better separating cell types.They are also highly scalable and can efficiently process over 1 million cells in just a few hours on a GPU.Conclusions: We demonstrate that our nonparametric approach to imputation based on autoencoders is powerful and highly efficient.
其他文献
Background: E2F1 protein,a major effector of the Rb/E2F pathway plays a central role in regulating cell-fate decisions involved in proliferation,apoptosis,and d
自旋交叉配合物的研究是分子磁化学中的一个重要领域 ,已引起人们的普遍关注 .近期我们设计、制备了配体 2 methyl 1,4,8,9 tetraaza triphenylene (mtt) ,并以此配体合成了
Nanocarbon-poly(methyl methacrylate)sols were prepared by pulsed laser ablation at the interface of target submerged in flowing liquid(PLA-IT/SFL)method,and the
Background:Ab initio protein structure prediction is to predict the tertiary structure of a protein from its amino acid sequence alone.As an important topic in
我们采用抽样调查的方法 ,从《现代汉语词典》 5 4 5— 5 6 7页选取 2 2个动词 (为了使分析全面一些 ,另外选取了“存在、发生、消失、应该、是、去、起来”等 7个动词 ) ,以
Background:In this work,we study two seemingly unrelated aspects of core genetic nonlinear dynamical control of the competence phenotype in Bacillus subtilis,a
热烈庆祝“香港中华国际人体科学择业研究院”和“天山圣林国际择业文化传播有限公司”正式成立!2013年7月1日,伴随着“圣林择业”和《天人一体择业法》风雨兼程21载后,根据
Background:Quantitative systems pharmacology (QSP) is an emerging discipline that integrates diverse data to quantitatively explore the interactions between dru
不久前当选为第七届全国人大代表、安徽省人大常委副主任的杨纪珂教授,即将赴京出席全国人大之际,在他的寓所对记者说,他希望我国能尽快确立新闻记者的旁听权和发表权。作为