Sparse multivariate factor analysis regression model

来源 :The 24th International Workshop on Matrices and Statistics(第 | 被引量 : 0次 | 上传用户:csss2
下载到本地 , 更方便阅读
声明 : 本文档内容版权归属内容提供方 , 如果您对本文有版权争议 , 可与客服联系进行内容授权或下架
论文部分内容阅读
  The multivariate regression model is a useful tool to explore complex associations between multiple response variables (e.g. gene expressions) and multiple predictors (e.g. SNPs). When the multiple responses are correlated, ignoring such dependency will impair statistical power in the data analysis. Motivated by an integrative genomic data, we propose a new methodology-sparse multivariate factor analysis regression model (smFARM), in which the covariance of the response variables is modeled by a factor analysis model with latent factors. This proposed method not only allows us to address the challenge that the number of genetic predictors is larger than the sample size, but also to adjust for unobserved genetic and/or non-genetic factors that potentially conceal the underlying real response-predictor associations. The proposed smFARM is implemented efficiently by utilizing the strength of the EM algorithm and the group-wise coordinate descend algorithm. In addition, the identified latent factors are explained by the means of gene enrichment analysis. The proposed methodology is evaluated and compared to the existing methods through extensive simulation studies. We apply smFARM in an integrative genomics analysis of a breast cancer dataset on the relationship between DNA copy numbers and gene expression arrays to derive genetic regulatory patterns relevant to breast cancer.
其他文献
Promoter strength, or activity, is important in genetic engineering and synthetic biology.A constitutive promoter with a certain strength for one given RNA can
会议
  We introduce a partially linear single-index proportional hazards model with current status data. We consider efficient estimations and effective algorithms
会议
  Cell polarization toward the attractant is related to both physical and chemical factors.Most existing mathematical models are based on reaction diffusion s
会议
Bacteria living in confined geometries are commonly found in clinical and natural environments. We are particularly interested in the behavior of bacteria confi
会议
The first part of the talk focuses on the mechanical principle that a single bacterium uses to propel itself. We show that though widely-accepted resistive-forc
会议
  I will consider estimation and prediction problems in generalized linear models when there are a number of predictors and some of them may have no and/or we
  What can one say on convergence to stationarity of a finite state Markov chain that behaves "locally" like a nearest neighbor random walk on the integer lat
会议
  For a given continuous random variable X with cdf F(x), it is requested, in resampling technique, to construct a discrete random variable Y with probability
Chemotaxis is the phenomenon in which cells direct their motion according to a chemical present in their environment. Since experimental observations have shown
会议
  Many happy returns, Simo! To celebrate over 25 years of collaboration, we present an indexed and illustrated bibliography on the occasion of your 70th birth
会议