Sparse Canonical Correlation Analysis for Multiple Measurements with Latent Trajectories

Nuria Senar,Aeilko H. ZwindermanTop Scholar, Michel H. Hof and

arXiv · Methodology（2025）

Cited 0|Views1

Abstract

Canonical Correlation Analysis, CCA, is a widely used multivariate method in omics research for integrating high dimensional datasets. CCA identifies hidden links by deriving linear projections of features maximally correlating datasets. For standard CCA, observations must be independent of each other. As a result, it cannot properly deal with repeated measurements. Current CCA extensions dealing with these challenges either perform CCA on summarized data or estimate correlations for each measurement. While these techniques factor in the correlation between measurements, they are sub-optimal for high dimensional analysis and exploiting this datas longitudinal qualities. We propose a novel extension of sparse CCA that incorporates time dynamics at the latent level through longitudinal models. This approach addresses the correlation of repeated measurements while drawing latent paths, focusing on dynamics in the correlation structures. To aid interpretability and computational efficiency, we implement a penalty to enforce fixed sparsity levels. We estimate these trajectories fitting longitudinal models to the low dimensional latent variables, leveraging the clustered structure of high dimensional datasets, thus exploring shared longitudinal latent mechanisms. Furthermore, modeling time in the latent space significantly reduces computational burden. We validate our models performance using simulated data and show its real world applicability with data from the Human Microbiome Project. Our CCA method for repeated measurements enables efficient estimation of canonical correlations across measurements for clustered data. Compared to existing methods, ours substantially reduces computational time in high dimensional analyses as well as provides longitudinal trajectories that yield interpretable and insightful results.

Translated text

Bibtex

AI Read Science

Must-Reading Tree

Example

Generate MRT to find the research sequence of this paper

Data Disclaimer

The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn

Chat Paper

【要点】：本文提出了一种稀疏规范相关性分析（Sparse CCA）的新方法，通过在潜在轨迹中引入时间动态模型，以处理具有潜在轨迹的重复测量数据，同时提高了计算效率和结果的解释性。

【方法】：该方法通过结合纵向模型将时间动态引入潜在变量，并采用惩罚函数来实现固定稀疏度，以估计低维潜在变量的轨迹。

【实验】：研究通过模拟数据和人类微生物组项目（Human Microbiome Project）的实际数据验证了模型性能，证明了该方法在处理重复测量数据集上的高效性，以及在高维分析中显著降低计算时间的同时提供了可解释的纵向轨迹。

去 AI 文献库对话