WeChat Mini Program
Old Version Features

Estimating Evolutionary and Demographic Parameters Via ARG-derived IBD

PLOS GENETICS(2025)

Univ Melbourne | Univ Oxford

Cited 0|Views7
Abstract
Inference of evolutionary and demographic parameters from a sample of genome sequences often proceeds by first inferring identical-by-descent (IBD) genome segments. By exploiting efficient data encoding based on the ancestral recombination graph (ARG), we obtain three major advantages over current approaches: (i) no need to impose a length threshold on IBD segments, (ii) IBD can be defined without the hard-to-verify requirement of no recombination, and (iii) computation time can be reduced with little loss of statistical efficiency using only the IBD segments from a set of sequence pairs that scales linearly with sample size. We first demonstrate powerful inferences when true IBD information is available from simulated data. For IBD inferred from real data, we propose an approximate Bayesian computation inference algorithm and use it to show that even poorly-inferred short IBD segments can improve estimation. Our mutation-rate estimator achieves precision similar to a previously-published method despite a 4 000-fold reduction in data used for inference, and we identify significant differences between human populations. Computational cost limits model complexity in our approach, but we are able to incorporate unknown nuisance parameters and model misspecification, still finding improved parameter inference.
More
Translated text
Key words
Adaptive Evolution,Phylogenetic Analysis
PDF
Bibtex
AI Read Science
AI Summary
AI Summary is the key point extracted automatically understanding the full text of the paper, including the background, methods, results, conclusions, icons and other key content, so that you can get the outline of the paper at a glance.
Example
Background
Key content
Introduction
Methods
Results
Related work
Fund
Key content
  • Pretraining has recently greatly promoted the development of natural language processing (NLP)
  • We show that M6 outperforms the baselines in multimodal downstream tasks, and the large M6 with 10 parameters can reach a better performance
  • We propose a method called M6 that is able to process information of multiple modalities and perform both single-modal and cross-modal understanding and generation
  • The model is scaled to large model with 10 billion parameters with sophisticated deployment, and the 10 -parameter M6-large is the largest pretrained model in Chinese
  • Experimental results show that our proposed M6 outperforms the baseline in a number of downstream tasks concerning both single modality and multiple modalities We will continue the pretraining of extremely large models by increasing data to explore the limit of its performance
Try using models to generate summary,it takes about 60s
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper

要点】:本文提出了一种基于祖先重组图(ARG)数据编码的新方法,通过推导出的身份一致(IBD)片段来估计进化与人口参数,无需对IBD片段长度设限,无需验证无重组要求,同时显著减少了计算时间,提高了统计效率。

方法】:研究利用ARG衍生的IBD推断方法,通过高效的编码手段,避免了传统方法中对IBD片段长度阈值和重组的限制,并在模拟数据上展示了强大的推断能力。

实验】:研究者在模拟数据上验证了方法的准确性,并使用近似贝叶斯计算算法处理真实数据中的IBD推断,实验结果显示,该方法即便在推断质量较差的短IBD片段上也能提高估计精度,所提出的突变率估计器的精确度与先前方法相当,但所需数据量减少了4000倍。此外,研究者还发现了不同人类群体之间的显著差异。尽管计算成本限制了模型复杂度,但方法仍能整合未知干扰参数和模型设定偏差,实现了参数推断的改进。