WeChat Mini Program
Old Version Features

Robust and Automatic Beamstop Shadow Outlier Rejection: Combining Crystallographic Statistics with Modern Clustering under a Semi-Supervised Learning Strategy

Acta Crystallographica Section D-Structural Biology(2024)SCI 3区SCI 2区

Univ Hamburg

Cited 0|Views1
Abstract
During the automatic processing of crystallographic diffraction experiments, beamstop shadows are often unaccounted for or only partially masked. As a result of this, outlier reflection intensities are integrated, which is a known issue. Traditional statistical diagnostics have only limited effectiveness in identifying these outliers, here termed Not-Excluded-unMasked-Outliers (NEMOs). The diagnostic tool AUSPEX allows visual inspection of NEMOs, where they form a typical pattern: clusters at the low-resolution end of the AUSPEX plots of intensities or amplitudes versus resolution. To automate NEMO detection, a new algorithm was developed by combining data statistics with a density-based clustering method. This approach demonstrates a promising performance in detecting NEMOs in merged data sets without disrupting existing data-reduction pipelines. Re-refinement results indicate that excluding the identified NEMOs can effectively enhance the quality of subsequent structure-determination steps. This method offers a prospective automated means to assess the efficacy of a beamstop mask, as well as highlighting the potential of modern pattern-recognition techniques for automating outlier exclusion during data processing, facilitating future adaptation to evolving experimental strategies.
More
Translated text
Key words
X-ray crystallography,AUSPEX,outliers,statistics,clustering,semi-supervised learning
求助PDF
上传PDF
Bibtex
AI Read Science
AI Summary
AI Summary is the key point extracted automatically understanding the full text of the paper, including the background, methods, results, conclusions, icons and other key content, so that you can get the outline of the paper at a glance.
Example
Background
Key content
Introduction
Methods
Results
Related work
Fund
Key content
  • Pretraining has recently greatly promoted the development of natural language processing (NLP)
  • We show that M6 outperforms the baselines in multimodal downstream tasks, and the large M6 with 10 parameters can reach a better performance
  • We propose a method called M6 that is able to process information of multiple modalities and perform both single-modal and cross-modal understanding and generation
  • The model is scaled to large model with 10 billion parameters with sophisticated deployment, and the 10 -parameter M6-large is the largest pretrained model in Chinese
  • Experimental results show that our proposed M6 outperforms the baseline in a number of downstream tasks concerning both single modality and multiple modalities We will continue the pretraining of extremely large models by increasing data to explore the limit of its performance
Upload PDF to Generate Summary
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Related Papers
1985

被引用11335 | 浏览

2006

被引用5573 | 浏览

E STANLEY
1972

被引用62 | 浏览

Wolfgang Kabsch
2010

被引用13581 | 浏览

Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper

要点】:本文提出了一种结合晶体学统计与现代聚类方法,在半监督学习策略下的自动剔除beamstop阴影引起的异常值的新算法,提高了晶体学衍射数据处理的质量。

方法】:通过将晶体学数据统计与基于密度的聚类方法相结合,开发出了一种新的算法来自动检测并排除在自动处理晶体学衍射实验中由beamstop阴影引起的异常值。

实验】:使用AUSPEX工具对NEMOs进行可视化检测,并在合并的数据集上应用新算法,实验结果表明排除识别的NEMOs能显著提高后续结构测定步骤的质量。文中未提及具体的数据集名称。