Machine Learning of Pyrite Geochemistry Reconstructs the Multi-Stage History of Mineral Deposits

Pengpeng Yu, Yuan Liu, Hanyu Wang, Xi Chen, Yi Zheng, Wei Cao, Yiqu Xiong, Hongxiang Shan

Geoscience Frontiers (2025)


Abstract

Machine-learning discrimination of pyrite provides a robust basis for reconstructing the ore-forming history of multi-stage deposits; however, published models are hampered by limited, imbalanced datasets and by oversampling. In this study, the dataset was expanded to approximately 500 samples per type, comprising 508 sedimentary, 573 orogenic gold, 548 sedimentary exhalative (SEDEX), and 364 volcanogenic massive sulfide (VMS) pyrites, and random forest (RF) and support vector machine (SVM) classifiers were trained on it to improve model reliability. The RF classifier achieved an overall accuracy of 99.8% and the SVM classifier 100%. Under five-fold cross-validation, accuracy was 93.8% for the RF classifier and 94.9% for the SVM classifier. These results demonstrate the feasibility of pyrite classification, supported by a relatively large, balanced dataset and high accuracy. The classifiers were then applied to the controversial Keketale Pb-Zn deposit in NW China, whose genesis has been variously interpreted as SEDEX, VMS, or a SEDEX-VMS transition. Petrographic investigation showed that the deposit comprises early fine-grained layered pyrite (Py1) and late recrystallized pyrite (Py2). Majority voting classified Py1 as VMS type, with accuracies of 72.2% (RF) and 75% (SVM), and Py2 as orogenic type, with accuracies of 74.3% (RF) and 77.1% (SVM). These findings indicate that the Keketale deposit originated from a submarine VMS mineralization system and was later overprinted by orogenic-type metamorphism and deformation, consistent with geological and geochemical observations. This study further highlights the advantages of machine learning (ML) methods for accurately and directly discriminating deposit types and reconstructing the formation history of multi-stage deposits.
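The majority-voting step mentioned in the abstract can be illustrated with a minimal sketch. It assumes that each spot analysis of a pyrite generation receives one predicted class label from a trained classifier, and that the reported percentage is the share of analyses assigned to the winning class; this reading is an interpretation of the abstract, not a confirmed description of the authors' procedure. The function name `majority_vote` and the label counts below are hypothetical, chosen only so that the resulting share matches the 72.2% reported for Py1 under RF; the abstract does not state how many analyses were made.

```python
from collections import Counter


def majority_vote(predicted_labels):
    """Return (winning_label, vote_share) for per-analysis class labels.

    Hypothetical helper: the abstract does not describe the authors' code.
    Ties resolve to the label that appears first in the input.
    """
    if not predicted_labels:
        raise ValueError("need at least one prediction")
    counts = Counter(predicted_labels)
    label, votes = counts.most_common(1)[0]
    return label, votes / len(predicted_labels)


# Illustrative per-spot predictions for one pyrite generation (assumed counts,
# not data from the paper): 26 of 36 analyses assigned to VMS.
py1_rf = ["VMS"] * 26 + ["SEDEX"] * 6 + ["sedimentary"] * 3 + ["orogenic"] * 1

label, share = majority_vote(py1_rf)
print(f"Py1 (RF): {label}, {share:.1%} of analyses")
# prints "Py1 (RF): VMS, 72.2% of analyses"
```

Under this reading, the vote share says how consistently the individual analyses agree on a deposit type; it is a different quantity from the cross-validated accuracy measured on labeled training data.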


Key words

Machine learning, Random forest, Support vector machine, Pyrite, Multi-stage genesis, Keketale deposit
