Machine Learning of Pyrite Geochemistry Reconstructs the Multi-Stage History of Mineral Deposits
Geoscience Frontiers(2025)
Abstract
The application of machine learning for pyrite discrimination establishes a robust foundation for constructing the ore-forming history of multi-stage deposits; however, published models face challenges related to limited, imbalanced datasets and oversampling. In this study, the dataset was expanded to approximately 500 samples for each type, including 508 sedimentary, 573 orogenic gold, 548 sedimentary exhalative (SEDEX) deposits, and 364 volcanogenic massive sulfides (VMS) pyrites, utilizing random forest (RF) and support vector machine (SVM) methodologies to enhance the reliability of the classifier models. The RF classifier achieved an overall accuracy of 99.8%, and the SVM classifier attained an overall accuracy of 100%. The model was evaluated by a five-fold cross-validation approach with 93.8% accuracy for the RF and 94.9% for the SVM classifier. These results demonstrate the strong feasibility of pyrite classification, supported by a relatively large, balanced dataset and high accuracy rates. The classifier was employed to reveal the genesis of the controversial Keketale Pb-Zn deposit in NW China, which has been inconclusive among SEDEX, VMS, or a SEDEX-VMS transition. Petrographic investigations indicated that the deposit comprises early fine-grained layered pyrite (Py1) and late recrystallized pyrite (Py2). The majority voting classified Py1 as the VMS type, with an accuracy of RF and SVM being 72.2% and 75%, respectively, and confirmed Py2 as an orogenic type with 74.3% and 77.1% accuracy, respectively. The new findings indicated that the Keketale deposit originated from a submarine VMS mineralization system, followed by late orogenic-type overprinting of metamorphism and deformation, which is consistent with the geological and geochemical observations. This study further emphasizes the advantages of Machine learning (ML) methods in accurately and directly discriminating the deposit types and reconstructing the formation history of multi-stage deposits.
MoreTranslated text
Key words
Machine learning,Random forest,Support vector machine,Pyrite,Multi-stage genesis,Keketale deposit
求助PDF
上传PDF
View via Publisher
AI Read Science
AI Summary
AI Summary is the key point extracted automatically understanding the full text of the paper, including the background, methods, results, conclusions, icons and other key content, so that you can get the outline of the paper at a glance.
Example
Background
Key content
Introduction
Methods
Results
Related work
Fund
Key content
- Pretraining has recently greatly promoted the development of natural language processing (NLP)
- We show that M6 outperforms the baselines in multimodal downstream tasks, and the large M6 with 10 parameters can reach a better performance
- We propose a method called M6 that is able to process information of multiple modalities and perform both single-modal and cross-modal understanding and generation
- The model is scaled to large model with 10 billion parameters with sophisticated deployment, and the 10 -parameter M6-large is the largest pretrained model in Chinese
- Experimental results show that our proposed M6 outperforms the baseline in a number of downstream tasks concerning both single modality and multiple modalities We will continue the pretraining of extremely large models by increasing data to explore the limit of its performance
Upload PDF to Generate Summary
Must-Reading Tree
Example

Generate MRT to find the research sequence of this paper
Data Disclaimer
The page data are from open Internet sources, cooperative publishers and automatic analysis results through AI technology. We do not make any commitments and guarantees for the validity, accuracy, correctness, reliability, completeness and timeliness of the page data. If you have any questions, please contact us by email: report@aminer.cn
Chat Paper
Summary is being generated by the instructions you defined