Poster
in
Workshop: AI for Science: Scaling in AI for Scientific Discovery
Boost your crystal model with denoising pre-training
Shuaike Shen · Ke Liu · Muzhi Zhu · Hao Chen
Keywords: [ Crystal property predication ] [ Denoising pre-training ]
Crystals play a vital role in a wide range of materials, influencing both cutting-edge technologies and everyday applications. Recently, deep learning approaches for crystal property prediction have shown exceptional performance, driving significant progress in material discovery.However, supervised approaches can only be trained on labeled data and the number of data points varies for different properties. Making full use of unlabeled data remains an ongoing challenge.To address this issue, we propose an unsupervised Denoising Pre-training Framework (DPF) for crystal structure. DPF trains a model to reconstruct the original crystal structure from recover the masked atom types, perturbed atom positions, and perturbed crystal lattices.Through the pre-training, models learn the intrinsic features of crystal structures and capture the key features influencing crystal properties.We pre-train models on 380,743 unlabeled crystal structures and fine-tune them on downstream property prediction benchmarks. Extensive experiments demonstrate the effectiveness of our denoising pre-training framework.