Data from: Probabilistic Phase Labeling and Lattice Refinement for Autonomous Materials Research
X-ray diffraction (XRD) is essential for determining a material’s crystal structure in high-throughput experimentation, and is widely being incorporated in artificially intelligent agents for autonomous scientific discovery. However, rapid, automated and reliable analysis of XRD data at rates that match the rate of experimental measurements at synchrotron sources remains a major challenge. To address these issues, we developed CrystalShift for rapid and efficient probabilistic XRD phase labeling employing symmetry-constrained optimization, best-first tree search, and Bayesian model comparison. The algorithm estimates probabilities for phase combinations without requiring additional phase space information or training. We demonstrate that CrystalShift provides robust probability estimates, outperforming existing methods on synthetic and experimental datasets, and can be readily integrated into high-throughput experimental workflows. In addition to efficient phase labeling, CrystalShift offers quantitative insights into materials’ structural parameters, which facilitate both expert evaluation and AI-based modeling of the phase space, ultimately accelerating materials identification and discovery.
Facility, an NNCI member supported by NSF Grant NNCI-2025233. Additionally, this research was conducted with support from the Cornell University Center for Advanced Computing.