Fig. 5

Prediction map showing the probability of S. japonicum infection across the entire study area, generated from the top-performing environmental data model. The top-performing model was defined as the one with the highest Cohen’s kappa, accuracy, and area under the receiver operating characteristic (ROC) curve (AUC). Cohen’s kappa and accuracy indicated that the open-source environmental data models outperformed the snail data models.