Figure 3:

LiSSI: Pathogenicity classification performance results. A Receiver Operating Characteristics (ROC) plots shows the performance of the classification models using genes as features to distinguish pathogenic actinobacteria from non-pathogenic ones. The data was evaluated five times using different 5-fold cross-validation sets to assess the robustness of the classifiers. The real label classifier curves are presented as dark-blue solid lines, while the random label classifiers are depicted as light-blue dashed lines (the ones close to the baseline). The variation of the AUCs (area under curve) in the cross-validation was included in the figure as a box-plot (bottom right). The numbers below each box-plot are the lower and upper quartiles.

© De Gruyter