Artificial Intelligence and Informatics - Original Article

Prediction of carcinogenic human papillomavirus types in cervical cancer from multiparametric magnetic resonance images with machine learning-based radiomics models

10.4274/dir.2022.221335

  • Okan İnce
  • Emre Uysal
  • Görkem Durak
  • Suzan Önol
  • Binnur Dönmez Yılmaz
  • Şükrü Mehmet Ertürk
  • Hakan Önder

Received Date: 11.01.2022 Accepted Date: 16.11.2022 Diagn Interv Radiol 0;0(0):0-0 [e-Pub]

PURPOSE

This study aimed to evaluate the potential of machine learning-based models for predicting carcinogenic human papillomavirus (HPV) oncogene types using radiomics features from magnetic resonance imaging (MRI).

METHODS

Pre-treatment MRI images of patients with cervical cancer were collected retrospectively. An HPV DNA oncogene analysis was performed based on cervical biopsy specimens. Radiomics features were extracted from contrast-enhanced T1-weighted images (CE-T1) and T2-weighted images (T2WI). A third feature subset was created as a combined group by concatenating the CE-T1 and T2WI subsets. Feature selection was performed using Pearson’s correlation coefficient and wrapper- based sequential-feature selection. Two models were built with each feature subset, using support vector machine (SVM) and logistic regression (LR) classifiers. The models were validated using a five-fold cross-validation technique and compared using Wilcoxon’s signed rank and Friedman’s tests.

RESULTS

Forty-one patients were enrolled in the study (26 were positive for carcinogenic HPV oncogenes, and 15 were negative). A total of 851 features were extracted from each imaging sequence. After feature selection, 5, 17, and 20 features remained in the CE-T1, T2WI, and combined groups, respectively. The SVM models showed 83%, 95%, and 95% accuracy scores, and the LR models revealed 83%, 81%, and 92.5% accuracy scores in the CE-T1, T2WI, and combined groups, respectively. The SVM algorithm performed better than the LR algorithm in the T2WI feature subset (P = 0.005), and the feature sets in the T2WI and the combined group performed better than CE-T1 in the SVM model (P = 0.033 and 0.006, respectively). The combined group feature subset performed better than T2WI in the LR model (P = 0.023).

CONCLUSION

Machine learning-based radiomics models based on pre-treatment MRI can detect carcinogenic HPV status with discriminative accuracy.

Keywords: Artificial intelligence, human papillomavirus DNA tests, machine learning, radiology, uterine cervical neoplasms