ABSTRACT

PURPOSE

This study aims to evaluate the effectiveness of the Kaiser score (KS) system in assessing breast lesions with uncertain malignant potential (B3).

METHODS

Breast magnetic resonance imaging (MRI) scans from a total of 76 patients with histologically proven B3 lesions were included in this study. The KS was recorded for each MRI scan. The patients were classified based on biopsy results, and upgraded lesions were identified. Statistical analysis was conducted to evaluate the association between high KS values and upgraded lesions.

RESULTS

The mean age of the 76 patients was calculated as 49.6 ± 10.1. A significant association was observed between the KS system and the prediction of malignancy upgrade (P < 0.001). Furthermore, among the descriptors, spiculation, margin, and upgrading prediction demonstrated a statistically significant difference (P < 0.001). Additionally, the specificity improved when the accepted KS cut-off value was set at seven instead of five. A significant association was also observed between the KS system and the papilloma upgrade rate within the B3 lesion subgroups (P < 0.001).

CONCLUSION

Breast radiology plays a crucial role in the diagnosis of B3 lesions. Our findings suggest that the KS system holds promise as a tool for predicting the upgrade potential of B3 lesions.

CLINICAL SIGNIFICANCE

This study demonstrated that the KS system may assist in predicting the upgrade potential of B3 breast lesions. It also demonstrated that spiculation and margin descriptors within the KS system possess a high positive predictive value for upgrade prediction. Additionally, we believe that the KS system can help prevent unnecessary surgeries in patients with B3 lesions.

Keywords:

B3 breast lesion, Kaiser score system, breast magnetic resonance imaging, breast, magnetic resonance imaging

Main points

• The Kaiser score (KS) system may help predict uncertain malignant potential (B3) breast lesions upgrade.

• Spiculation and margin identifiers in the KS system have a high positive predictive value in upgrade prediction.

• Unnecessary surgeries can be avoided in cases diagnosed with B3 lesions by using the KS system.

Uncertain malignant potential lesions (B3) of the breast can be classified as atypical ductal hyperplasia (ADH), radial scar, papillary lesions, lobular neoplasia (LN), and flat epithelial hyperplasia (FEH). These lesions are commonly characterized by an increased lifetime risk of breast cancer in women.^1-3 Due to the heterogeneity of high-risk lesion groups, upgrade rates for high-risk breast lesions have varied in the literature, ranging from 6% to 32%.^4-6

The management of B3 lesions is determined by pathological findings, patient age, risk factors, and the type of biopsy performed. Radiological-pathological discordance remains one of the key criteria for excision.^7-9

The Kaiser score (KS) system is a decision-making tool in parallel with the Breast Imaging-Reporting and Data System (BI-RADS) classification, considering the morphological and dynamic features in breast magnetic resonance imaging (MRI).^{10, 11} Dietzel and Baltzer¹⁰ developed a clinical decision tool originally referred to as the Tree flowchart and later renamed the KS after Werner A. Kaiser’s contributions to its development. Additionally, they published an essay that included a practical guide for the interpretation of breast MRI examinations using the KS.¹⁰ Their contribution has been further extended with a recently published article in which they emphasized that the KS served as an evidence-based decision-making tool to objectively differentiate between benign and malignant breast lesions.¹²

This study aims to evaluate the upgrade potential of high-risk breast lesions and to determine the role of the KS in avoiding potentially unnecessary surgical excisions.

Methods

This retrospective study was approved according to the principles of the Declaration of Helsinki by the Ethics Committee of the Bezmialem Vakıf University (approval no: E-54022451-050.01.04-3208, date: 20.10.2021), and all participants signed a written informed consent form.

Patient selection

Patients’ core biopsy-proven B3 lesions, collected between 2016 and 2021, were retrieved from the archives. Initially, 130 patients were reviewed. Among these, only 81 had MRI scans available in the system. Of the 81 patients, 5 were excluded due to the absence of pathological contrast enhancement on MRI. Patients with excision results or a follow-up period of at least 2 years (24–72 months) were included in this study. Based on these criteria, a total of 76 patients were considered eligible for this study. The age, risk status, and complaints of the patients were recorded.

Magnetic resonance imaging acquisition and image interpretation

All breast MRI scans were conducted using a 1.5 T scanner (Siemens Magnetom Avanto Fit, Siemens Healthineers; Erlangen, Germany) with a bilateral 16-channel breast coil in the prone position. Apparent diffusion coefficient (ADC) maps, subtraction, and maximum intensity projection images were acquired. Axial T2-weighted fat-suppressed imaging [repetition time (TR)/echo time (TE): 4560/59 ms; slice thickness: 4 mm, matrix: 340 × 512], axial T1-weighted imaging (TR/TE: 571/11 ms; slice thickness: 4 mm, matrix: 340 × 512), one precontrast and five postcontrast 3D T1 turbo spin-echo imaging (TR/TE: 5.16/2.38 ms; flip angle: 100, slice thickness: 1 mm), and diffusion-weighted imaging (b-values: 0–800 s/mm²) series were obtained. The gadolinium-based contrast agent was administered at 0.1 mmol/kg using a mechanical power injector, followed by a 15–20 cm³ saline flush.

Two breast radiologists evaluated all the images using the Siemens Syngo Via (Erlangen, Germany) workstation. They were blinded to clinical data and histopathology results. The KS was assigned via the online version to the patients after reaching consensus. Descriptors evaluated in the KS were spiculation, dynamic enhancement curves, margins, internal enhancement, and edema around the lesion. Using the KS, the patients were scored from 1 to 11. A score of 5–7 was categorized as BI-RADS 4, and a score of 8–11 was categorized as BI-RADS 5 and considered positive. Optional moderators were noted, such as evidence of microcalcification overlapping the area of contrast and ADC values. The cut-off value was >1.4 ×10^-3mm²/s as recommended in the KS.

Histopathological evaluation

The patients’ diagnoses were obtained using one of the following methods: tru-cut biopsy under ultrasonographic guidance (n = 59), vacuum-assisted biopsy (VAB) under mammographic guidance (n = 10), or biopsy under MRI guidance (n = 7). On average, 3–4 samples were obtained for tru-cut biopsies using a 14-gauge needle. The results of core biopsy, surgical excision, or follow-up evaluations were analyzed. Cases with an upgrade to ductal carcinoma in situ (DCIS) following excision, including those with progression detected during follow-up, were considered positive.

Statistical analysis

Statistical analysis was performed using SPSS software (IBM Corp. Released 2021. IBM SPSS Statistics for Windows, Version 28.0. Armonk, NY, USA). In addition to descriptive statistics [mean ± standard deviation for continuous variables, frequencies with percentages for categorical variables, and area under the receiver operating characteristic (ROC) curve (AUC) with standard error], the Shapiro–Wilk test was used to assess the distribution of the data. Comparisons of KS descriptors and upgrade rates were performed using Fisher’s exact test and the Fisher–Freeman–Halton test. ROC analysis was performed using MedCalc version 12 to assess the overall diagnostic performance of KS in predicting progression, and the optimal cut-off value was determined using Youden’s J index. Differences in KS and upgrade rates among high-risk lesion subgroups (ADH, radial scar, atypical papillomas, LCIS, LN, and FEH) were evaluated. Sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) were calculated. A type 1 error rate of α = 0.05 was considered statistically significant.

Results

A total of 76 patients were evaluated. Thirty patients were classified as high-risk due to a personal history of breast cancer (n = 8) or a family history of breast cancer in immediate relatives (n = 22). Clinical findings such as pain (n = 12), palpable mass (n = 7), and bloody nipple discharge (n = 6) were present in 33% of the patients. The mean age of the patients was calculated as 49.6 ± 10.1.

The histopathological distribution of lesions is presented in Table 1. Among the 76 B3 lesions, papilloma was the most common diagnosis (47.4%), followed by FEH (23.7%), radial scar (15.8%), LN (6.6%), and ADH (6.6%).

In the follow-up cases (n = 40), no progression was observed. During follow-up, the lesions remained stable in 33 patients (82.5%) and regressed in size in 7 patients (17.5%). Surgical excision was performed in 36 patients (47.3%), and DCIS was detected in 12 of 76 patients following excision. No upgrade to invasive cancer was identified. The overall upgrade rate in patients with B3 lesions was 16%.

Based on MRI results, non-mass enhancement was observed in 34 cases (44.7%), whereas mass enhancement was observed in 42 cases (55.2%). There was no statistically significant difference between upgraded lesions and MRI enhancement patterns (P > 0.050).

In the evaluation of optional moderators (suspicious microcalcifications and high ADC values), seven patients had microcalcifications overlapping with the contrast-enhanced area on MRI. Based on the KS, two points were added to these patients. An upgrade was detected in four of them.

No high ADC values were identified in the evaluation of lesions that would warrant a four-point reduction in the KS. All lesion ADC values were below 1.4 × 10^-3 mm²/s. The KS and MRI findings for the upgraded lesions are presented in Table 2.

A positive KS was a significant predictor of lesion upgrade status (P < 0.001). In patients with a KS exceeding 5, the sensitivity and specificity for predicting an upgrade were 81.25% and 83.33%, respectively. The NPV, PPV, and overall accuracy were 94.34%, 56.52%, and 82.89%, respectively (Table 3).

When the KS cut-off value was set at 7, the sensitivity, specificity, and accuracy for predicting an upgrade were 68.75%, 98.3%, and 80.2%, respectively, with an AUC of 0.86 and a standard error of 0.07 (95% confidence interval: 0.76–0.93, P < 0.001). Additionally, a cut-off value of >6 was identified (Figure 1).

Evaluation of MRI findings showed that spiculation was a significant predictor of lesion upgrade (P < 0.001). Furthermore, the subgroups of B3 lesions were analyzed. Among these subgroups, the KS was a significant predictor of the upgrade rate for papilloma (P < 0.001).

Discussion

B3 of the breast are commonly encountered in needle biopsies. Due to the potential for malignancy, the management of these lesions following needle biopsy remains controversial, with no universally accepted standard recommendation. Although surgical biopsy is widely recommended for ADH, the management of other B3 lesions should be determined on a patient basis through a multidisciplinary approach. Criteria for surgical excision may include sampling adequacy (e.g., needle gauge, number of samples, and accurate targeting), lesion size, and radiology–pathology concordance.^7-9

The literature has studied the role of MRI in managing high-risk lesions. Londero et al.¹³ reported that the absence of enhancement on breast MRI effectively eliminated the risk of invasive cancer and served as a reliable indicator for excluding surgery in B3 lesions. Similarly, in our study, no progress was seen during follow-up in cases that were diagnosed as B3 lesions but were not included in the study because MRI did not show any contrast enhancement.

The KS system is a decision-making tool that integrates five morphology and kinetic criteria, along with two optional modifiers (microcalcifications and ADC values), to differentiate benign from malignant breast tumors. The KS system offers a standardized approach to breast MRI evaluation, enhancing its utility in clinical practice. In recent years, there has been a rapid increase in studies employing this flowchart.^14-20 Studies have demonstrated that inter-reader agreement is high and that the KS enhances the diagnostic performance of MRI.^14-18 Wang et al.¹⁹ has also showed that the KS is a useful diagnostic tool that helps radiologists with different levels of breast MRI experiences make more accurate diagnoses. According to Zhang et al.²⁰, KS is a better way to diagnose breast lesions than BI-RADS, whether the lesions show non-mass enhancement or are evaluated on their own. Furthermore, Wengert et al.²¹ gave useful supporting data and pushed for the use of KS to eliminate BI-RADS 4 mammography calcifications. However, no studies to date have specifically evaluated the application of KS in B3 lesions.

In our study, the upgrade rates were comparable with those reported in the literature. However, the excision rates were higher than those documented in previous studies.^4-6 This can be attributed to the large proportion of high-risk patients and the limited availability of VAB in our country.

There were three false-negative cases in our study. Two cases (LN and FEH) were upgraded to low-grade DCIS following excision (Figure 2). It is well-established that MRI has low specificity for detecting low-grade DCIS,²² which may explain these false-negative results. In one of these cases (ADH), microcalcifications led to an increased score, highlighting the importance of incorporating optional moderators in the KS system.

We observed false-positive results in 10 cases. Four patients had papillomas, and three had radial scars. In the false-positive papilloma cases, the type 2 contrast enhancement pattern contributed to the increased scores (Figure 3). The literature indicates that papillomas are a heterogeneous group that may exhibit varying enhancement patterns,²³ which we believe contributes to the higher false-positive rate. Additionally, contour irregularity and spiculation positivity increased the scores in cases of radial scars. Radial scars were present in six of the patients with false-positive results. Radial scars are inherently characterized by irregular contours.²⁴ In the KS system, scoring begins at six points due to the spiculation positivity commonly observed in radial scars, leading to false-positive outcomes.

Evaluation of the KS descriptors revealed that 11 patients exhibited positive spiculation and contour irregularity. The KS values of all 11 cases ranged from 6 to 11, and 8 of them were upgraded (Figure 4). These findings show that spiculation positivity and contour irregularity are significantly associated with lesion upgrade.

Our study identified three cases with edema, all of which underwent an upgrade. Recent studies have shown that peritumoral edema is associated with poor prognosis.²⁵ Consequently, the presence of edema may have a high positive value for predicting B3 lesions upgrade.

Additionally, the internal enhancement pattern may significantly influence the prediction of lesion upgrade. The inhomogeneous enhancing pattern increased the lesion score from 4 to 8. Significantly, three of these lesions are confirmed true positives.

The acceptance of a 5 KS value for differentiating malignant from benign tumors resulted in an accuracy of 82.89%. Nevertheless, when the cut-off value was set at 7, the specificity (98.3%) improved without significantly reducing accuracy.

The limitations of our study include the heterogeneity of B3 lesion pathologies. The study included a small number of ADH lesions because their exclusion would not have made a statistical difference. The high number of papillomas may be due to the broad MRI indication, which aimed to reduce the risk of papillomatosis and detect cancer in the ipsilateral breast.²⁶ Furthermore, our study is single-centered and retrospective in design. Additionally, two breast radiologists conducted the KS assessment; however, another limitation is the absence of statistical analysis for inter-reader agreement. This study can be conducted prospectively on specific B3 lesion subgroups.

In conclusion, we speculate that increasing the KS threshold value in future studies with larger sample sizes could help avoid unnecessary surgeries. In conclusion, the KS system demonstrates the ability to predict B3 lesion upgrade accurately.

Conflict of interest disclosure

The authors declared no conflicts of interest.

References

Perry N, Broeders M, de Wolf C, Törnberg S, Holland R, von Karsa L. European guidelines for quality assurance in breast cancer screening and diagnosis. Fourth edition--summary document.Ann Oncol. 2008;19(4):614-622.

Lakhani SREI, Schnitt SJ, Tan PH, van de Vijver MJ. WHO classification of tumours of the breast, 4th ed. International Agency for Research on Cancer, Lyon; 2012;50-54.

Morrow M, Schnitt SJ, Norton L. Current management of lesions associated with an increased risk of breast cancer.Nat Rev Clin Oncol. 2015;12(4):227-238.

Mooney KL, Bassett LW, Apple SK. Upgrade rates of high-risk breast lesions diagnosed on core needle biopsy: a single-institution experience and literature review.Mod Pathol. 2016;29(12):1471-1484.

Polat DS, Schopp JG, Arjmandi F, et al. Performance of a clinical and imaging-based multivariate model as decision support tool to help save unnecessary surgeries for high-risk breast lesions.Breast Cancer Res Treat. 2021;185:479-494.

Oktay A, Aslan Ö, Taşkın F, et al. Outcomes of high-risk breast lesions diagnosed using image-guided core needle biopsy: results from a multicenter retrospective study. Diagn Interv Radiol. 2023;20;29(4):579-587.

Rageth CJ, O’Flynn EA, Comstock C, et al. First International Consensus Conference on lesions of uncertain malignant potential in the breast (B3 lesions).Breast Cancer Res Treat. 2016;159(2):203-213.

Bahl M. Management of high-risk breast lesions.Radiol Clin North Am. 2021;59(1):29-40.

Rageth CJ, O’Flynn EAM, Pinker K, et al. Second International Consensus Conference on lesions of uncertain malignant potential in the breast (B3 lesions).Breast Cancer Res Treat. 2019;174(2):279-296.

Dietzel M, Baltzer PAT. How to use the Kaiser score as a clinical decision rule for diagnosis in multiparametric breast MRI: a pictorial essay.Insights Imaging.2018;9(3):325-335.

Baltzer PA, Dietzel M, Kaiser WA. A simple and robust classification tree for differentiation between benign and malignant lesions in MR-mammography.Eur Radiol. 2013;23(8):2051-2060.

Baltzer PAT, Krug KB, Dietzel M. Evidence-based and structured diagnosis in breast MRI using the Kaiser Score.Rofo.2022;194(11):1216-1228.

Londero V, Zuiani C, Linda A, Girometti R, Bazzocchi M, Sardanelli F. High-risk breast lesions at imaging-guided needle biopsy: usefulness of MRI for treatment decision.AJR Am J Roentgenol. 2012;199(2):240-250.

Marino MA, Clauser P, Woitek R, et al. A simple scoring system for breast MRI interpretation: does it compensate for reader experience?Eur Radiol. 2016;26(8):2529-2537.

Istomin A, Masarwah A, Vanninen R, Okuma H, Sudah M. Diagnostic performance of the Kaiser score for characterizing lesions on breast MRI with comparison to a multiparametric classification system.Eur J Radiol. 2021;138:109659.

Cloete DJ, Minne C, Schoub PK, Becker JHR. Magnetic resonance imaging of fibroadenoma-like lesions and correlation with breast imaging-reporting and data system and Kaiser scoring system.SA J Radiol.2018;22(2):1532.

Milos RI, Pipan F, Kalovidouri A, et al. The Kaiser score reliably excludes malignancy in benign contrast-enhancing lesions classified as BI-RADS 4 on breast MRI high-risk screening exams.Eur Radiol.2020;30(11):6052-6061.

Jajodia A, Sindhwani G, Pasricha S, et al. Application of the Kaiser score to increase diagnostic accuracy in equivocal lesions on diagnostic mammograms referred for MR mammography.Eur J Radiol.2021;134:109413.

Wang Q, Fu F, Chen Y, et al. Application of the Kaiser score by MRI in patients with breast lesions by ultrasound and mammography.Diagn Interv Radiol.2022;28(4):322-328.

Zhang B, Feng L, Wang L, Chen X, Li X, Yang Q. Kaiser score for diagnosis of breast lesions presenting as non-mass enhancement on MRI.Nan Fang Yi Ke Da Xue Xue Bao.2020;40(4):562-566.

Wengert GJ, Pipan F, Almohanna J, et al. Impact of the Kaiser score on clinical decision-making in BI-RADS 4 mammographic calcifications examined with breast MRI. Eur Radiol. 2020;30(3):1451-1459.

Taskin F, Kalayci CB, Tuncbilek N, et al. The value of MRI contrast enhancement in biopsy decision of suspicious mammographic microcalcifications: a prospective multicenter study.Eur Radiol. 2021;31(3):1718-1726.

Yılmaz R, Bender Ö, ÇelikYabul F, Dursun M, Tunacı M, Acunas G. Diagnosis of nipple discharge: value of magnetic resonance imaging and ultrasonography in comparison with ductoscopy.Balkan Med J. 2017;34(2):119-126.

Bargallo X, Ubeda B, Ganau S, et al. Magnetic resonance imaging assessment of radial scars/complex sclerosing lesions of the breast.Curr Med Imaging. 2022;18(2):242-248.

Panzironi G, Moffa G, Galati F, Marzocca F, Rizzo V, Pediconi F. Peritumoral edema as a biomarker of the aggressiveness of breast cancer: results of a retrospective study on a 3 T scanner.Breast Cancer Res Treat.2020;181(1):53-60.

Gültekin MA, Yabul FÇ, Temur HO, et al. Papillary lesions of the breast: addition of DWI and TIRM sequences to routine breast MRI could help in differentiation benign from malignant.Curr Med Imaging.2022;18(9):962-969.