Hostname: page-component-745bb68f8f-kw2vx Total loading time: 0 Render date: 2025-02-11T13:26:08.898Z Has data issue: false hasContentIssue false

Predictive modelling analysis for development of a radiotherapy decision support system in prostate cancer: a preliminary study

Published online by Cambridge University Press:  31 January 2017

Kwang Hyeon Kim
Affiliation:
Department of Biomedical Science, Graduate School of Korea University, Seoul, South Korea Department of Radiation Oncology, College of Medicine, Korea University, Seoul, South Korea
Suk Lee*
Affiliation:
Department of Radiation Oncology, College of Medicine, Korea University, Seoul, South Korea
Jang Bo Shim
Affiliation:
Department of Radiation Oncology, College of Medicine, Korea University, Seoul, South Korea
Kyung Hwan Chang
Affiliation:
CQURE Healthcare, Seoul, South Korea
Yuanjie Cao
Affiliation:
Innotems, Daejeon, South Korea
Suk Woo Choi
Affiliation:
Asan Medical Center, Seoul, South Korea
Se Hyeong Jeon
Affiliation:
Department of Radiation Oncology, Tianjin Medical University Cancer Institute and Hospital, Tianjin, China
Dae Sik Yang
Affiliation:
Department of Radiation Oncology, College of Medicine, Korea University, Seoul, South Korea
Won Sup Yoon
Affiliation:
Department of Radiation Oncology, College of Medicine, Korea University, Seoul, South Korea
Young Je Park
Affiliation:
Department of Radiation Oncology, College of Medicine, Korea University, Seoul, South Korea
Chul Yong Kim
Affiliation:
Department of Radiation Oncology, College of Medicine, Korea University, Seoul, South Korea
*
Correspondence to: Suk Lee, PhD, Department of Radiation Oncology, Korea University Medical Center, 126-1, Anamdong, Seongbukgu, 02841 Seoul, Korea. Tel: +82-2-920-5519, Fax: +82-2-927-1419, E-mail: sukmp@korea.ac.kr
Rights & Permissions [Opens in a new window]

Abstract

Purpose

The aim of this study is to develop predictive models to predict organ at risk (OAR) complication level, classification of OAR dose-volume and combination of this function with our in-house developed treatment decision support system.

Materials and methods

We analysed the support vector machine and decision tree algorithm for predicting OAR complication level and toxicity in order to integrate this function into our in-house radiation treatment planning decision support system. A total of 12 TomoTherapyTM treatment plans for prostate cancer were established, and a hundred modelled plans were generated to analyse the toxicity prediction for bladder and rectum.

Results

The toxicity prediction algorithm analysis showed 91·0% accuracy in the training process. A scatter plot for bladder and rectum was obtained by 100 modelled plans and classification result derived. OAR complication level was analysed and risk factor for 25% bladder and 50% rectum was detected by decision tree. Therefore, it was shown that complication prediction of patients using big data-based clinical information is possible.

Conclusion

We verified the accuracy of the tested algorithm using prostate cancer cases. Side effects can be minimised by applying this predictive modelling algorithm with the planning decision support system for patient-specific radiotherapy planning.

Type
Original Articles
Copyright
© Cambridge University Press 2017 

Introduction

A number of treatment plans are generated for each patient in order to establish the optimal radiation treatment plan. The final treatment plan is selected by applying a quantitative analysis method by determining the delineation shape of the planning target volume, organ at risk (OAR) and a qualitative analysis method based on the dose volume histogram (DVH).

However, there is no guarantee that the treatment plan selected by this analysis and evaluation will not cause radiotherapy side effects in the patient. Therefore, if the radiation oncologists and medical physicists consider historical clinical data on complications with typical treatment plan factors, such as DVH and OAR dose constraint range, they can establish the optimal treatment plan minimising OAR concerns by suppressing the normal tissue complication probability (NTCP) and increasing the tumour control probability (TCP).

Current advances in diagnostic and therapeutic technologies are under research and development through innovative tools that combine oncology, diagnostics, genetics and computer science to improve the quality of life of patients after treatment. Thus, the clinical decision support system is also being researched using clinical big data with application of prediction modelling using a machine learning algorithm. The role of the prediction model in the radiation treatment decision support system is to maximise tumour control and minimise side effects after treatment and to determine whether the plan offers acceptable dose risk by applying classification and regression methods to dose-volume data using an existing clinical database.Reference Meldolesi, van Soest and Damiani 1

With this aim, Zhang et al. recently studied complication prediction of radiation therapy using machine learning in the field of radiation therapy.Reference Zhang, D’Souza, Shi and Meyer 2 In addition, Guidi et al. are also making progress in studies to predict criticality in which machine learning algorithms target particular cases, such as patients with head and neck cancer.Reference Guidi, Maffei and Vecchi 3

However, no known published studies have integrated machine learning algorithms and dosimetrical and biological index analysis functions into a radiation treatment planning decision support system to determine the optimal plan.

Cao et al. performed integrated analysis studies of dosimetrical and biological index data using prostate cancer cases.Reference Cao, Lee and Chang 4 , Reference Cao, Lee and Chang 5 In addition, big data analysis studies of prostate cancer using machine learning approaches have been performed.Reference Çınar, Engin, Engin and Atesçi 6 Reference Mohammed and Wagner Meira 8 According to Çınar et al., prostate cancer is currently the most common type of cancer in men except lung cancer.Reference Çınar, Engin, Engin and Atesçi 6 Therefore, we used prostate cancer cases as the model system.

The aim of this study was to develop a predictive model solution that includes the functions of support vector machine and decision tree algorithm to predict OAR complication level and suitable classification of OAR dose-volume values and to combine this function with an in-house developed treatment decision support system in a preliminary study.

Material and Methods

Patient group

The target population was 12 male patients with adenocarcinoma of the prostate, for whom 12 treatment plans had been established. The patient characteristics are as follows: average age, 72 years; average weight, 75·68 kg; tumour-node-metastasis (TNM) stage, T1c-T3b, N0 and M0 (Table 1). The treatment planning system used was TomoTherapy® (Accuray Incorporated, Sunnyvale, CA, USA).

Table 1 Characteristics of the patients with prostate cancer

Note: To establish original treatment plans for patients (n=12).

Abbreviations: PD, prescription dose; FD, fractional dose; OP, operation; TNM, tumour-node-metastasis.

The DVHs for the OARs of bladder and rectum are shown in Figure 1.

Figure 1 Established dose volume histogram (DVH) of bladder and rectum for 12 patients with prostate cancer (n=12).

We developed an in-house planning decision support programme to input DVH information from the treatment plan and integrated the results of this study into our system. As an example, we used dose-volume data of the OAR to predict complications as a constraint value (Table 2).

Table 2 Dose-volume constraints for bladder and rectum as organs at risk

Predictive modelling using machine learning algorithm

The machine learning algorithm for the radiation treatment planning decision support system can be used in the prediction of complications in OARs exposed to radiation as the peripheral target during radiation therapy.Reference El Naqa, Li and Murphy 9 That is, the predictive modelling algorithm calculates the results using comprehensive data in accordance with the state of the indicator characteristic of the patients and treatment plans.

Accordingly, there is a need to verify the results of late toxicity through a decision tree model or dose-volume data analysis based on current knowledge and historical clinical outcomes.

A prediction model can be applied using index data such as age, TNM stage, gender, prescribed dose, tumour control probability and survival rate.Reference Lambin, van Stiphout and Starmans 10 In addition, the support vector machine (SVM) algorithm can be applied to classify the different OAR dose constraints.Reference Zhang, D’Souza, Shi and Meyer 2 The DVH of the patients during radiation therapy is a significant predictive indicator.Reference Lambin, van Stiphout and Starmans 10

Therefore, we used the DVH data of patients as input parameters for the application of clinical big data, and machine learning techniques were used in the SVM and decision tree as described previously for complication prediction.Reference Zhang, D’Souza, Shi and Meyer 2

Support vector machine

The algorithm is developed to select the best classifier to separate two groups by drawing a perpendicular line between groups in the hyperplane. In the case of the nonlinear model, the kernel method is used to distinguish the linear machine.Reference El Naqa, Li and Murphy 9

A hyperplane is defined as the set of all points xR dimension that satisfy h(x)=0, where h(x) is the function of the hyperplane, as follows in equation (1) in d dimensions:Reference De Bari, Vallati and Gatta 7 , Reference Mohammed and Wagner Meira 8

(1) $$h\left( x \right){\rm {\equals}}w^{T} x{\rm {\plus}}b$$

In this study, we modelled the SVM algorithm using dose-volume input in test and training models as shown in the flow chart in Figure 2. Figure 2a shows the entire analysis system from the treatment planning data, including quantitative analysis for homogeneity index, conformity index and conformation number and dosimetrical indices, TCP, NTCP of biological indices in addition to a big data-based prediction algorithm, to the results. The predictive algorithm component is further defined as in Figure 2b, which describes how dose-volume data for every patient is used as the input and the training and test processes involved in the SVM for classification to achieve an accurate final outcome.

Figure 2 Model of a radiation treatment planning decision support system (a) and its predictive modelling flow chart for the support vector machine algorithm (b) of this study. Abbreviations: PITV, prescription isodose to target volume; CI, conformity index; HI, homogeneity index; TCI, target coverage index; MHI, modified homogeneity index; CN, conformity number; COSI, critical organ scoring index; QF, quality factor

A total of 100 model plans were generated based on 12 treatment plans for analysis of the support vector machine algorithm (Table 3).

Table 3 Classification of bladder and rectum complications by 100 modelled plans for the support vector machine algorithm (n=100)

Abbreviation: NC, non-complication.

Decision tree

A decision tree requires that critical decision points be selected for outgoing confirmation based on specific conditions by selecting a final value with these conditions. This can be formalised by a simple pattern and is an algorithm that can be programmed using machine learning.

The decision point X j v divides the input data space, R, into two sections: R Y and R N . The division of R into R Y and R N also derives a binary section of the corresponding input data point D Input. This means that a division point of the form X j v derives the data into sections, as in equations (2) and (3).

(2) $$D_{Y} {\rm {\equals}\{ }x{\rm \mid}\,x\,\in\,D_{{{\rm Input}}} {\rm ,}\,x_{j} \leq v{\rm \} }$$
(3) $$D_{N} {\rm {\equals}\{ }x\,{\rm \mid}\,x\in\,D_{{{\rm Input}}} {\rm ,}\,x_{j} {\rm \,\gt\,}v{\rm \} }$$

where D Y is the group of data points lying in region R Y , and D N is the group of input points lying in R N .Reference Mohammed and Wagner Meira 8

To analyse the decision tree algorithm, we calculated an additional eight plans based on the 12 original treatment plans, expanding the analysis to 20 plans. Table 4 shows the doses (Gy) of 25% bladder, 50% bladder, 25% rectum and 50% rectum using this prediction.

Table 4 Complication prediction for bladder and rectum using 20 representative plans for the decision tree algorithm (n=20)

Abbreviation: NC, non-complication; C, complication.

Results

The results of analysis with the machine learning algorithm showed 91·0% accuracy after the training process with respect to 100 modelled plans using SVM.

In addition, the OAR complication analysis showed possible classification of potential risk factors as complication (C) and non-complication (NC) relative to 25% bladder, 50% rectum and 30% bowel using the decision tree. Therefore, we could combine a programme including this machine learning algorithm and our in-house developed planning decision support system to allow complication predictions for patients based on clinical big data.

Predictive modelling analysis results

SVM

Figure 3 shows the results of classification analysis for bladder and rectum with respect to the 100-model plan. Quadratic SVM analysis correctly separated NC cases: red dots in Figure 3 indicate correct classification, and red crosses show misclassified NC. The true positive rate and false positive rate were obtained to demonstrate the performance of the SVM classifier for the analysis and showed an area under curve of 0·8107 (Figure 4).

Figure 3 Scatter plot of patient organs at risk (OARs) with the support vector machine for 100 modelled plans. Note: red dot (·): correctly classified as NC; red cross (x): misclassified as NC. (a) Bladder scatter plot with modeled plans; (b) rectum scatter plot with modeled plans. Abbreviation: NC, non-complication.

Figure 4 Receiver operating characteristic (ROC) of the classifier with the support vector machine. Note: Area under curve (AUC)=0·8107, (a)=positive class for complication, (b)=positive class for non-complication.

In addition, confusion matrix analysis was performed to calculate the error matrix, showing a 91% rate of accuracy for the predicted class and true class (Figure 5).

Figure 5 Confusion matrix for support vector machine analysis and the predicted class for non-complication (NC) with 91·0% accuracy.

Decision tree

As a result of the decision tree analysis, complication prediction was possibly based on the dose of 25% bladder, 50% rectum and 30% bowel. When the radiation oncologists and medical physicists decide the final treatment plans before radiation therapy, the dose constraint for every OAR makes it complicated to determine an optimal plan; thus, a method considering these complex factors would be a useful analytical tool to predict complications (Figure 6).

Figure 6 Decision tree for grade 2 rectal complication classification for 100 plans with 25% bladder, 50% rectum, 30% bowel.

Integration with SMARTRT

SMARTRT is an in-house radiation treatment planning decision support system (PDSS) that was developed to give a final scoring scheme that included DVH information for the patient from the treatment plan and functions with dosimetrical and biological index analysis through the overall quality factor result. However, if the toxicity prediction function is added into the SMARTRT programme using clinical big data and comprehensive clinical side effects could be linked to solve complication prediction, we might be able to achieve the optimal patient-specific PDSS (Figure 7).

Figure 7 Integrated flow chart of toxicity prediction, dosimetric biological index analysis and overall factors for SMARTRT. Abbreviation: DVH, dose volume histogram; PTV, planning target volume; OAR, organ at risk; TCP, tumour control probability; NTCP, normal tissue complication probability; PITV, prescription isodose to target volume; CI, conformity index; HI, homogeneity index; TCI, target coverage index; MHI, modified homogeneity index; CN, conformity number; COSI, critical organ scoring index; RO, radiation oncology; DB, database; RTOG, radiation therapy oncology group; EUD, equivalent uniform dose.

Discussion and Conclusion

To improve the quality of life of the patient after treatment, more accurate patient treatment plans are needed in the field of radiation oncology. This should allow more accurate prognosis of patient outcomes after treatment. This issue is being addressed by planning decision support system research using fundamental DVH analysis, as well as dosimetrical and biological indices with TCP and NTCP.Reference Cao, Lee and Chang 4 , Reference Sanchez-Nieto and Nahum 11 Reference Bentzen, Constine and Deasy 13

The treatment plan has to be compared more accurately according to the optimum patient-specific PDSS, which includes a predictive model-based function and will represent a significant breakthrough in patient care through machine learning research that can be linked to clinical big data (Figure 7). We present the total artificial intelligence-based integrated clinical decision support system of this study in Figure 8. This system includes an intelligent clinical decision support algorithm with machine learning and deep learning as the artificial intelligence system using clinical big data that could be further expanded.

Figure 8 The artificial intelligence (AI)-based integrated clinical decision support system of this study. Abbreviation: DICOM RT, Digital Imaging and Communications in Medicine Radiation Therapy.

Machine learning analysis-based studies with clinical cases in radiation oncology are being researched.Reference El Naqa, Bradley, PE, Hope and Deasy 14 Reference Coates, Souhami and El Naqa 18 Therefore, it seems likely that more patient cases and multi-institutional studies will be compiled to increase the amount of training data and provide more accurate results. This will be the foundation for the development of optimal patient-specific PDSS for prognosis.

Acknowledgement

This work was supported by a Korea University grant.

Conflicts of Interest

None.

References

1. Meldolesi, E, van Soest, J, Damiani, A et al. Standardized data collection to build rediction models in oncology: a prototype for rectal cancer. Future Oncol 2016; 12 (1): 119136.Google Scholar
2. Zhang, H H, D’Souza, W D, Shi, L, Meyer, R R. Modeling plan-related clinical complications using machine learning tools in a multiplan IMRT framework. Int J Radiat Oncol Biol Phys 2009; 74 (5): 16171626.Google Scholar
3. Guidi, G, Maffei, N, Vecchi, C et al. A support vector machine tool for adaptive tomotherapy treatments: prediction of head and neck patients criticalities. Phys Med 2015; 31 (5): 442451.Google Scholar
4. Cao, Y J, Lee, S, Chang, K H et al. Patient performance-based plan parameter optimization for prostate cancer in tomotherapy. Med Dosim 2015; 40 (4): 285289.CrossRefGoogle ScholarPubMed
5. Cao, Y J, Lee, S, Chang, K H et al. Optimized planning target volume margin in helical tomotherapy for prostate cancer: is there a preferred method? J Kor Phys 2015; 67 (1): 2632.Google Scholar
6. Çınar, M, Engin, M, Engin, E Z, Atesçi, Y Z. Early prostate cancer diagnosis by using artificial neural networks and support vector machines. Exp Sys App 2009; 36: 63576361.Google Scholar
7. De Bari, B, Vallati, M, Gatta, R et al. Could machine learning improve the prediction of pelvic nodal status of prostate cancer patients? Preliminary results of a pilot study. Cancer Invest 2015; 33 (6): 232240.CrossRefGoogle ScholarPubMed
8. Mohammed, J Z, Wagner Meira, J R. Data Mining and Analysis. England: Cambridge University Press, 2014.Google Scholar
9. El Naqa, I, Li, R, Murphy, M J. Machine Learning in Radiation Oncology: Theory and Applications, 2nd edition. Switzerland: Springer, 2015.Google Scholar
10. Lambin, P, van Stiphout, R G, Starmans, M H et al. Predicting outcomes in radiation oncology – multifactorial decision support systems. Nat Rev Clin Oncol 2015; 10 (1): 2740.Google Scholar
11. Sanchez-Nieto, B, Nahum, A E. BIOPLAN: software for the biological evaluation of radiation therapy. Med Dosim 2000; 25 (2): 7176.Google Scholar
12. Pinter, C, Lasso, A, Wang, A, Jaffray, D, Fichtinger, G. SlicerRT: radiation therapy research toolkit for 3D Slicer. Med Phys 2012; 39 (10): 63326338.Google Scholar
13. Bentzen, S M, Constine, L S, Deasy, J O et al. Quantitative analyses of normal tissue effects in the clinic (QUANTEC): an introduction to the scientific issues. Int J Radiat Oncol Biol Phys 2010; 76 (3): S3S9.Google Scholar
14. El Naqa, I, Bradley, J D, PE, L, Hope, A J, Deasy, J O. Predicting radiotherapy outcomes using statistical learning techniques. Phys Med Biol 2009; 54 (18): S9.Google Scholar
15. Kang, J, Schwartz, R, Flickinger, J, Beriwal, S. Machine learning approaches for predicting radiation therapy outcomes: a clinician’s perspective. Int J Radiat Oncol Biol Phys 2015; 93 (5): 11271135.Google Scholar
16. Bibault, J E, Giraud, P, Burgun, A. Big data and machine learning in radiation oncology: state of the art and future prospects. Cancer Lett 2016; 16: 3034630349.Google Scholar
17. Trifiletti, D M, Showalter, T N. Big data and comparative effectiveness research in radiation oncology: synergy and accelerated discovery. Front Oncol 2015; 5: 274.CrossRefGoogle ScholarPubMed
18. Coates, J, Souhami, L, El Naqa, I. Big data analytics for prostate radiotherapy. Front Oncol 2016; 6: 149.CrossRefGoogle ScholarPubMed
Figure 0

Table 1 Characteristics of the patients with prostate cancer

Figure 1

Figure 1 Established dose volume histogram (DVH) of bladder and rectum for 12 patients with prostate cancer (n=12).

Figure 2

Table 2 Dose-volume constraints for bladder and rectum as organs at risk

Figure 3

Figure 2 Model of a radiation treatment planning decision support system (a) and its predictive modelling flow chart for the support vector machine algorithm (b) of this study. Abbreviations: PITV, prescription isodose to target volume; CI, conformity index; HI, homogeneity index; TCI, target coverage index; MHI, modified homogeneity index; CN, conformity number; COSI, critical organ scoring index; QF, quality factor

Figure 4

Table 3 Classification of bladder and rectum complications by 100 modelled plans for the support vector machine algorithm (n=100)

Figure 5

Table 4 Complication prediction for bladder and rectum using 20 representative plans for the decision tree algorithm (n=20)

Figure 6

Figure 3 Scatter plot of patient organs at risk (OARs) with the support vector machine for 100 modelled plans. Note: red dot (·): correctly classified as NC; red cross (x): misclassified as NC. (a) Bladder scatter plot with modeled plans; (b) rectum scatter plot with modeled plans. Abbreviation: NC, non-complication.

Figure 7

Figure 4 Receiver operating characteristic (ROC) of the classifier with the support vector machine. Note: Area under curve (AUC)=0·8107, (a)=positive class for complication, (b)=positive class for non-complication.

Figure 8

Figure 5 Confusion matrix for support vector machine analysis and the predicted class for non-complication (NC) with 91·0% accuracy.

Figure 9

Figure 6 Decision tree for grade 2 rectal complication classification for 100 plans with 25% bladder, 50% rectum, 30% bowel.

Figure 10

Figure 7 Integrated flow chart of toxicity prediction, dosimetric biological index analysis and overall factors for SMARTRT. Abbreviation: DVH, dose volume histogram; PTV, planning target volume; OAR, organ at risk; TCP, tumour control probability; NTCP, normal tissue complication probability; PITV, prescription isodose to target volume; CI, conformity index; HI, homogeneity index; TCI, target coverage index; MHI, modified homogeneity index; CN, conformity number; COSI, critical organ scoring index; RO, radiation oncology; DB, database; RTOG, radiation therapy oncology group; EUD, equivalent uniform dose.

Figure 11

Figure 8 The artificial intelligence (AI)-based integrated clinical decision support system of this study. Abbreviation: DICOM RT, Digital Imaging and Communications in Medicine Radiation Therapy.