Application of soft sets to diagnose the prostate cancer risk
© Yuksel et al.; licensee Springer. 2013
Received: 14 December 2012
Accepted: 20 April 2013
Published: 7 May 2013
In recent years the artificial intelligence has been developed rapidly since it can be applied easily to several areas like medical diagnosis, engineering and economics, among others. In this study we have devised a soft expert system (SES) as a prediction system for prostate cancer by using the prostate specific antigen (PSA), prostate volume (PV) and age factors of patients based on fuzzy sets and soft sets and have calculated the patients’ prostate cancer risk. Our data set has been provided by the Department of Urology, Meram Medical Faculty in Necmettin Erbakan University, Konya, Turkey.
Keywordsfuzzy set soft set prostate cancer soft expert system
In recent years vague concepts have been used in different areas such as medical applications, pharmacology, economics and engineering since the classical mathematics methods are inadequate to solve many complex problems in these areas. Traditionally mathematics uses a crisp (well-defined) property , i.e., properties that are either true or false. Each property defines a set: .
The most successful theoretical approach to vagueness is undoubtedly fuzzy set theory introduced by Zadeh . The theory is used commonly in different areas as engineering, medicine and economics, among others. The fuzzy set theory is based on the fuzzy membership function . By the fuzzy membership function, we can determine the membership grade of an element with respect to a set. A fuzzy set F is described by its membership function . The fuzzy set theory has become very popular and has been used to solve problems in different areas. But there exists a difficulty: how to set the membership function in each particular case. The reason for these difficulties is, possibly, the inadequacy of a parametrization tool of the theory . Soft set theory was initiated by Molodtsov  as a new method for vagueness. Molodtsov showed in his paper that the theory can be applied to several areas successfully; for example, the smoothness of functions, game theory, Riemann-integration, Perron-integration, etc. He also showed that soft set theory is free from the parametrization inadequacy syndrome of other theories developed for vagueness. A soft set can be represented by Boolean-valued information system, and so it can be used to represent a dataset. Also, the hybrid models of the vague sets take attention of researchers. Maji et al.  defined a hybrid model called fuzzy soft sets. This new model is a combination of fuzzy and soft sets and is a generalization of soft sets. Irfan Ali and Shabir  developed the theory. To address decision making problems based on fuzzy soft sets, Feng et al. introduced the concept of level soft sets of fuzzy soft sets and initiated an adjustable decision-making scheme using fuzzy soft sets . Feng et al.  first considered the combination of soft sets, fuzzy sets and rough sets. Using soft sets as the granulation structures, Feng et al.  defined soft approximation spaces, soft rough approximations and soft rough sets, which are generalizations of Pawlak’s rough set model based on soft sets. It has been proven that in some cases Feng’s soft rough set model could provide better approximations than classical rough sets. Simsekler (Dizman) and Yuksel  contributed to fuzzy soft topological structures.
Prostate cancer is the second most common cause of cancer death among men in most industrialized countries, and it depends on various factors such as family cancer history, age, ethnic background and the level of prostate specific antigen (PSA) in blood. The level of PSA in blood is very important method to an initial diagnosis for patients [10–12]. However the level of PSA in blood can be increased by inflammation of prostate and benign prostate hyperplasia (BPH). For this reason, it is difficult to differentiate it from benign prostate hyperplasia (BPH). The definitive diagnose of the prostate cancer is possible with prostate biopsy. The results of PSA test, rectal examination and transrectal findings help the doctor to decide whether biopsy is necessary or not [1, 13, 14]. However the patients with low cancer risk have to avoid this process due to possible complications and its high cost. Because of this reason, before agreeing to biopsy, the patients with low cancer risk can be determined. There are several research works in the area of the prostate cancer prognosis or diagnosis. One of them is FES which is a rule-based fuzzy expert system using the laboratory data PSA, PV and age of the patient and it aims to help to an expert-doctor to determine the necessity of biopsy and the risk factor . Benecchi  developed a neuro-fuzzy system by using both serum data (total prostate specific antigen and free prostate specific antigen) and clinical data (age of patients) to enhance the performance of tPSA (total prostate specific antigen) to distinguish prostate cancer. Keles et al.  built a neuro-fuzzy classifier to be used in the diagnosis of prostate cancer and BPH diseases. Since the symptoms of these two illnesses are very close to each other, the differentiation between them is an important problem. Saritas et al.  have devised an artificial neural network that provides a prognostic result indicating whether patients have cancer or not by using their free prostate specific antigen, total prostate specific antigen and age data.
In this study we aim to discuss how soft set theory can be used for developing knowledge-based system in medicine and devise a prediction system named soft expert system (SES) by using the PSA, PV and age data of patients based on fuzzy sets and soft sets and calculate the patients prostate cancer risk. It is a rule-based system, and according to the rules, we determine the risk of prostate cancer. Our aim is to help the doctor to determine whether the patient needs biopsy or not.
Definition 2.1 
A fuzzy set A in U is a set of ordered pairs:
, where is a mapping and (or ) states the grade of belonging of x in A. The family of all fuzzy sets in U is denoted by .
Definition 2.2 
Let . A pair is called a soft set over U, where F is a mapping given by , where E is the set of parameters. In other words, the soft set is a parametrized family of the subsets of U. Every set , from this family may be considered as the set of e-elements of the soft set , or the set of e-approximate elements of the soft set.
Tabular presentation of the soft set
Definition 2.3 
Let and be two soft sets over U. is called a soft subset of denoted by if and for every , . Two soft sets and over U are said to be equal, denoted by if and .
Definition 2.4 
A soft set over U is said to be a NULL soft set denoted by Φ if , .
Definition 2.5 
A soft set over U is said to be an absolute soft set denoted by if , .
Definition 2.6 
If and are two soft sets, then and denoted by is defined by , where , .
Definition 2.7 
Definition 2.8 
- 1.The extended intersection of and denoted by is defined as the soft set , where , and for all ,
The restricted intersection of and denoted by is defined as the soft set , where , and for every .
Theorem 2.1 
Every fuzzy set can be considered as a soft set.
Definition 2.9 
An information system
Proposition 2.2 
If is a soft set over the universe U, then is a Boolean-valued information system.
The reduction of parameters of soft sets has taken attention of several researchers. Kong  gave an algorithm for the normal parameter reduction of soft sets in 2008. In 2011 Ma  gave a new algorithm for the normal parameter reduction of soft sets and compared this new method with Kong’s method. These two algorithms calculate the same reduction, but Kong’s method is more difficult and complex. Ma gave a new algorithm that is more understandable and easier to avoid the difficulty of Kong’s algorithm.
3 Soft expert system
The input values of several patients
3.1 First step: fuzzyfication of data set
The fuzzy membership values of factors
0.53 S, 0.47 M
0.47 M, 0.53 O
0.2 VL, 0.8 L
0.77 S, 0.23 M
0.48 L, 0.52 M
0.8 S, 0.2 M
0.28 VL, 0.72 L
0,4 S 0,6 M
0,33 M 0,67 O
0,84 VL 0,16 L
0,13 M 0,87 O
0,6 VL 0,4 L
0,93 M 0,07 B
0,41 L 0,59 M
0,6 M 0,4 B
0,18 VL 0,82 L
0,4 M 0,6 B
0,66 VL 0,34 L
0,27 M 0,73 B
0,33 M 0,67 O
0,36 L 0,64 M
0,37 M 0,63 B
3.2 Second step: transforming the fuzzy sets to soft sets
3.3 Third step: parameter reduction of soft sets
3.4 Fourth step: obtaining soft rules
In this way, we obtain 400 rules. Then we eliminate some rules that have the same output (the same patient set), and hence we get 285 rules.
3.5 Fifth step: analysis of soft rules
In this step we analyze the soft rules and calculate the prostate cancer risk percentage. The patients set for each rule was obtained in the fourth step. We consider these sets and observe how many of the patients in the set have prostate cancer, then we rate the patients with prostate cancer to each patient in the set. Therefore we have the prostate cancer risk percentage for each rule. If a patient’s data is convenient to more than one rule and so has more than one rate, then we accept the highest one.
Now we calculate the risk percentage of the first rule:
There are 23 patients who have the properties stated in Rule 1. Prostate cancer is found in eight of these patients. Hence, the risk percentage for first rule is . We can easily say that the patients whose values of PSA, PV and age are convenient to the first rule have cancer risk of 34%. The values of patient are convenient to Rule 3, Rule 4 and Rule 8. When we look at the risk percentage of these rules, we see that Rule 8 has the highest rate. Hence the risk percentage of is 100% (the percentage of Rule 8).
The risk percentage for some rules is as follows:
Rule 1: If a patient has and and , then the cancer risk is 28%.
Rule 2: If a patient has and and , then the cancer risk is 34%.
Rule 3: If a patient has and and , then the cancer risk is 74%.
Rule 4: If a patient has and and , then the cancer risk is 83%.
Rule 5: If a patient has and and , then the cancer risk is 100%.
Rule 6: If a patient has and and , then the cancer risk is 100%.
Rule 7: If a patient has and and , then the cancer risk is 100%.
Rule 8: If a patient has and and , then the cancer risk is 100%.
Finally, we write the soft expert system which calculates the prostate cancer risk by input variables PSA, PV and age.
3.6 Calculation of prostate cancer risk
In this work we designed an expert system SES by using a soft set and it is a pioneering work for applying the soft sets to a medical diagnosis. We also used fuzzy membership functions and an algorithm to reduce the parameter set of soft sets. The expert doctor can reduce unnecessary biopsies in patients undergoing evaluation for prostate cancer by calculating the percentage of prostate cancer risk in the soft expert system. According to our devised system, if the risk percentage is bigger than 50%, then biopsy is necessary. Our data set contains 78 patients. These patients have high values of PSA, PV and age and they are potential prostate cancer patients. For this reason, the biopsy was applied to these patients; however, after biopsy it was seen that 44 of them had cancer. When we calculated the risk percentage of these 78 patients in the soft expert system, we saw that 51 patients needed biopsy, and 27 patients who really had low cancer risk had to avoid biopsy. Our aim is to help the doctor to decide whether the patient needs biopsy or not.
Dedicated to Professor Hari M Srivastava.
- Nguyen HP, Kreinovich V: Fuzzy logic and its applications in medicine. Int. J. Med. Inform. 2001, 62: 165–173. 10.1016/S1386-5056(01)00160-5View ArticleGoogle Scholar
- Zadeh LA: Fuzzy sets. Inf. Control 1965, 8: 338–353. 10.1016/S0019-9958(65)90241-XMathSciNetView ArticleGoogle Scholar
- Molodtsov D: Soft set theory-first results. Comput. Math. Appl. 1999, 37(4–5):19–31. 10.1016/S0898-1221(99)00056-5MathSciNetView ArticleGoogle Scholar
- Maji PK, Roy AR, Biswas R: Fuzzy soft sets. J. Fuzzy Math. 2001, 9(3):589–602.MathSciNetGoogle Scholar
- Ali MI, Shabir M: Comments on De Morgan’s law in fuzzy soft sets. J. Fuzzy Math. 2010, 18(3):679–686.MathSciNetGoogle Scholar
- Feng F, Jun YB, Liu XY, Li LF: An adjustable approach to fuzzy soft set based decision making. J. Comput. Appl. Math. 2010, 234: 10–20. 10.1016/j.cam.2009.11.055MathSciNetView ArticleGoogle Scholar
- Feng F, Li C, Davvaz B, Ali MI: Soft sets combined with fuzzy sets and rough sets. Soft Comput. 2010, 14: 899–911. 10.1007/s00500-009-0465-6View ArticleGoogle Scholar
- Feng F, Liu XY, Leoreanu-Fotea V, Jun YB: Soft sets and soft rough sets. Inf. Sci. 2011, 181: 1125–1137. 10.1016/j.ins.2010.11.004MathSciNetView ArticleGoogle Scholar
- Simsekler TH, Yuksel S: Fuzzy soft topological spaces. Ann. Fuzzy Math. Inf. 2012, 5(1):87–96.MathSciNetGoogle Scholar
- Catolona WJ, Partin AW, Slawin KM, Brawer MK, Flanigan RC, Patel A, et al.: Use of the percentage of free prostate-specific antigen to enhance differentiation of prostate cancer from benign prostatic disease: a prospective multicenter clinical trial. JAMA J. Am. Med. Assoc. 1998, 279: 1542–1547. 10.1001/jama.279.19.1542View ArticleGoogle Scholar
- Egawa S, Soh S, Ohori M, Uchida T, Gohji K, Fujii A, et al.: The ratio of free to total serum prostate specific antigen and its use in differential diagnosis of prostate carcinoma in Japan. Cancer 1997, 79: 90–98. (Online) 10.1002/(SICI)1097-0142(19970101)79:1<90::AID-CNCR13>3.0.CO;2-1View ArticleGoogle Scholar
- Van Cangh PJ, De Nayer P, De Vischer L, Sauvage P, Tombal B, Lorge F, et al.: Free to total prostate-specific antigen (PSA) ratio is superior to total PSA in differentiating benign prostate hypertrophy from prostate cancer. Prostate 1996, 29: 30–34. (Online)View ArticleGoogle Scholar
- Metlin C, Lee F, Drago J: The American cancer society national prostate cancer detection project. Findings on the detection of early prostate cancer in 2425 men. Cancer 1991, 67: 2949–2958. (Online) 10.1002/1097-0142(19910615)67:12<2949::AID-CNCR2820671202>3.0.CO;2-XView ArticleGoogle Scholar
- Seker H, Odetayo M, Petrovic D, Naguib RNG: A fuzzy logic based method for prognostic decision making in breast and prostate cancers. IEEE Trans. Inf. Technol. Biomed. 2003, 7: 114–122. 10.1109/TITB.2003.811876View ArticleGoogle Scholar
- Saritas I, Allahverdi N, Sert U: A fuzzy expert system design for diagnosis of prostate cancer. International Conference on Computer Systems and Technologies - CompSysTech’2003 2003.Google Scholar
- Benecchi L: Neuro-fuzzy system for prostate cancer diagnosis. Urology 2006, 68(2):357–361. 10.1016/j.urology.2006.03.003View ArticleGoogle Scholar
- Keles A, Hasiloglu AS, Keles A, Aksoy Y: Neuro-fuzzy classification of prostate cancer using NEFCLASS-J. Comput. Biol. Med. 2007, 37: 1617–1628. 10.1016/j.compbiomed.2007.03.006View ArticleGoogle Scholar
- Saritas I, Ozkan IA, Sert U: Prognosis of prostate cancer by artificial neural networks. Expert Syst. Appl. 2010, 37: 6646–6650. 10.1016/j.eswa.2010.03.056View ArticleGoogle Scholar
- Maji PK, Biswas R, Roy AR: Soft set theory. Comput. Math. Appl. 2003, 45: 555–562. 10.1016/S0898-1221(03)00016-6MathSciNetView ArticleGoogle Scholar
- Ali MI, Feng F, Liu X, Min WK, Shabir M: On some new operations in soft set theory. Comput. Math. Appl. 2009, 57: 1547–1553. 10.1016/j.camwa.2008.11.009MathSciNetView ArticleGoogle Scholar
- Aktas H, Cagman N: Soft sets and soft groups. Inf. Sci. 2007, 77: 2726–2735.MathSciNetView ArticleGoogle Scholar
- Herewan T, Deris MM: A soft set approach for association rules mining. Knowl.-Based Syst. 2011, 24: 186–195. 10.1016/j.knosys.2010.08.005View ArticleGoogle Scholar
- Kong Z, Gao L, Wang L, Li S: The normal parameter reduction of soft sets and its algorithm. Comput. Math. Appl. 2008, 56(12):3029–3037. 10.1016/j.camwa.2008.07.013MathSciNetView ArticleGoogle Scholar
- Ma X, Sulaiman N, Qin H, Herewan T, Zain JM: A new efficient normal parameter reduction algorithm of soft set. Comput. Math. Appl. 2011, 62: 588–598. 10.1016/j.camwa.2011.05.038MathSciNetView ArticleGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.