Applying Ensemble Techniques for Improving the Performance of Rule-based Models in Data Mining

Main Article Content

ปพิชญา กลางนอก จารี ทองคำ

Abstract

Modeling for high-performance forecasting is a challenging research. This research aims to enhance the performance of basic models including FURIA, MODLEM and RIPPER with popular integration techniques, including Bagging and Weighted Instances handler wrapper (WI). Data were collected from 699 breast cancer patients and 768 diabetic patients. In order to evaluate the prediction model, 10-fold cross validation was applied to divide dataset into training and testing sets. 10 experiments were conducted to reduce the bias of the experiment. sensitivity, specificity and accuracy were used to measure the predictive performance of the model generated by each technique. Based on the study, it was found that Bagging can increase the accuracy of breast cancer prediction by 4.91%.

Keywords

Article Details

How to Cite
กลางนอกป., & ทองคำจ. (2019). Applying Ensemble Techniques for Improving the Performance of Rule-based Models in Data Mining. Journal of Industrial Technology Ubon Ratchathani Rajabhat University, 9(1), 97-108. Retrieved from https://www.tci-thaijo.org/index.php/jitubru/article/view/183422
Section
Research Article

References

[1] Jian Y et al. A Rule Based Approach to The Service Composition Life-cycle. Proceedings of The Fourth International Conference on Web Information Systems Engineering, 2003 WISE 2003. 2003: 295-298.
[2] Hühn J. and Hüllermeier E. Furia: An algorithm for unordered fuzzy rule induction. Data Miningand Knowledge Discovery. 2007; 19(3): 293–319.
[3] Stefanowski J. The rough set based rule induction technique for classification problems.
In Proceedings of 6th European Conference on Intelligent Techniques and Soft Computing EUFIT. 1998; 98: 109-113.
[4] Cohen W W. Fast effective rule induction. In Machine Learning Proceedings 1995. 1995: 115-123.
[5] Wichareung S. Applying Data Mining Technique in Loan Approval Process. Bangkok: Dhurakij Pundit University; 2010.
[6] Lotte F, Lécuyer A and Arnaldi B. Furia: An Inverse Solution Based Feature Extraction Algorithm Using Fuzzy Set Theory for Brain –computer interfaces.. IEEE Transactions on Signal Processing. 2009; 57(8): 3253-3263.
[7] Gil-Herrera E et all. Rough Set Theory Based Prognostic Classification Models for Hospice Referral. BMC Medical Informaitics and Decision Making. 2015; 15(1): 98.
[8] Thakur S, Meenakshi, E. and Priya, A, "Detection of malicious URLs in big data using RIPPER algorithm," in 2017 2nd IEEE International Conference on Recent Trends in Electronics, Information & Communication Technology, 2017, pp. 1296-1301.
[9] Belhadj S A, Benmoussat N and Della K M. A Bagging Svms to Learn from Event Related Potentials Using Electroencephalography. 2017 5th International Conference on Electrical Engineering - Boumerdes (ICEE-B). 2017: 1-7.
[10] Riana D and Hidayanto A N. Integration of Bagging and Greedy Forward Selection on Image Pap Smear Classification Using Naïve Bayes. 2017 5th International Conference on Cyber and IT Service Management (CITSM). 2017: 1-7.
[11] Basterrech S and Mesa A. Bagging Technique Using Temporal Expansion Functions. In al. PKe, editor. Proceedings of the Fifth Internal Conference on Innovation in Bio-Inspired Computer and Application. 2014: 395-404.
[12] Michalski R S, Mozetic I, Hong J. and Lavrac N. The Multi-Purpose Incremental Learning System AQ15 and its Testing Application to Three Medical Domains. In Proceedings of the Fifth National Conference on Artificial Intelligence, Philadelphia, PA: Morgan Kaufmann. 1986: 1041-1045.
[13] Strack, B., et al. Impact of HbA1c Measurement on Hospital Readmission Rates: Analysis of 70,000 Clinical Database Patient Records, BioMed Research International. 2014: 11.