Madadkar A, karimlo M, rahgozar M, jamaldini H, mozaffari R. Application and Comparison of Random Forest and CART in Genetic Association Study in Coronary Artery Disease. MEJDS 2013; 3 (2) :1-8
URL:
http://jdisabilstud.org/article-1-341-fa.html
مددکار آتوسا، کریملو مسعود، رهگذر مهدی، جمال الدینی حمید، مظفری رضا. کاربرد و مقایسه روشهای یادگیری ماشینی رندم فارست و درخت کلاس بندی-رگرسیونی در مطالعه وابستگی ژنتیکی در بیماران عروق کرونری. مجله مطالعات ناتوانی. 1392; 3 (2) :1-8
URL: http://jdisabilstud.org/article-1-341-fa.html
چکیده: (11328 مشاهده)
Objective: In the studies of genomics, it is essential to select a small number of single nucleotide polymorphisms (SNPs). That is more significant than the others for the association studies of disease susceptibility. Data mining technology provides an important means for extracting valuable medical rules hidden in medical data and acts as an important role in disease prediction and clinical diagnosis. In this study, our goal was to compare two machine learning methods using genetic factor and single nucleotide polymorphisms.
Methods: In order to perform the data analysis, a total of 141 patients and 83 controls in the genetics' section of Shahid Rajaee's heart center. The blood samples to draw conclusions about the LDLR and PCSK9 genes' SNPs was used. Also, the random forest and CART was used in order to discover the relationship between CAD and SNPs. These models were assessed by using four criteria including: sensitivity, specificity, precision and error. Data analysis was performed by SPSS (16.0) and R (2.15.0).
Results: CART had the better performance than Random Forest. Sensitivity, specificity, precision and error were 0.893, 0.506, 0.250 and 0.754 relatively. We introduced an algorithm to classify the high risk and low risk cases.
Conclusion:CART is suggested in order to assess the relationship between CAD and SNPs.
نوع مطالعه:
مقاله پژوهشی اصیل |
موضوع مقاله:
توانبخشی