References :
- Sheikhpour R, Hekmat Moghadam H. The effect of estrogen on p53 protein in T47D breast cancer cell line. Razi J Med Sci 2015; 22(133): 51-8. [Article in Farsi]
- Torkaman A, Charkari NM, Aghaeipour M. An approach for leukemia classification based on cooperative game theory. Anal Cell Pathol(Amst) 2011; 34(5): 235-46.
- Zand AM, Imani S, Saadati M, Borna H, Ziaei R, Honari H. Effect of age, gender and blood group on blood cancer types. Kowsar Med J 2010 15(2): 111-4. [Article in Farsi]
- Zali H, Amini R, Shiri Haris R. Gene expression analysis of leukemia microarray data by David program . J Ilam Uni Med Sci 2013; 21(2): 92-102. [Article in Farsi]
- Parsa N. Environmental factors, genes and human cancers. Sci Cultivation J 2012; 2(1): 12-9. [Article in Farsi]
- Sheikhpour R, Ghasemi N, Yaghmaei P, Mohiti J. Immunohistochemical assessment of p53 protein and its correlation with clinicopathological parameters in breast cancer patients. Indian J Sci Technol 2014; 7(4): 472-9.
- Sheikhpour R, Aghaseram M, Sheikhpour R. Diagnosis of acute myeloid and lymphoblastic leukemia using gene selection of microarray data and data mining algorithm. Sci J Iran Blood Transfus Organ 2016; 12 (4): 347-57. [Article in Farsi]
- Duggan DJ, Bittner M, Chen Y, Meltzer P, Trent JM. Expression profiling using cDNA microarrays. Nat Genet 1999; 21(1 Suppl): 10-4.
- Golub TR, Slonim DK, Tamayo P, Huard C, Gaasenbeek M, Mesirov JP, et al. Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. Science
1999; 286(15): 530-8.
- Azadi NA, Nouri – Jaliani K, Taheri - Kalani M. Identifying differentially expressed genes based on their expressions in leukemia. Koomesh 2005; 6(4): 259-64. [Article in Farsi]
- Vahedi M, Alavi Majd H, Mehrabi Y, Naghavi B. Gene expression data clustering and its application in differential analysis of leukemia. J Semnan Uni Med Sci 2008; 9(2): 163-8. [Article in Farsi]
- Joroughi M, Shamsi M, Saberkari HR, Sedaaghi MH, Momennezhad A. Gene selection and cancer classification based on microarray data using combined BPSO and BLDA algorithm. Computational Intelligence Electrical Engineering 2014; 5(2): 29-47 [Article in Farsi]
- Eisen MB, Spellman PT, Brown PO, Botstein D. Cluster analysis and display of genome-wide expression patterns. Proc Natl Acad Sci U S A 1998; 95(25): 14863-8.
- Habeck M. DNA microarray technology to revolutionise cancer treatment. Lancet Oncol 2001; 2(1): 5.
- Alavimajd H, Vahedi M, Mehrabi Y, Naghavi B. Clustering approach in DNA microarray analysis. Research in Medicine 2007; 31(1): 19-25. [Article in Farsi]
- Alba E, García-Nieto J, Jourdan L, Talbi EG. Gene selection in cancer classification using PSO/SVM and GA/SVM hybrid algorithms. Congr Evol Comput Singapore 2007; 1-7. Available from: file:///C:/Documents%20and%20Settings/m.mokhtari/My%20Documents/Downloads/JMcec2007.pdf.
- Mahmoud AM, Maher BA, El-Horbaty ES, Salem AB. Analysis of machine learning techniques for gene selection and classification of micoarray data. ICIT
2013. 6th Int Conf Inform Technol 2013; 1-9.
- Cho SB, Won HH. Machine learning in DNA microarray analysis for cancer classification. Bioinformatics 2003; 19: 189-98.
- Nguyen DV, Rocke DM. Tumor classification by partial least squares using microarray gene expression data. Bioinformatics 2002; 18(1): 39-50.
- Li L, Weinberg CR, Darden TA, Pedersen LG. Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method. Bioinformatics 2001; 17(12): 1131-42.
- Chen AH, Lin EJ. The prediction of cancer classification using a novel multi-task support vector sample learning technique. AISS: Adv Inform Sci Serv
Sci 2011; 3(3): 92-9.
- Wang Z, Palade V, Xu Y. Neuro-fuzzy ensemble approach for microarray cancer gene expression data analysis. In Evolving Fuzzy Systems. In: International Symposium on Evolving Fuzzy Systems 2006; p. 241-6.
- Cai H, Ruan P, Ng M, Akutsu T. Feature weight estimation for gene selection: a local hyperlinear learning approach. BMC Bioinformatics 2014; 15(1): 70.
- Zhang H, Wang H, Dai Z, Chen M.S., Yuan Z. Improving accuracy for cancer classification with a new algorithm for genes selection. BMC Bioinformatics 2012; 13(1): 298.
Sci J Iran Blood Transfus Organ 2016; 13(3): 207-214
|
A new approach for diagnosis of Acute Myeloid and Lymphoblastic Leukemia using gene expression profile
and machine learning techniques
Sheikhpour R.1,2, Sheikhpour R.3, Aghasaram M.3
1Department of Physical Activity & Sport Science, Taft Branch, Islamic Azad University, Taft, Iran
2Hematology & Oncology Research Center, Shahid Sadoughi University of Medical Sciences, Yazd, Iran
3Department of Computer Engineering, Yazd University, Yazd, Iran
Abstract
Background and Objectives
Leukemia is a cancer type in the world. One of the most accurate methods for detection and prediction of Acute Myeloid Leukemia and Acute Lymphoblastic Leukemia is to use DNA and genetic information of people. Microarray technology is a tool to study the expression of thousands of genes in shortest possible time. Analyzing the microarray datasets may not be possible without the statistical analysis and machine learning techniques. In this paper, microarray data sets and machine learning techniques are used for the diagnosis of leukemia.
Materials and Methods
The data used in this descriptive study are the expression of 7129 genes of 72 patients with leukemia which have been achieved by the microarray technology. Then, the diagnosis of AML and ALL was performed using the microarray data based on anisotropic radial basis function with the gain ratio and information gain.
Results
The proposed method using information gain with the selection of 230 important genes and using gain ratio with the selection of 86 important genes was able to detect AML and ALL with accuracy of 97.06%, whereas non-parametric kernel classification method based on the radial basis function has the accuracy of 35.29
٪ with 7129 genes.
Conclusions
The results of this study showed that the gene expression data and proposed method with gain ratio method are able to detect leukemia with high accuracy. Therefore, it seems that proposed method can help to accurately diagnose leukemia for a better decision making about the diagnosis of diseases and treatment of patients.
Key words: Leukemia, Gene Expression, Microarray Analysis, Machine Learning
Received: 26 Dec 2015
Accepted: 11 May 2016
Correspondence: Sheikhpour R., PhD of Biochemistry. Department of Physical Activity & Sport Science, Taft Branch, Islamic Azad University and Hematology & Oncology Research Center, Shahid Sadoughi University of Medical Sciences.
P.O.Box: 89156-56965, Yazd, Iran. Tel: (+9835) 36282884; Fax: (+9835) 36235958
E-mail:
robab.sheikhpour@iauyazd.ac.ir