Predicting Heart Diseases from Large Scale IoT Data Using a Map-Reduce Paradigm

Faris Mohammad Abd 2  and Mehdi Ebady Manaa 1
  • 1 University of Babylon, College of IT, Department of Information Networks
  • 2 University of Babylon, College of IT, Department of Information Networks


Over the last few years, the huge amount of data represented a major obstacle to data analysis. Big data implies that the volume of data undergoes a faster progress than computational speeds, thereby demanding a larger data storage capacity. The Internet of Things (IoT) is a main source of data that is closely related to big data, as the former extends to a variety of fields such as healthcare, entertainment, and disaster control. Despite the different advantages associated with the composition of Big Data analytics and IoT, there are a number of complex difficulties and issues involved that need to be resolved and managed to ensure an accurate data analysis. Some of these solutions include the utilization of map-reduce techniques, processing, and large data scale, particularly for the relatively less time that this method requires to process large data from the Internet of Things. Machine learning algorithms of this kind are often implemented in the healthcare sector. Medical facilities need to be advanced so that more appropriate decisions can be made in terms of patient diagnosis and treatment options. In this work, two datasets have been used: the first set, used in the prediction of heart diseases, obtained an accuracy rate of 84.5 for RF and 83 for J48, whereas the second dataset is related to weather stations (automated sensors) and obtained accuracy rates of 88.5 and 86.5 for RF and J48, respectively.

If the inline PDF is not rendering correctly, you can download the PDF file here.

  • [1] S. Kumar Basak, M. Wotto, and P. Bélanger, “E-learning, M-learning and D-learning: Conceptual definition and comparative analysis,” E-Learning Digit. Media, vol. 15, no. 4, pp. 191–216, 2018.

  • [2] J. Qi, P. Yang, G. Min, O. Amft, F. Dong, and L. Xu, “Advanced internet of things for personalised healthcare systems: A survey,” Pervasive Mob. Comput., vol. 41, pp. 132–149, 2017.

  • [3] M. Marjani et al., “Big IoT data analytics: architecture, opportunities, and open research challenges,” IEEE Access, vol. 5, pp. 5247–5261, 2017.

  • [4] Suzuki, Larissa R. "Smart cities IoT: Enablers and technology road map." In Smart City Networks, pp. 167-190. Springer, Cham, 2017

  • [5] J. M. Talavera et al., “Review of IoT applications in agro-industrial and environmental fields,” Comput. Electron. Agric., vol. 142, pp. 283–297, 2017.

  • [6] A. Sinha, P. Kumar, N. P. Rana, R. Islam, and Y. K. Dwivedi, “Impact of internet of things (IoT) in disaster management: a task-technology fit perspective,” Ann. Oper. Res., vol. 283, no. 1–2, pp. 759–794, 2019.

  • [7] M. Elhoseny, G. Ramírez-González, O. M. Abu-Elnasr, S. A. Shawkat, N. Arunkumar, and A. Farouk, “Secure medical data transmission model for IoT-based healthcare systems,” Ieee Access, vol. 6, pp. 20596–20608, 2018.

  • [8] Y. Mehmood, F. Ahmad, I. Yaqoob, A. Adnane, M. Imran, and S. Guizani, “Internet-of-things-based smart cities: Recent advances and challenges,” IEEE Commun. Mag., vol. 55, no. 9, pp. 16–24, 2017.

  • [9] Musleh, Ahmed S., Mahdi Debouza, and Mohamed Farook. "Design and implementation of smart plug: An Internet of Things (IoT) approach." In 2017 International Conference on Electrical and Computing Technologies and Applications (ICECTA), pp. 1-4. IEEE, 2017.

  • [10] A. Munir, P. Kansakar, and S. U. Khan, “IFCIoT: Integrated Fog Cloud IoT: A novel architectural paradigm for the future Internet of Things.,” IEEE Consum. Electron. Mag., vol. 6, no. 3, pp. 74–82, 2017.

  • [11] T. Economist, “Data, data everywhere: A special report on managing information,” Econ., 2010.

  • [12] Sagiroglu, Seref, and Duygu Sinanc. "Big data: A review." In 2013 international conference on collaboration technologies and systems (CTS), pp. 42-47. IEEE, 2013.

  • [13] Ishwarappa and J. Anuradha, “A brief introduction on big data 5Vs characteristics and hadoop technology,” Procedia Comput. Sci., vol. 48, no. C, pp. 319–324, 2015.

  • [14] Y. Dandawate, “Big Data: Challenges and Opportunities, volume 11 of Infosys Labs Briefings. Infosys Labs.” 2013.

  • [15] S. Regha and M. Manimekalai, “HADOOP VS BIG DATA,” 2015.

  • [16] Z. Lu, N. Wang, J. Wu, and M. Qiu, “IoTDeM: An IoT Big Data-oriented MapReduce performance prediction extended model in multiple edge clouds,” J. Parallel Distrib. Comput., vol. 118, pp. 316–327, 2018.

  • [17] T. White, “Hadoop: The Definitive Guide, Yahoo.” Press, 2010.

  • [18] Z. Khanam and S. Agarwal, “Map-reduce implementations: survey and performance comparison,” Int. J. Comput. Sci. Inf. Technol.(IJCSIT), vol. 7, no. 4, 2015.

  • [19] T. S. Buda, H. A. A. A. Salama, L. Xu, P. J. O’sullivan, C. Thorpe, and L. Almeida, “Data replication in a distributed file system.” Google Patents, 26-Sep-2019.

  • [20] P. Vishruti and M. D. Ingle, “Disease Analytics in Healthcare System Using Hadoop,” Int. J. Mod. Trends Sci. Technol., vol. 4, 2018

  • [21] Z. Lu, N. Wang, J. Wu, and M. Qiu, “IoTDeM: An IoT Big Data-oriented MapReduce performance prediction extended model in multiple edge clouds,” J. Parallel Distrib. Comput., vol. 118, pp. 316–327, 2018.

  • [22] T. Nagamani, S. Logeswari, and B. Gomathy, “Heart Disease Prediction using Data Mining with Mapreduce Algorithm,” no. 3, pp. 137–140, 2019.

  • [23] T. Mylsami and B. L. Shivakumar, “Role of Map reduce Algorithm to Improve,” vol. 5, no. 11, pp. 47–51, 2017.

  • [24] A. Kamilaris, A. Kartakoullis, and F. X. Prenafeta-Boldú, “A review on the practice of big data analysis in agriculture,” Comput. Electron. Agric., vol. 143, no. September, pp. 23–37, 2017, doi:10.1016/j.compag.2017.

  • [25] T. Mahboob, S. Irfan, and A. Karamat, “A machine learning approach for student assessment in E-learning using Quinlan’s C4. 5, Naive Bayes and Random Forest algorithms,” in 2016 19th International Multi-Topic Conference (INMIC), pp. 1-8. IEEE, 2016.

  • [26] T. M. Lakshmi, A. Martin, R. M. Begum, and V. P. Venkatesan, “An analysis on performance of decision tree algorithms using student’s qualitative data,” Int. J. Mod. Educ. Comput. Sci., vol. 5, no. 5, p. 18, 2013.

  • [27] F. Paquin, J. Rivnay, A. Salleo, N. Stingelin, and C. Silva, “Multi-phase semicrystalline microstructures drive exciton dissociation in neat plastic semiconductors,” J. Mater. Chem. C, vol. 3, pp. 10715–10722, 2015.

  • [28] M. Belouch, S. El Hadaj, and M. Idhammad, “A two-stage classifier approach using reptree algorithm for network intrusion detection,” Int. J. Adv. Comput. Sci. Appl., vol. 8, no. 6, pp. 389– 394, 2017.

  • [29] S. Gupta and N. Verma, “Comparative Analysis of classification Algorithms using WEKA tool,” Int. J. Sci. Eng. Res., vol. 2018, 2014.

  • [30] Banos, Oresti, Rafael Garcia, Juan A. Holgado-Terriza, Miguel Damas, Hector Pomares, Ignacio Rojas, Alejandro Saez, and Claudia Villalonga. "mHealthDroid: a novel framework for agile development of mobile health applications." In International workshop on ambient assisted living, pp. 91-98. Springer, Cham, 2014.


Journal + Issues