Skip to content
BY-NC-ND 3.0 license Open Access Published by De Gruyter Open Access May 29, 2012

Instance-based regression with missing data applied to a photocatalytic oxidation process

Florin Leon, Ciprian Piuleac, Silvia Curteanu and Ioannis Poulios
From the journal Open Chemistry


In this paper, a modified nearest-neighbor regression method (kNN) is proposed to model a process with incomplete information of the measurements. This technique is based on the variation of the coefficients used to weight the distances of the instances. The case study selected for testing this algorithm was the photocatalytic degradation of Reactive Red 184 (RR184), a dye belonging to the group of azo compounds, which is widely used in manufacturing paint paper, leather and fabrics. The process is conducted with TiO2 as catalyst (an inexpensive semiconductor material, completely inert chemically and biologically), in the presence of H2O2 (with the role of increasing the rate of photo-oxidation), at different pH values. The final concentration of RR184 is predicted accurately with the modified kNN regression method developed in this article. A comparison with other machine learning methods (sequential minimal optimization regression, decision table, reduced error pruning tree, M5 pruned model tree) proves the superiority and efficiency of the proposed algorithm, not only for its results, but for its simplicity and flexibility in manipulating incomplete experimental data.

[1] P. Anjali, S. Poonam, I. Leela, Int. Biodeter. Biodegr. 59, 73 (2007) in Google Scholar

[2] K.H. Gregor, In: W. Wesley Eckenfelder and A. Bowers (Eds.), Chemical Oxidation (J. Roth, Lancaster, Pensilvania, USA, 1994) Vol. 1-6 10.2166/wst.1994.0370Search in Google Scholar

[3] D. Bahnemann, J. Cunningham, M.A. Fox, E. Pelizzetti, P. Pichat, N. Serpone, In: G. Helz, R. Zepp, D. Crosby (Eds.), Photocatalytic treatment of waters, in Aquatic and Surface Photochemistry (Lewis Publs., Boca Raton, FL, 1994) 261 10.1201/9781351069847-23Search in Google Scholar

[4] M.R. Hoffman, S. Martin, W. Choi, D.W. Bahnemann, Chem. Rev. 95, 69 (1995) in Google Scholar

[5] D.Y. Goswami, In: K.W. Boer (Ed.), Engineering of the Solar Photocatalytic Detoxification and Disinfection Processes, in Advances in Solar Energy (American Solar Energy Society Inc., Boulder, Colorado, 1995) Vol. 10, 165 Search in Google Scholar

[6] S. Malato, J. Blanco, C. Richter, M. Maldonado, Appl. Catal. B: Environ. 37, 1 (2002) in Google Scholar

[7] C.G. Piuleac, I. Poulios, S. Curteanu, Env. Eng. Manag. J. 8, 439 (2009) 10.30638/eemj.2009.059Search in Google Scholar

[8] I. Poulios, A. Papathanasiou, E. Ntarakas, H. Xatziefangelou, E. Papachristou, In: Seventh National Conference on Renewable Energy Sources, 6–8 November 2002, Patras, Greece, (Patras 2002) Search in Google Scholar

[9] R.D. Cook, S. Weisberg, Sociol. Methodol. 13, 313 (1982) in Google Scholar

[10] Q. Li, J. S. Racine, Nonparametric Econometrics: Theory and Practice (Princeton University Press, Princeton, New Jersey, USA, 2006) Search in Google Scholar

[11] A. Navot, L. Shpigelman, N. Tishby, E. Vaadia, Advances in Neural Information Processing Systems 18, 995 (2006) Search in Google Scholar

[12] D. Shepard, In: 23rd ACM National Conference, 27–29 Aug. 1968, New York, USA (Brandon Systems Press, Princeton, New Jersey, USA, 1968) 524 Search in Google Scholar

[13] D. Ruprecht, H. Müller, In: 5th Eurographics Workshop on Visualization in Scientific Computing, 30 May–1 Jun. 1994, Rostock, Germany (Eurographics Association and Blackwell Publishers, Oxford, UK 1994) 517 Search in Google Scholar

[14] G. Wolberg, Digital Image Warping (IEEE Computer Society Press, Los Alamitos, Californica, USA, 1990) Search in Google Scholar

[15] P. Deheuvels, RSA 25, 5 (1977) Search in Google Scholar

[16] M.P. Wand, W.R. Schucany, Can. J. Stat. 18, 197 (1990) in Google Scholar

[17] D.W. Aha, R.L. Goldstone, In: Proceedings of the 14th Annual Conference of the Cognitive Science Society, 29 July–1 August (Indiana University, Bloomington, USA, 1992) 534 Search in Google Scholar

[18] G.C. Atkeson, A.W. Moore, S. Schaal, J. Artif. Intell. Rev. 11, (1997) 10.1007/978-94-017-2053-3_2Search in Google Scholar

[19] T.M. Cover, P.E. Hart, Nearest neighbor pattern classification, IEEE Transactions on Information Theory 13(1), 21 (1967) in Google Scholar

[20] J.L. Bentley, Comm. ACM 18, 509 (1975) in Google Scholar

[21] V.N. Vapnik, The Nature of Statistical Learning Theory (Springer-Verlag, Berlin, Germany 1995) 10.1007/978-1-4757-2440-0Search in Google Scholar

[22] J.C. Platt, Sequential Minimal Optimization: A Fast Algorithm for Training Support Vector Machines, Advances in Kernel Methods — Support Vector Learning, Technical Report MSR-TR-98-14, Microsoft Research (Microsoft Press, Redmond, Washington, USA, 1998) Search in Google Scholar

[23] B. Schölkopf, A.J. Smola, Learning with Kernels (MIT Press, 2002) Search in Google Scholar

[24] S.K. Shevade, S.S. Keerthi, C. Bhattacharyya, K.R.K. Murthy, IEEE Transactions on Neural Networks 11, 1188 (2000) in Google Scholar

[25] R. Kohavi, In: 8th European Conference on Machine Learning, 25–27 Apr. 1995, Heraclion, Crete, Greece (Springer, Berlin-Heidelberg-New York 1995) 174 Search in Google Scholar

[26] R.J. Quinlan, Mach. Learn. 1, 81 (1986) 10.1007/BF00116251Search in Google Scholar

[27] R.J. Quinlan, In: 5th Australian Joint Conference on Artificial Intelligence, 16–18 Nov. 1992, Hobart, Tasmania, Australia (World Scientific, Singapore 1992) 343 Search in Google Scholar

[28] G.D. Suditu, M. Secula, C.G. Piuleac, S. Curteanu, I. Poulios, Rev. Chim.-Bucharest 59, 816 (2008) 10.37358/RC.08.7.1901Search in Google Scholar

[29] C.G. Piuleac, I. Poulios, F. Leon, S. Curteanu, A. Kouras, Sep. Sci. Technol. 45, 1644 (2010) in Google Scholar

[30] F.A. Caliman, S. Curteanu, C. Betianu, M. Gavrilescu, I. Poulios, J. Adv. Oxid. Technol. 11, 316 (2008) 10.1515/jaots-2008-0217Search in Google Scholar

[31] F. Leon, S. Curteanu, C. Lisa, N. Hurduc, Mol. Cryst. Liq. Cryst. 469, 1 (2007) in Google Scholar

[32] C. Lisa, S. Curteanu, V. Bulacovschi, D. Apreutesei, Rev. Roum. Chim. 53(4), 283 (2008) Search in Google Scholar

[33] C. Lisa, S. Curteanu, Comp. Aided Chem. Eng. 24, 39 (2007) in Google Scholar

[34] M. Hall, E. Frank, G. Holmes, B. Pfahringer, P. Reutemann, I.H. Witten, SIGKDD Explorations 11 (2009) 10.1145/1656274.1656278Search in Google Scholar

Published Online: 2012-5-29
Published in Print: 2012-8-1

© 2012 Versita Warsaw

This work is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 3.0 License.

Scroll Up Arrow