Data Mining using Inductive Modeling Approach. Godfrey C. Onwubolu. IWIM, Prague, 2007.

 Article (in pdf)

Abstract. The rate at which organizations are acquiring data is getting out of proportion and managing such data so as to infer useful knowledge that can be put to use is increasingly becoming important and challenging. Data Mining (DM) is one such relatively recently technology that has emerged that is employed in inferring useful knowledge that can be put to used from a vast amount of data. This paper proposes a new design methodology which is a hybrid of differential evolution (DE) and Group Method of Data Handling (GMDH) for self-organizing data mining. The new hybrid implementation is applied to the data mining activity of prediction of soil moisture, which is an aspect of hydrology. Experimental results indicate that the proposed approach is useful for data mining technique for forecasting hydrological data.

Keywords. Inductive modeling; Self-Organizing Data Mining, DE, GMDH

References.

  1. Codd, E. F. 1993. Providing OLAP (On-Line Analytical Processing) to User-Analysts: An IT Mandate. E. F. Codd and Associates.
  2. Fayyad, U. M.; Djorgovski, S. G.; and Weir, N. 1996. From Digitized Images to On-Line Catalogs: Data Mining a Sky Survey. AI Magazine 17(2): 51–66.
  3. Fayyad, U. M.; Haussler, D.; and Stolorz, Z. 1996. KDD for Science Data Analysis: Issues and Examples. In Proceedings of the Second International Conference on Knowledge Discovery and Data Mining (KDD-96), 50–56. Menlo Park, Calif.: American Association for Artificial Intelligence.
  4. Fayyad, U. M.; Piatetsky-Shapiro, G.; and Smyth, P. 1996. From Data Mining to Knowledge Discovery: An Overview. In Advances in Knowledge Discovery and Data Mining, eds. U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, 1–30. Menlo Park, Calif.: AAAI Press.
  5. Fayyad, U. M.; Piatetsky-Shapiro, G.; Smyth, P.; and Uthurusamy, R. 1996. Advances in Knowledge Discovery and Data Mining. Menlo Park, Calif.: AAAI Press.
  6. Brachman, R., and Anand, T. 1996. The Process of Knowledge Discovery in Databases: Human-Centered Approach. In Advances in Knowledge Discovery and Data Mining, 37–58, eds. U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy. Menlo Park, Calif.: AAAI Press.
  7. Elder, J., and Pregibon, D. 1996. A Statistical Perspective on KDD. In Advances in Knowledge Discovery and Data Mining, eds. U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, 83–116. Menlo Park, Calif.: AAAI Press.
  8. Quinlan, J. 1992. C4.5: Programs for Machine Learning. San Francisco, Calif.: Morgan Kaufmann.
  9. Breiman, L.; Friedman, J. H.; Olshen, R. A.; and Stone, C. J. 1984. Classification and Regression Trees.Belmont, Calif.: Wadsworth.
  10. Friedman, J. H. 1989. Multivariate Adaptive Regression Splines. Annals of Statistics
  11. Dasarathy, B. V. 1991. Nearest Neighbor (NN) Norms: NN Pattern Classification Techniques. Washington, D.C.: IEEE Computer Society.
  12. Heckerman, D. 1996. Bayesian Networks for Knowledge Discovery. In Advances in Knowledge Discovery and Data Mining, eds. U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, 273–306. Menlo Park, Calif.: AAAI Press.
  13. Whittaker, J. 1990. Graphical Models in Applied Multivariate Statistics. New York: Wiley.
  14. Pearl, J. 1988. Probabilistic Reasoning in Intelligent Systems. San Francisco, Calif.: Morgan Kaufmann.
  15. Dzeroski, S. 1996. Inductive Logic Programming for Knowledge Discovery in Databases. In Advances in Knowledge Discovery and Data Mining, eds. U. Fayyad, G. Piatetsky-Shapiro, P. Smyth, and R. Uthurusamy, 59–82. Menlo Park, Calif.: AAAI Press. 19:1–141.
  16. Ivakhnenko, A. G.; 1971. Polynomial Theory of Complex Systems, IEEE Transactions on Systems, Man, and Cybernetics, pp. 364-378.
  17. Madala, H. R. Ivakhnenko, A. G.; 1994. Inductive Learning Algorithms for Complex Systems Modeling, CRC Press, Boca Raton.
  18. Mueller, J-A, and Lemke, F.; 1999. Self-organizing Data Mining: An Integrated Approach to Extract Knowledge From Data, Dresden, Berlin.
  19. Oh, S. K., Pedrycz, W.; 2002. The design of self-organizing polynomial neural networks, Inf. Sci. 141 237-258.
  20. Elder IV, J. F., and Brown, D. E.: Inductive and Polynomial Networks, Technical Report: IPC-TR-92-009, Institute for Parallel Computation and Department of Systems Engineering, University of Virginia, Charlottesville, VA, USA, 1992
  21. Press, W H, Teukolsky, S A, Vetterling, W. T., Flannery, B. P.; 1992. “Numerical recipes in C: The art of scientific computing”. Cambridge University Press.
  22. Nariman-Zadeh, N., Darvizeh, A., and Ahmad-Zadeh, G. R., Hybrid genetic design of GMDH-type neural networks using singular value decomposition for modeling and predicting of the explosive cutting process, Nariman-Zadeh, Proc. Instn Mech. Engrs Vol 217 Part B: Nariman-Zadeh, 779-790.
  23. Davendra, D., Hybrid Differential Evolution and Scatter Search, MSc Thesis, The University of the South Pacific, 2003.
  24. Onwubolu, G. C., and Davendra, D., Scheduling flow shops using differential evolution algorithm, European Journal of Operational Research, 171, 674-692, 2006.
  25. Davendra, D., and Onwubolu, G. C., Scheduling flow shops using enhanced differential evolution algorithm, European Conference on Modeling and Simulation (ECMS), Prague, Czech, 2007.
  26. Onwubolu, G. C., Design of Hybrid Differential Evolution and Group Method of Data Handling for Inductive Modeling, International Workshop on Inductive Modeling, Prague, Czech, September 23-26, 2007
  27. S.-K. Oh, W. Pedrycz, and B.-J. Park, “Polynomial neural networks architecture: analysis and design”, Computers and Electrical Engineering, vol. 29, no. 6, pp. 703-725, 2003.
  28. S.-K. Oh, Park, B.-J., and Kim, H.-K, Genetically optimized hybrid fuzzy neural networks based on linear fuzzy inference rules, International Journal of Control, Automation, and Systems, 3(2), pp. 183-194, 2005.
Last modified by anonymous on 11/05/07 06:08:10 (4 years ago)