The Ensemble of Unsupervised Incremental Learning Algorithm for Time Series Data

Document Type : Original Article

Authors

1 Aditya Engineering College, Surampalem, AP, India

2 CIIS, Swarnandhra College of Engineering. & Tech. (Autonomous), Narsapur, AP, India

Abstract

Data mining is one of the key concepts to discover hidden knowledge from available data. Along with the data mining, data analytics is a field to analyze and process data in a scientific and cognitive angle. It is more helpful to convert knowledge to actionable knowledge for accurate decision making. Data Stream Mining is another challenging area than normal Data Mining due to its dynamics. Dynamics of data in a stream includes changes in data frequency, volume and nature. This paper focuses on the behavior of data mining of machines in process/manufacturing industries. In general, such data is continuous numerical and time series data captured by various industrial sensors. By nature, equipment or machinery behaviour can change over time. It requires calibration/replacement before failure of machinery. By analyzing data, one can find the behavior or state change. To identify that, dynamic models are required to be built using data mining and data stream mining. Thus, we are following a semi-novel approach for building such models using “Ensemble of Unsupervised Incremental Learning" method. Results show how the existing methods are different from the proposed method. This method can be applied for any other domain like image/audio/video or text mining.

Keywords

Main Subjects


  1. Silver, D.L., Yang, Q. and Li, L., "Lifelong machine learning systems: Beyond learning algorithms", in 2013 AAAI spring symposium series. (2013).
  2. Silver, D.L., "Machine lifelong learning: Challenges and benefits for artificial general intelligence", in International conference on artificial general intelligence, Springer. (2011), 370-375.
  3. Luo, Y., Yin, L., Bai, W. and Mao, K.J.E., "An appraisal of incremental learning methods", Vol. 22, No. 11, (2020), 1190, doi: 10.3390/e22111190.
  4. Rico-Juan, J.R. and Iñesta, J.M.J.N., "Adaptive training set reduction for nearest neighbor classification", Vol. 138, (2014), 316-324, doi: 10.1016/j.neucom.2014.01.033.
  5. Morales, G.D.F. and Bifet, A.J.J.M.L.R., "Samoa: Scalable advanced massive online analysis", Vol. 16, No. 1, (2015), 149-153, doi.
  6. O'callaghan, L., Mishra, N., Meyerson, A., Guha, S. and Motwani, R., "Streaming-data algorithms for high-quality clustering", in Proceedings 18th International Conference on Data Engineering, IEEE. (2002), 685-694.
  7. Wankhade, K.K., Jondhale, K.C. and Dongre, S.S.J.A.S.C., "A clustering and ensemble based classifier for data stream classification", Vol. 102, (2021), 107076, doi: 10.1016/j.asoc.2020.107076.
  8. Dubey, A.K., Gupta, R. and Mishra, S., "Data stream clustering for big data sets: A comparative analysis", in IOP Conference Series: Materials Science and Engineering, IOP Publishing. Vol. 1099, No. 1, (2021), 012030.
  9. Aggarwal, C.C., Philip, S.Y., Han, J. and Wang, J., "A framework for clustering evolving data streams", in Proceedings 2003 VLDB conference, Elsevier. (2003), 81-92.
  10. Greenwald, M. and Khanna, S.J.A.S.R., "Space-efficient online computation of quantile summaries", Vol. 30, No. 2, (2001), 58-66, doi: 10.1145/1055558.1055598.
  11. Arasu, A. and Manku, G.S., "Approximate counts and quantiles over sliding windows", in Proceedings of the twenty-third ACM SIGMOD-SIGACT-SIGART symposium on Principles of database systems. (2004), 286-296.
  12. Hautamaki, V., Nykanen, P. and Franti, P., "Time-series clustering by approximate prototypes", in 2008 19th International conference on pattern recognition, IEEE. (2008), 1-4.
  13. Lian, X., Chen, L.J.I.T.o.K. and Engineering, D., "Efficient similarity search over future stream time series", Vol. 20, No. 1, (2007), 40-54, doi: 10.1109/TKDE.2007.190666.
  14. Parthasarathy, D., Shah, D. and Zaman, T.J.a.p.a., "Leaders, followers, and community detection", (2010), arXiv preprint arXiv:1011.0774.
  15. Davies, D.L., Bouldin, D.W.J.I.t.o.p.a. and intelligence, m., "A cluster separation measure", No. 2, (1979), 224-227, doi: 10.1109/T PAMI.1979.4766909.