Hierarchical Clustering in Temporal Databases
International Journal of Recent Engineering Science (IJRES) | |
|
© 2014 by IJRES Journal | ||
Volume-1 Issue-1 |
||
Year of Publication : 2014 | ||
Authors : Dr.G.Sreesharma |
||
DOI : 10.14445/23497157/IJRES-V1I1P104 |
How to Cite?
Dr.G.Sreesharma, "Hierarchical Clustering in Temporal Databases," International Journal of Recent Engineering Science, vol. 1, no. 1, pp. 17-23, 2014. Crossref, https://doi.org/10.14445/23497157/IJRES-V1I1P104
Abstract
This project report discusses about the design and implementation of an incremental system for clustering the data stream that are present in time series databases. The Online Divisive-Agglomerative Clustering system implemented in this project work continuously maintains a tree-like hierarchy of clusters that evolves with data, using a top-down strategy. Using correlation-based dissimilarity measure, each node is split by the farthest pair of streams. This system uses a merge operator which re-aggregates a previously split node, in order to react to changes in the correlation structure between time series. The split and merge operators act in response to changes in the diameters of existing clusters. Expanding the structure in this way leads to a decrease in the diameters of the clusters. This system has been designed to process thousands of data streams that flow at high rate. The main advantage of this system is the reduction in update time and memory usage. This system has been implemented using Java language in Windows Platform.
Keywords
Hierarchical Clustering, Database and System
Reference
[1] P. Rodrigues P, J. Gama, and J. P. Pedroso, “Hierarchical Clustering Of Time Series Data Streams”, in Proceedings of the Sixth International Conference on Knowledge Discovery and Data Mining. , Vol 20, No 5, pp. 615 -627.
[2] P. Rodrigues P, J. Gama, and J. P. Pedroso, “ODAC: Hierarchical Clustering Of Time Series Data Streams,” in Proceedings of the Sixth SIAM International Conference on Data Mining, April 2006, pp. 499 -503.
[3] Ying Zhao and George Karypis, ”Hierarchical Clustering Algorithms For Document Dataset” University of Minnesota, Technical Report pp 1 -22, 2003.
[4] A.Kannan, C.J.Date and S.Swamynathan, “Introduction to Database Systems”, Pearson Education Asia Ltd, 2006
[5] Jessica Lin, Michail Vlachos, Eamonn Keogh and Dimitrios Gunopulos, “Iterative Incremental Clustering of Time Series,” IEEE Transactions on Knowledge and Data Engineering, Vol. 15, No. 3, pp. 515 -528, 2003.
[6] Bruce Walter, Kavitha Bala, Milind Kulkarni, Keshav Pingali, “Fast Agglomerative Clustering for Rendering,” Journal of Universal Computer Science,Vol.11,No.8, pp. 1426-1439, 2007
[7] Sergio M. Savaresi, Daniel L.Boley, Sergio Bittanti and Giovanna Gazzaniga ,” Cluster Selection in Divisive Clustering algorithms” University of Minnesota, Technical Report pp 24 -51, 2005.
[8] Martin Ester, Hans-Peter Kreigel, Jorg Sander, Michael Wimmer, Xiaowei Xu,” Incremental Clustering for Mining in a Data Warehousing Environment” Proceedings of the 24th International Conference on Very Large Data Bases, pp 323 -333, 1998.