Notice of Retraction Dynamic clustering of data with modified K-prototype algorithm

Anupama Chadha

Abstract


Notice of Retraction

-----------------------------------------------------------------------
After careful and considered review of the content of this paper by a duly constituted expert committee, this paper has been found to be in violation of APTIKOM's Publication Principles.

We hereby retract the content of this paper. Reasonable effort should be made to remove all past references to this paper.

The presenting author of this paper has the option to appeal this decision by contacting ij.aptikom@gmail.com.

-----------------------------------------------------------------------

Clustering of mixed numerical and categorical data has become a challenge now a days. A number of algorithms dealing with mixed data have been proposed. Speed and simplicity are the two major features that have made the K-Prototype algorithm a famous partition based clustering algorithm. This algorithm has a constraint of providing the value of K initially and sometimes predicting the optimum number of clusters in advance becomes practically impossible. In this paper, a new algorithm based on the K-Prototype algorithm for clustering mixed data with advanced features for automatic generation of appropriate number of clusters is presented.


References


Cheung, Y, Jia, H. Categorical-and-Numerical-Attribute Data clustering based on a Unified Similarity Metric without Knowing Cluster Number. Pattern Recognition. 2013; 46: 2228-2238.

Liang, Jiye, Zhao, Xingwang, Li, Deyu, Cao, Fuyuan, Dang, Chuangyin. Determining the Number of Clusters using Information Entropy for Mixed Data. Pattern Recognition. 2012; 45(6): 2251-2265.

Chadha, Anupama, Kumar, Suresh. Extension to the K-Means Algorithm for Automatic Generation of Clusters for Mixed Datasets. International Conference on Advances on Computers, Communication and Electronic Engineering, 2015: 480-485.

Ahmad, A, Dey, L. A K-Mean Clustering Algorithm for Mixed Numeric and Categorical Data. Data & Knowledge Engineering. 2007; 63: 503-527.

RapidMiner.com, available from: . [03 October 2016].

Keel Data Repository, available from: . [01 September 2016].

Jacob, Gracia, Shomona, Ramani, Geetha, R. Evolving Efficient clustering and Classification Patterns in Lymphography Data through Data Mining Techniques. International Journal on Soft Computing. 2012; 3(3): 119-132.

He, Zengyou, Xu, Xiaofei, Deng, Shengchun. Attribute value weighting in k-modes clustering, Expert Systems with Applications. 2011; 38(12): 15365-15369.

Ienco, Dino, Pensa, G, Riggero, Meo, Rosa. From Context to Distance: Learning Dissimilarity for Categorical Data Clustering. ACM Transactions on Knowledge Discovery from Data. 2012; 6(1): 1-25.

Asok, Aswathy, Jisha, J, T, Ashok, Sreeja, Judy, V, M. Integrated Framework Using Frequent Pattern for Clustering Numeric and Nominal Data Sets. Proceedings of International Conference on ICT for Sustainable Development. 2016; 408: 523-530.




DOI: https://doi.org/10.11591/APTIKOM.J.CSIT.85

Refbacks

  • There are currently no refbacks.


Copyright (c) 2019 APTIKOM Journal on Computer Science and Information Technologies



ISSN: 2722-323X, e-ISSN: 2722-3221

CSIT Stats

 

Creative Commons License
This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.