Twitter Data Clustering on issues of Children with Special Needs using Hybrid Topic Models with Multi-viewpoints Similarity Metric

  • Noorullah R.M
  • Moulana Mohammed
Keywords: Issues of Children with Special Needs, Multi-viewpoints Similarity Metric, Hybrid Topic Models, Topic Cloud Visualization, Cluster Tendency

Abstract

Social networks are an excellent source for users to share or exchange information on topics. Twitter is the most prioritized social network concerning the issues of children with special needs related topics of social users. Extracting good quality of topics from twitter corpus depends on the quality of text pre-processing and in finding optimal cluster tendency. With traditional topic models, cluster tendency identification is difficult because they use less frequent words in tweets. In traditional topic models, k value (number of clusters) decided manually and used Euclidean distance metric in most methods and cosine distance metrics in some methods. Proper Visualization of cluster tendency is also essential as corpus consists of a large number of documents and billions of words. In this paper, hybrid topic models with multi-viewpoints based similarity metric proposed to Visualize topic clouds, to find cluster tendency of various topics related to issues of children with special needs twitter datasets. Experimental evaluation and comparison of these proposed hybrid models done with other distance metrics. Empirical analysis performed with convergence speed and computational complexities. Cluster validity of proposed models done with external validity indices to quantify the quality of cluster and with internal validity indices to evaluate clustering structure. Visual Non-Matrix Factorization (VIS NMF) under multi-viewpoints similarity metric performed well than other models with a more informative assessment.

Downloads

Download data is not yet available.

References

Hassan, A. E. H. (2015). Emotional and behavioral problems of children with learning disabilities. Journal of Educational Policy and Entrepreneurial Research (JEPER), 2(10), 66-74.
https://www.researchgate.net/publication/282733476.
Amelio, A., &Pizzuti, C. (2015, August). Is normalized mutual information a fair measure for comparing community detection methods?.In Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015, 1584-1585.
Published
2020-05-15
How to Cite
R.M, N., & Mohammed, M. (2020). Twitter Data Clustering on issues of Children with Special Needs using Hybrid Topic Models with Multi-viewpoints Similarity Metric. International Journal of Early Childhood Special Education, 12(1), 159-184. https://doi.org/10.9756/INT-JECSE/V12I1.201003