Classification of Public Complaints Basedon Text Mining Using Modified K-Nearest Neighbor, Naïve Bayes and C4.5 Algorithm

Samsul Bahri; Ema Utami; Asro Nasiri

doi:10.33050/ccit.v15i2.2286

PDF

Published Aug 4, 2022

DOI https://doi.org/10.33050/ccit.v15i2.2286

Samsul Bahri

AMIKOM University Yogyakarta

Ema Utami

AMIKOM University Yogyakarta

Asro Nasiri

AMIKOM University Yogyakarta

Abstract

To improve public services, accuracy and acceleration are needed in classifying the types of complaints so that complaints can immediately get a response from the relevant regional apparatus. This public complaint data is in text form and is not balanced in each category of regional apparatus, so we contribute to research to compare the performance of different text mining-based classification algorithms. In addition, we also tested the resampling method to overcome imbalanced data. In the final stage, testing is carried out using a multiclass confusion matrix table to show accuracy, precision, recall, and f1-score. The test results show the highest value in the Naïve Bayes algorithm with the ComplementNB model without resampling data, which is 89.58% accuracy, 86.72% precision, 82.40% recall, 84.09% f1-score. However, all scores decreased when combined with SMOTE resampling of 83.66% accuracy, 67.79% precision, 80.35% recall, 71.68% f1-score. ComplementNB can be an alternative model in the classification of public complaints with imbalanced datasets

Downloads

Download data is not yet available.

How to Cite

[1]

S. Bahri, E. Utami, and A. Nasiri, “Classification of Public Complaints Basedon Text Mining Using Modified K-Nearest Neighbor, Naïve Bayes and C4.5 Algorithm”, CCIT (Creative Communication and Innovative Technology) Journal, vol. 15, no. 2, pp. 198-207, Aug. 2022.

Issue

Vol 15 No 2 (2022): CCIT JOURNAL

Section

Articles

Article Sidebar

Main Article Content

Abstract

Downloads

Article Details