Classification of Public Complaints Basedon Text Mining Using Modified KNearest Neighbor Naïve Bayes and C45 Algorithm

Authors

  • Samsul Bahri AMIKOM University Yogyakarta
  • Ema Utami AMIKOM University Yogyakarta
  • Asro Nasiri AMIKOM University Yogyakarta

DOI:

https://doi.org/10.33050/ccit.v15i2.2286

Keywords:

Imbalanced Data, Multiclass, Resampling Method

Abstract

To improve public services, accuracy and acceleration are needed in classifying the types of complaints so that complaints can immediately get a response from the relevant regional apparatus. This public complaint data is in text form and is not balanced in each category of regional apparatus, so we contribute to research to compare the performance of different text mining-based classification algorithms. In addition, we also tested the resampling method to overcome imbalanced data. In the final stage, testing is carried out using a multiclass confusion matrix table to show accuracy, precision, recall, and f1-score. The test results show the highest value in the Naïve Bayes algorithm with the ComplementNB model without resampling data, which is 89.58% accuracy, 86.72% precision, 82.40% recall, 84.09% f1-score. However, all scores decreased when combined with SMOTE resampling of 83.66% accuracy, 67.79% precision, 80.35% recall, 71.68% f1-score. ComplementNB can be an alternative model in the classification of public complaints with imbalanced datasets

Downloads

Download data is not yet available.

Downloads

Published

2022-08-04

Most read articles by the same author(s)

1 2 3 4 5 6 7 8 9 10 > >>