Penerapan Hadoop untuk Analisis Sentimen Berbasis Big Data pada Ulasan Aplikasi Transportasi Online

Authors

  • Putri Angraini Aziz Universitas Halu Oleo
  • Syaban Barokah Nur Ilahi Universitas Halu Oleo
  • Sumiarni Moka Universitas Halu Oleo
  • Adha Mashur Sajiah Universitas Halu Oleo

DOI:

https://doi.org/10.54259/satesi.v5i1.4051

Keywords:

Hadoop, Apache Spark, Analisis Sentimen, Logistic Regression

Abstract

The rapid growth of application-based transportation services in Indonesia has generated a large volume of user reviews that contain essential information for service development. However, significant challenges arise in processing and analyzing data on a large scale. This study utilizes Hadoop and Apache Spark technology to conduct sentiment analysis on online transportation application reviews, focusing on Gojek user reviews. The dataset comprises 1.880.112  reviews obtained from Kaggle and Google Play Store. The research method includes data preprocessing using distributed computing with Hadoop and Spark, followed by sentiment labeling based on user ratings. The sentiment analysis model used is Logistic Regression, with hyperparameter tuning through Cross Validation. The evaluation results show a model accuracy of 80%, demonstrating the model's capability in effectively classifying sentiments, supported by PySpark implementation which enables efficient training and evaluation processes despite working with large-scale datasets. Text visualization in the form of a word cloud reveals that negative sentiment is often associated with app performance and digital wallet issues, while neutral sentiment focuses more on driver services. On the other hand, positive sentiment highlights user satisfaction with the overall service. The findings of this study demonstrate the effectiveness of Hadoop in large-scale sentiment analysis processing and provide valuable insights for improving online transportation services.

Downloads

Download data is not yet available.

References

G. A. Pratiwi, R. M. Almakhsum, R. D. Setiyawati, A. P. Farahdila, and A. Zaki, “Kontestasi Start-up Ojek Online di Indonesia: Strategi Promosi Digital Gojek, Grab, Indriver, dan Maxim,” OIKONOMIKA : Jurnal Kajian Ekonomi dan Keuangan Syariah, vol. 5, no. 1, pp. 64–79, Aug. 2024, doi: 10.53491/oikonomika.v5i1.955.

R. Wahyudi et al., “Analisis Sentimen pada review Aplikasi Grab di Google Play Store Menggunakan Support Vector Machine,” JURNAL INFORMATIKA, vol. 8, no. 2, pp. 200–207, Sep. 2021, doi: https://doi.org/10.31294/ji.v8i2.9681.

M. Syaharani, “Moda Transportasi Online Menjadi Pilihan Masyarakat, Apa Alasannya?,” GoodStats. [Online]. Available: https://goodstats.id/article/moda-transportasi-online-semakin-menjadi-favorit-masyarakat-apa-alasannya-BFsC9

V. K. S. Que, A. Iriani, and H. D. Purnomo, “Analisis Sentimen Transportasi Online Menggunakan Support Vector Machine Berbasis Particle Swarm Optimization,” Jurnal Nasional Teknik Elektro dan Teknologi Informasi |, vol. 9, no. 2, pp. 162–170, May 2020, doi: https://doi.org/10.22146/jnteti.v9i2.102.

N. Yurindera, “Dampak Kepercayaan Terhadap Loyalitas Pelanggan Melalui Kepuasan pada Layanan Transportasi Ojek Online,” Jurnal Esensi Infokom, vol. 8, no. 2, pp. 41–47, Oct. 2024, doi: https://doi.org/10.55886/infokom.v8i2.931.

R. A. Fauzi, I. Cholissodin, and B. Rahayudi, “Pemanfaatan Spark untuk Analisis Sentimen Mengenai Netralitas Berita dalam Membahas Pemilu Presiden 2019 Menggunakan Metode Naïve Bayes Classifier,” Jurnal Pengembangan Teknologi Informasi dan Ilmu Komputer, vol. 5, no. 3, pp. 1070–1077, 2021, [Online]. Available: http://j-ptiik.ub.ac.id

M. N. Fadillah and D. Bernadisman, “Peranan Ojek Online dalam Meningkatkan Omzet UMKM dan Pertumbuhan Ekonomi Digital Indonesia,” JUARA: Jurnal Pengabdian Kepada Masyarakat, vol. 1, no. 1, pp. 32–35, 2023, Accessed: Jan. 15, 2025. [Online]. Available: https://jurnas.saintekmu.ac.id/index.php/juara/article/view/57

N. A. Amalia, I. T. Utami, and Y. Wilandari, “ANALISIS SENTIMEN KEBIJAKAN PENYELENGGARA SISTEM ELEKTRONIK LINGKUP PRIVAT MENGGUNAKAN PENALIZED LOGISTIC REGRESSION DAN SUPPORT VECTOR MACHINE,” Jurnal Gaussian, vol. 12, no. 4, pp. 560–569, Jul. 2024, doi: 10.14710/j.gauss.12.4.560-569.

I. Rahmawati and T. R. Fitriani, “Analisis Sentimen Menggunakan Algoritma Logistic Regression Pada Penerbangan Lion Air berdasarkan Ulasan Pengguna Platform Online,” Jejaring Penelitian dan Pengabdian Masyarakat (JPPM), vol. 1, no. 1, pp. 11–16, Aug. 2023, doi: https://doi.org/10.58776/jriti.v1i1.60.

K. D. Indarwati and H. Februariyanti, “ANALISIS SENTIMEN TERHADAP KUALITAS PELAYANAN APLIKASI GO-JEK MENGGUNAKAN METODE NAIVE BAYES CLASSIFIER,” JATISI (Jurnal Teknik Informatika Dan Sistem Informasi), vol. 10, no. 10, Mar. 2023, doi: https://doi.org/10.35957/jatisi.v10i1.2643.

Junaedi, A. Hendra Gunawan, V. Kuswanto, and Jonathan, “Eksplorasi Algoritma Support Vector Machine untuk Analisis Sentimen Destinasi Wisata di Indonesia,” Bit-Tech (Binary Digital - Technology), vol. 7, no. 2, pp. 323–330, Dec. 2024, doi: 10.32877/bt.v7i2.1810.

A. Gugun, B. R. Sanjaya, F. Rahmadani, and J. C. Key, “LITERATURE REVIEW: Penggunaan Support Vector Machine (SVM) untuk Klasifikasi Penyakit Lambung,” Buletin Ilmiah Ilmu Komputer Dan Multimedia (BIIKMA), vol. 2, no. 3, pp. 546–549, Oct. 2024, doi: 10.32493/jtsi.v7i3.42254.

I. Permatahati, M. N. Perdana, N. Apriadi, T. P. Amanda, and Z. Maharani, “Analisis Dataset TOP 1000 IMDb Movies Menggunakan Hadoop,” Journal of Network and Computer, vol. 2, no. 2, pp. 23–36, 2023, [Online]. Available: https://jurnal.netplg.com/

L. Arrahmando Romadhona, R. Febrianti, A. Winata, C. Putri Amanda, and R. Julia Erizka, “Hadoop-MapReduce Pada YARN Framework,” Journal of Network and Computer, vol. 1, no. 2, pp. 91–101, 2022, [Online]. Available: https://jurnal.netplg.com/jnca

S. Aini Pohan and F. Hasyifah Sibarani, “ANALISIS SENTIMEN TERHADAP APLIKASI MAXIM MENGGUNAKAN ALGORITMA RANDOM FOREST,” Journal of Science and Social Research, no. 3, pp. 1201–1208, 2024, [Online]. Available: http://jurnal.goretanpena.com/index.php/JSSR

B. Irawan and O. Nurdiawan, “ANALISIS SENTIMEN TERHADAP PENGGUNA GOJEK DAN GRAB PADA MEDIA SOSIAL TWITTER MENGGUNAKAN RANDOM FOREST,” Jurnal Mahasiswa Teknik Informatika, vol. 7, no. 5, pp. 3614–3618, 2023.

S. Khairunnisa, A. Adiwijaya, and S. Al Faraby, “Pengaruh Text Preprocessing terhadap Analisis Sentimen Komentar Masyarakat pada Media Sosial Twitter (Studi Kasus Pandemi COVID-19),” JURNAL MEDIA INFORMATIKA BUDIDARMA, vol. 5, no. 2, pp. 406–414, Apr. 2021, doi: 10.30865/mib.v5i2.2835.

D. A. Hamidah, R. Salkiawati, and R. Sari, “Analisis Sentimen Ulasan Customer Kopi TMLST Menggunakan Algoritma Naïve Bayes,” Journal of Students‘ Research in Computer Science, vol. 5, no. 1, pp. 27–40, May 2024, doi: 10.31599/mrm89y71.

W. Wijiyanto, A. I. Pradana, S. Sopingi, and V. Atina, “Teknik K-Fold Cross Validation untuk Mengevaluasi Kinerja Mahasiswa,” Jurnal Algoritma, vol. 21, no. 1, pp. 239–248, May 2024, doi: 10.33364/algoritma/v.21-1.1618.

A. N. Hasanah and B. N. Sari, “ANALISIS SENTIMEN ULASAN PENGGUNA APLIKASI JASA OJEK ONLINE MAXIM PADA GOOGLE PLAY DENGAN METODE NAÏVE BAYES CLASSIFIER,” Jurnal Informatika dan Teknik Elektro Terapan, vol. 12, no. 1, pp. 90–96, Jan. 2024, doi: 10.23960/jitet.v12i1.3628.

J. A. Wibowo, V. C. Mawardi, and T. Sutrisno, “VISUALISASI WORD CLOUD HASIL ANALISIS SENTIMEN BERBASIS FITUR LAYANAN APLIKASI GOJEK DENGAN SUPPORT VECTOR MACHINE,” Jurnal Serina Sains, Teknik dan Kedokteran, vol. 2, no. 1, pp. 61–70, Mar. 2024, doi: 10.24912/jsstk.v2i1.32058.

Downloads

Published

2025-04-20

How to Cite

Putri Angraini Aziz, Nur Ilahi, S. B., Sumiarni Moka, & Sajiah, A. M. (2025). Penerapan Hadoop untuk Analisis Sentimen Berbasis Big Data pada Ulasan Aplikasi Transportasi Online. SATESI: Jurnal Sains Teknologi Dan Sistem Informasi, 5(1), 51–60. https://doi.org/10.54259/satesi.v5i1.4051

Issue

Section

Articles