Analisis Sentimen Berbasis Topik pada Ulasan Aplikasi Maxim Menggunakan Pendekatan Semisupervised Learning dengan Indobert, Lda, dan Generative Ai Laporan Skripsi

dc.contributor.authorValentino Hariyanto
dc.date.accessioned2026-06-23T07:20:07Z
dc.date.issued2026-06-12
dc.descriptionApproved by Teddy
dc.description.abstractMaxim is a ride-hailing service that has been operating in Indonesia since July 2018. The number of users continues to grow, which also increases the number of reviews on platforms like Google Play Store and X (Twitter). These reviews contain a lot of information about user perceptions of the service. However, the large volume makes manual analysis difficult and impractical to do. This study uses sentiment analysis to understand user perceptions of Maxim. The approach combines IndoBERT with semi-supervised learning for sentiment classification, Latent Dirichlet Allocation (LDA) for topic extraction, and Generative AI for topic interpretation. Data was collected from 2018 to 2025, totaling 252.321 reviews. After preprocessing, 209,665 reviews remained usable. An initial dataset of 2,500 reviews was manually labeled by three annotators. Inter-rater agreement was tested using Fleiss' Kappa and obtained a score of 0.6286, which falls in the substantial agreement category. This labeled dataset was used to train IndoBERT-base-p2 as the initial model. Iterative bootstrapping was then carried out for five iterations by adding pseudo-labels with a confidence score of at least 0.98. The best model, M3, reached 96% accuracy and a macro F1-score of 0.93. For topic modeling, LDA identified five main topics: Driver Attitude, Application Performance, Order Process, GPS/Map Accuracy, and Price and Fare. Topic labeling was done using Google Gemini through a structured prompt. Results show that Driver Attitude is mostly positive, while Application Performance, Order Process, and GPS/Map Accuracy are dominated by negative sentiment. Price and Fare has a relatively balanced distribution. Sentiment trends from 2018 to 2024 show gradual improvement. The findings suggest that technical issues in the application and the driver-matching process are the main areas that need improvement.
dc.description.sponsorshipAchmad Maududie, ST., M.Sc.
dc.identifier.urihttps://repository.unej.ac.id/handle/123456789/9895
dc.publisherFakultas Ilmu Komputer
dc.subjectAnalisis Sentimen
dc.subjectIndoBERT
dc.subjectLatent Dirichlet Allocation
dc.subjectGenerative AI
dc.subjectSemi-supervised Learning
dc.titleAnalisis Sentimen Berbasis Topik pada Ulasan Aplikasi Maxim Menggunakan Pendekatan Semisupervised Learning dengan Indobert, Lda, dan Generative Ai Laporan Skripsi
dc.typeThesis

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Skripsi_Valentino Hariyanto_RepoUNEJ.pdf
Size:
1.59 MB
Format:
Adobe Portable Document Format

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed to upon submission
Description: