Show simple item record

dc.contributor.authorADRIENUFA, Kharisma Trinanda
dc.date.accessioned2025-08-04T07:13:17Z
dc.date.available2025-08-04T07:13:17Z
dc.date.issued2023-07-26
dc.identifier.nim192410103019en_US
dc.identifier.urihttps://repository.unej.ac.id/xmlui/handle/123456789/127707
dc.descriptionvalidasi_repo_ratna_Agustus 2025; Finalisasi oleh Taufik_Alya Tgl 4 Agustus 2025en_US
dc.description.abstractDocument clustering can’t avoid the problem of high dimensionality, which can be overcome by combining the advantage of statistical and semantic features. This study aims to determine the performance of clustering with the Stamantic (statistical and semantic) feature extraction technique compared to the several Bag Of Words Model (Bag Of All Word, Bag Of Noun, Bag Of Noun and Adjective) as well as a comparison between Spherical K-Means and K-Means++ clustering algorithm. Stamantic feature extraction use the Wordnet (Wn) database to form semantic features, while statistical features are obtained from TF-IDF (Term Frequency Inverse Document Frequency) word sense. Evaluation were carried out on clustering results with several metrics. The highest Silhouette score is 0.162213 on the BONA feature from Pubmed dataset which clustered with K-Means++ algorithm. The highest Purity score around 0.949643 on the BONA feature from Scopus dataset with Spherical K-Means algorithm. The highest AMI (Adjusted Mutual Information) score is 0.880835 on the BONA feature from Scopus dataset with Spherical K-Means clustering algorithm. The test results show that the Stamantic feature loses to all BOW features. Due to the loss data information from the effect of using Wn library and disambiguation process which inappropriate.en_US
dc.language.isootheren_US
dc.publisherFakultas Ilmu Komputeren_US
dc.subjectDOCUMENT CLUSTERINGen_US
dc.subjectLEXICAL CHAINen_US
dc.titlePeningkatan Performa Clustering Pada Large Text Dataset Menggunakan Stamantic Spherical K-Meansen_US
dc.title.alternativeClustering Enhancement Of Large Text Dataset Using Stamantic Spherical K-Meansen_US
dc.typeSkripsien_US
dc.identifier.prodiInformatikaen_US
dc.identifier.pembimbing1Achmad Maududie ST, M.Sc.en_US
dc.identifier.pembimbing2Tio Dharmawan, S.Kom., M.Komen_US
dc.identifier.validatorvalidasi_repo_ratna_Agustus 2025en_US
dc.identifier.finalizationTaufiken_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record