Forming Dataset of The Undergraduate Thesis using Simple Clustering Methods

DHARMAWAN, Tio; CANDRAMAYA, Chinta ’Aliyyah; WIDHARTA, Vandha Pradwiyasma

dc.contributor.author	DHARMAWAN, Tio
dc.contributor.author	CANDRAMAYA, Chinta ’Aliyyah
dc.contributor.author	WIDHARTA, Vandha Pradwiyasma
dc.date.accessioned	2023-03-07T07:03:20Z
dc.date.available	2023-03-07T07:03:20Z
dc.date.issued	2023-01-01
dc.identifier.uri	https://repository.unej.ac.id/xmlui/handle/123456789/112593
dc.description.abstract	Universities collect large amounts of undergraduate thesis data but have yet to process it in a way that makes it easier for students to find the references they need. This study groups thesis documents and compares the grouping produced by experts with the grouping produced by simple clustering methods. Experts built the ground truth using OR Boolean retrieval and keyword generation. The best clustering was sought through experiments with the K-Means, K-Medoids, and DBSCAN clustering methods and the Euclidean, Manhattan, City Block, and Cosine Similarity metrics. The clustering with the best Silhouette Score was then compared with the ground truth to measure the accuracy of each document's categorization. K-Means with the Cosine Similarity metric gave the best result, with a Silhouette Score of 0.105534. Comparing the ground truth with the best clustering yields an accuracy of 33.42%. The results indicate that the simple clustering methods cannot handle data with negative skewness and a leptokurtic distribution.	en_US
dc.language.iso	en	en_US
dc.publisher	INTERNATIONAL JOURNAL OF INNOVATION IN ENTERPRISE SYSTEM	en_US
dc.subject	Document Clustering	en_US
dc.subject	Text Mining	en_US
dc.subject	Relevant Term	en_US
dc.subject	Information Retrieval	en_US
dc.subject	Topic Identification	en_US
dc.title	Forming Dataset of The Undergraduate Thesis using Simple Clustering Methods	en_US
dc.type	Article	en_US
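
The abstract ranks clusterings by Silhouette Score under several distance metrics. As a rough illustration of that measure only, the sketch below computes a mean silhouette coefficient with cosine distance in plain Python. The function names, the toy term-count vectors, and the labels are hypothetical and are not taken from the article; the authors' actual pipeline and tooling are not described in this record.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cosine similarity for two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)


def silhouette_score(vectors, labels, distance=cosine_distance):
    """Mean silhouette coefficient over all samples (hypothetical helper)."""
    # Group sample indices by cluster label.
    clusters = {}
    for idx, label in enumerate(labels):
        clusters.setdefault(label, []).append(idx)
    if len(clusters) < 2:
        raise ValueError("silhouette needs at least two clusters")

    total = 0.0
    for i, label in enumerate(labels):
        own = [j for j in clusters[label] if j != i]
        if not own:
            continue  # singleton cluster contributes 0 by convention
        # a: mean distance to the other members of the sample's own cluster.
        a = sum(distance(vectors[i], vectors[j]) for j in own) / len(own)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(distance(vectors[i], vectors[j]) for j in members) / len(members)
            for other, members in clusters.items()
            if other != label
        )
        denom = max(a, b)
        if denom > 0:
            total += (b - a) / denom
    return total / len(labels)


# Toy term-count vectors for four documents (illustrative data only).
docs = [[2, 1, 0, 0], [3, 1, 0, 0], [0, 0, 1, 2], [0, 0, 2, 3]]
good = silhouette_score(docs, [0, 0, 1, 1])  # documents grouped by shared terms
bad = silhouette_score(docs, [0, 1, 0, 1])   # documents grouped across topics
print(good, bad)
```

Scores near 1 indicate compact, well-separated clusters, while scores near 0 indicate overlapping clusters, which gives some context for the reported value of 0.105534.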

Files in this item

Name: FASILKOM_Forming Dataset of The ...
Size: 1.141Mb
Format: PDF

This item appears in the following Collection(s)

LSP-Jurnal Ilmiah Dosen [7415]
Koleksi Jurnal Ilmiah Dosen
