skip to main content

LATENT DIRICHLET ALLOCATION DALAM IDENTIFIKASI RESPON MASYARAKAT INDONESIA TERHADAP PROFESI PEGAWAI NEGERI SIPIL

*Nurul Fajrin Aghentika  -  Departemen Statistika, Fakultas Sains dan Matematika, Universitas Diponegoro, Indonesia
Sugito Sugito  -  Departemen Statistika, Fakultas Sains dan Matematika, Universitas Diponegoro, Indonesia
Budi Warsito  -  Departemen Statistika, Fakultas Sains dan Matematika, Universitas Diponegoro, Indonesia
Open Access Copyright 2026 Jurnal Gaussian under http://creativecommons.org/licenses/by-nc-sa/4.0.

Citation Format:
Abstract

Data on social media comments can be extracted to produce hidden information that is useful as a guide for evaluation and decision making. YouTube has a comments feature as a forum for expressing opinions, experiences and questions. Civil Servants are known as one of the job choices of Indonesian people, the government announced that there were resignations of Civil Servant Candidates in 2022. Responses written in the comment column are difficult to understand, topic modeling can be applied as a text analysis process to find descriptions from unstructured data. Latent Dirichlet Allocation method is able to find out hidden topics in a document as well as the words that make up a topic so that the application of this method will help in identifying responses discussed by the audience. The data used is textual data in the form of comments from YouTube scrapping during 2022. The results of topic modeling form eight topics, namely retirement life, parents hopes, dream jobs, civil servants, job differences, characteristics of generation Z, salary and benefits, and reasons for resignation. The RStudio GUI program can make it easier for users to analyze topic modeling with similar methods.

Note: This article has supplementary file(s).

Fulltext View|Download |  Research Instrument
Untitled
Subject
Type Research Instrument
  Download (269KB)    Indexing metadata
Email colleagues
Keywords: Topic Modeling; Latent Dirichlet Allocation; Text Mining; Information Retrieval; Pegawai Negeri Sipil; YouTube

Article Metrics:

Article Info
Section: Articles
Language : EN
  1. Badan Kepegawaian Negara. 2022. Siaran Pers Peserta Seleksi CASN Mengundurkan Diri. Jakarta: BKN
  2. Blei, D. M. 2012. Probabilistic Topic Models. Communications of The ACM Vol. 55, No. 4: Hal. 77-84
  3. Blei, D. M., Ng, A. Y., & Jordan, M. I. 2003. Latent Dirichlet Allocation. Journal of Machine Learning Research, Hal: 993-1022
  4. Campbell, J. C., Hindle, A., & Stroulia, E. 2015. Latent Dirichlet Allocation: Extracting Topics from Software Engineering Data. The Art and Science of Analyzing Software Data, Hal. 139-159
  5. Fahlevvi, M. R., & Azhari, S. N. 2022. Topic Modeling on Online News Portal Using Latent Dirichlet Allocation. IJSS (Indonesian Journal of Computing and Cybernetics Systems Vol. 16, No. 4: Hal. 335-344
  6. Feldman, R., & Sanger, J. 2007. The Text Mining Handbook: Advanced Approaches in Analyzing Unstructured Data. New York: Cambridge University Press
  7. Griffiths, T. L., & Steyvers, M. 2004. Finding Scientific Topics. Proceedings of the National Academy of Sciences, Stanford University: 6 April 2004
  8. Hearst, M. 2003. What is Text Mining. SIMS: UC Berkeley
  9. Heinrich, G. 2005. Parameter Estimation for Text Analysis. Technical Report Fraunhofer IGD Vol. 1, Hal: 1-32
  10. Hong, L., & Davison, B. D. 2010, July. Empirical Study of Topic Modeling in Twitter. Proceedings of The First Workshop on Social Media Analytics, Lehigh University: 25 Juli 2010
  11. Manning, C., Raghavan, P., & Schuetze, H. 2008. Introduction to Information Retrieval. New York: Cambridge University Press
  12. Mohammed, S. H., & Al-augby, S. 2020. LSA & LDA Topic Modeling Classification: Comparison Study on E-books. Indonesian Journal of Electrical Engineering and Computer Science Vol. 19, No. 1: Hal. 353-362
  13. Presiden Republik Indonesia. 2014. Undang-Undang Nomor 5 Tahun 2014 tentang Aparatur Sipil Negara. Jakarta: Sekretariat Negara
  14. Roder, M., Both, A., & Hinneburg, A. 2015. Exploring The Space of Topic Coherence Measures. Proceedings of The Eighth ACM International Conference on Web Search and Data Mining, Shanghai China: 2-6 Februari 2015
  15. Stevens, K., Kegelmeyer, P., Andrzejewski, D., & Buttler, D. 2012. Exploring Topic Coherence Over Many Models and Many Topics. Proceedings of the 2012 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning, Jeju Island Korea: 12-14 Juli 2012
  16. Steyvers, M., & Griffiths, T. 2006. Probabilistic Topic Models. Latent Semantic Analysis: A Road to Meaning. Laurence Erlbaum
  17. YouTube. 2022. About YouTube. https://about.youtube/. Diakses pada: 16 November 2022

Last update:

No citation recorded.

Last update:

No citation recorded.