EMPIRICAL PRIOR LATENT DIRICHLET ALLOCATION MODEL

Authors

  • MA Adegoke DEPARTMENT OF COMPUTER SCIENCE &TECHNOLOGY, BELLS UNIVERSITY OF TECHNOLOGY, OTA, OGUN STATE, NIGERIA
  • JOA Ayeni DEPT. OF COMPUTER SCIENCE, COLLEGE OF NATURAL SCIENCES, REDEEMER’S UNIVERSITY, EDE, OSUN STATE, NIGERIA
  • PA Adewole DEPARTMENT OF COMPUTER SCIENCES, FACULTY OF SCIENCE, UNIVERSITY OF LAGOS, AKOKA-YABA, LAGOS STATE, NIGERIA

Abstract

In this study, empirical prior Dirichlet allocation (epLDA) model that uses latent semantic indexing framework to derive the priors required for topics computation from data is presented. The parameters of the priors so obtained are related to the parameters of the conventional LDA model using exponential function. The model was implemented and tested with benchmarked data and it achieves a prediction accuracy of 92.15%. It was observed that the epLDA model consistently outperforms the conventional LDA model on different datasets with an average percentage accuracy of 6.33%; this clearly demonstrates the advantage of using side information obtained from data for the computation of the mixture components.

 

Keywords: latent Dirichlet allocation; semantic indexing; empirical prior; hidden structures; Prediction accuracy.

 

http://dx.doi.org/10.4314/njt.v38i1.27

Downloads

Published

2018-12-30

Issue

Section

Computer, Telecommunications, Software, Electrical & Electronics Engineering

How to Cite

EMPIRICAL PRIOR LATENT DIRICHLET ALLOCATION MODEL. (2018). Nigerian Journal of Technology, 38(1), 223-232. https://nijotech.com/index.php/nijotech/article/view/1934