Latent Dirichlet Allocation or LDA is a statistical technique that was introduced in 2003 from a research paper. LDA is used for topic modelling in text documents. LDA is more often analogue to PCA that we covered before. If you remember in PCA, we used to generate a single value for the existing values in a dataset. LDA will generate a topic for documents by analyzing the content of the document. This technique can be used to cluster documents as well which is an important task in text analytics.
Read the full article at Latent Dirichlet Allocation in Text Analytics
This is ToC for the Azure Machine Learning Series.
Introduction to Azure Machine Learning using Azure ML Studio
Azure Machine Learning: Named Entity Recognition in Text Analytics
No comments:
Post a Comment