A framework for streamlined statistical prediction using topic models

April 15, 2019 Β· Declared Dead Β· πŸ› LaTeCH@NAACL-HLT

πŸ‘» CAUSE OF DEATH: Ghosted
No code link whatsoever

"No code URL or promise found in abstract"

Evidence collected by the PWNC Scanner

Authors Vanessa Glenny, Jonathan Tuke, Nigel Bean, Lewis Mitchell arXiv ID 1904.06941 Category stat.AP Cross-listed cs.CL Citations 2 Venue LaTeCH@NAACL-HLT Last Checked 4 months ago
Abstract
In the Humanities and Social Sciences, there is increasing interest in approaches to information extraction, prediction, intelligent linkage, and dimension reduction applicable to large text corpora. With approaches in these fields being grounded in traditional statistical techniques, the need arises for frameworks whereby advanced NLP techniques such as topic modelling may be incorporated within classical methodologies. This paper provides a classical, supervised, statistical learning framework for prediction from text, using topic models as a data reduction method and the topics themselves as predictors, alongside typical statistical tools for predictive modelling. We apply this framework in a Social Sciences context (applied animal behaviour) as well as a Humanities context (narrative analysis) as examples of this framework. The results show that topic regression models perform comparably to their much less efficient equivalents that use individual words as predictors.
Community shame:
Not yet rated
Community Contributions

Found the code? Know the venue? Think something is wrong? Let us know!

πŸ“œ Similar Papers

In the same crypt β€” stat.AP

R.I.P. πŸ‘» Ghosted

Forecasting: theory and practice

Fotios Petropoulos, Daniele Apiletti, ... (+78 more)

stat.AP πŸ› International Journal of Forecasting πŸ“š 481 cites 5 years ago

Died the same way β€” πŸ‘» Ghosted