Element 68Element 45Element 44Element 63Element 64Element 43Element 41Element 46Element 47Element 69Element 76Element 62Element 61Element 81Element 82Element 50Element 52Element 79Element 79Element 7Element 8Element 73Element 74Element 17Element 16Element 75Element 13Element 12Element 14Element 15Element 31Element 32Element 59Element 58Element 71Element 70Element 88Element 88Element 56Element 57Element 54Element 55Element 18Element 20Element 23Element 65Element 21Element 22iconsiconsElement 83iconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsElement 84iconsiconsElement 36Element 35Element 1Element 27Element 28Element 30Element 29Element 24Element 25Element 2Element 1Element 66
Juli 2017

International Conference on Computational Social Science

At the Conference, hosted by GESIS Leibniz Institute for the Social Sciences, July 10-13, 2017, Cologne, Cornelius Puschmann will co-present a Tutorial "Topic modeling European political debates with the EUSpeech dataset"

"Topic modeling European political debates with the EUSpeech dataset"

Presented by:
TATJANA SCHEFFLER, University of Potsdam
DAMIAN TRILLING, University of Amsterdam
CORNELIUS PUSCHMANN, Hans Bredow Institute for Media Research

The tutorial provides a concise and hands-on introduction to topic modeling, an increasingly popular method in computational social science (Blei, 2012; DiMaggio, 2015; Puschmann & Scheffler, 2016). While a growing number of packages in widely used programming languages are at the disposal of researchers, there are a number of caveats to consider when deploying topic models in the research process, both on the technical level and in terms of research design, such as which algorithm to rely on, how to set parameters, and in which ways to preprocess data. Interpreting the output generated by popular algorithms such as Latent Dirichlet Allocation (LDA; Blei, Ng & Jordan, 2003), and evaluating the validity of topic model statistics are key challenges for social scientists interested in using topic modeling, as is the question of successfully embedding topic models in a research design in a fashion that allows the testing of concrete hypotheses. In a series of eight compact segments, we will both provide a user-friendly description of the workflow in both Python and R for the practical application of topic models to our example data, the EUspeech corpus (Schumacher et al., 2016), and discuss the conceptual basis of topic models along with approaches for the evaluation of topic model fit and the validity of the results generated with them. We expect an audience of both social scientists and computational researchers, mostly at the PhD student and postdoctoral level. Our approach will be research-oriented, involving an overview of relevant packages in both Python and R, in addition to our own scripts (also in both languages) which will be shared via Github. We aim to be both highly practical and language-agnostic by focusing on what topic models do, how they do it, and what questions can be studied effectively using them. We expect some familiarity with programming for those participants who want to apply topic modeling in their own research, but the segments on how to interpret topic models should be equally relevant for those with and without programming knowledge.

Infos zur Veranstaltung


Tagungszentrum des Erzbistums Köln
Kardinal-Frings-Str. 1-3
50668 Köln

Contact person

Prof. Dr. Cornelius Puschmann
Professor at the University of Bremen

Prof. Dr. Cornelius Puschmann

Leibniz-Institut für Medienforschung | Hans-Bredow-Institut (HBI)
Rothenbaumchaussee 36
20148 Hamburg

Tel. +49 (0)40 45 02 17 55
Fax +49 (0)40 45 02 17 77

Send Email


Subscribe to our newsletter and receive the Institute's latest news via email.