Element 68Element 45Element 44Element 63Element 64Element 43Element 41Element 46Element 47Element 69Element 76Element 62Element 61Element 81Element 82Element 50Element 52Element 79Element 79Element 7Element 8Element 73Element 74Element 17Element 16Element 75Element 13Element 12Element 14Element 15Element 31Element 32Element 59Element 58Element 71Element 70Element 88Element 88Element 56Element 57Element 54Element 55Element 18Element 20Element 23Element 65Element 21Element 22iconsiconsElement 83iconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsiconsElement 84iconsiconsElement 36Element 35Element 1Element 27Element 28Element 30Element 29Element 24Element 25Element 2Element 1Element 66
SCAN – Systematic Content Analysis of User Comments for Journalists

SCAN – Systematic Content Analysis of User Comments for Journalists

Journalistic editorial departments face an increasing amount of feedback from the audience, e.g., in forums, comment sections and social media. The amount of comments and other feedback from the audience poses an enormous challenge for editorial departments. A large part of this effort concentrates on the downside of this development: filtering spam, hate speech or content that could be propaganda. The SCAN Project, in collaboration with Prof. Dr. Walid Maalej and his team of the Department of Informatics of Universität Hamburg, focuses on a constructive approach and wants to support journalists to extract the “journalistic sense” out of user comments for their own work but also for the audience itself. Thus, they should be able to find helpful comments or identify different opinions on a topic much faster. Dr. Wiebke Loosen and Lisa Merten present the interdisciplinary cooperation in the 15th episode of the BredowCast.

show more

Project Description

As part of the larger transformation of public communication in the digital age, professional journalists are facing an increasing amount of audience feedback, e.g. in forums, comments sections, and social media. In pre-digital times, conversations among audience members about mass media content remained largely invisible to journalists, with the exception of letters or calls to the editor. Today, the conversations of “the people formerly known as the audience” (Jay Rosen) are becoming visible to journalists, but also to other users, fundamentally changing how today’s journalists and their audiences perceive, use, and manage this kind of feedback.

Most (online) newsrooms will consider comment sections and other features for audience feedback mandatory. However, newsrooms differ regarding how they manage these spaces, how they engage their users, and how they make use of the feedback for their own journalistic reporting – not the least because the manual handling and summarising of comments by journalists or dedicated social media editors is time consuming, while a fully automated analysis is expensive and error-prone. Accordingly, the development of tools to assist journalists in analysing, filtering, and summarising user-generated content has been identified as a main challenge for news organisations.

The Hans-Bredow-Institut works together with the Department of Informatics of Universität Hamburg in order to develop a framework that supports journalist to analyse, filter, and summarise user-generated content. This framework enables them to carry out a systematic, semi-automated analysis of audience feedback to better reflect the voice of users, mitigate the analysis efforts, and help journalist in generating new content from the user comments. With the framework journalists can create different samples of user comments, configure the questions they want to answer from the comments, and assign the question-answering task to “human coders” from the crowd.

The framework uses machine learning and natural language processing techniques in combination with manual content analysis (peer coding) and crowdsourcing to automatically filter spams, distinguish between praise and criticism, and cluster the comments into customisable categories. Moreover, journalists can create basic summaries about the comments such as how many users were for or against a particular position. As part of the project, we will (a) discuss and develop the framework requirements with journalists and (b) evaluate the framework in a concrete use case with a large German online news site.

The requirements for such a system will be specified together with journalists in the course of the project. Furthermore, it will be tested within the scope of a certain case on a big German news website.

Project Information


Duration: 2015-2016

Research programme:
RP1 - Transformation of Public Communication

Third party

Google Computational Journalism Research Programme

Contact person

Prof. Dr. Wiebke Loosen
Senior Researcher Journalism Research

Prof. Dr. Wiebke Loosen

Leibniz-Institut für Medienforschung | Hans-Bredow-Institut (HBI)
Rothenbaumchaussee 36
20148 Hamburg

Tel. +49 (0)40 45 02 17 - 91
Fax +49 (0)40 45 02 17 - 77

Send Email



Subscribe to our newsletter and receive the Institute's latest news via email.