Computer - Based Content Analysis I

Fall 2019



Participants need to have attended the following IPSDS courses or have corresponding knowledge: 1. SURV699C Introduction to Python and SQL or necessary knowledge in programming in Python: data types & structures, functions & loops, file I/O 2. SURV736 Web


This course investigates the foundations of Natural Language Processing (NLP) as tool for analyzing natural language texts in the social sciences, thus providing an alternative to traditional ways of data generation through surveys. The course introduces general use cases for NLP, provides a guide to standard operations on text as well as their implementation in the Python-based Natural Language Toolkit (NLTK) and introduces the text mining functionalities of the WEKA Machine Learning workbench.
The theory part of the course worth one credit can be supplemented by an optional project part worth another credit point.

*This course can be found on Testudo under Fall 2018 registrations as SURV699P.