Wordstat 5.1

CONTENT ANALYSIS AND
TEXT-MINING MODULE

Designed and written by Normand Péladeau

WordStat is a text analysis module specifically designed to study textual information such as responses to open-ended questions, interviews, titles, journal articles, public speeches, electronic communications, etc. WordStat may be used for automatic categorization of text using a dictionary approach or various text mining as well as for manual coding. WordStat can apply existing categorization dictionaries to a new text corpus. It also may be used in the development and validation of new categorization dictionaries or taxonomies. When used in conjunction with manual coding, this module can provide assistance for a more systematic application of coding rules, help uncover differences in word usage between subgroups of individuals, assist in the revision of existing coding using KWIC (Keyword-In-Context) tables, and assess the reliability of coding by the computation of inter-raters agreement statistics.

WordStat includes numerous exploratory data analysis and graphical tools that may be used to explore the relationship between the content of documents and information stored in categorical or numeric variables such as the gender or the age of the respondent, year of publication, etc. Relationships among words or categories as well as document similarity may be identified using hierarchical clustering and multidimensional scaling analysis. Correspondence analysis and heatmap plots may be used to explore relationship between keywords and different groups of individuals.

WordStat is a module that must be run from either of the following base products:

SimStat -This statistical software provides a wide range of statistical procedures for the analysis of quantitative data. It offers advanced data file management tools such as the ability to merge data files, aggregate cases, perform complex computation of new variables and transformation of existing ones. When used with Simstat, WordStat can analyze textual information stored in any alphanumeric, plain text and rich text memo variable (or field). It includes various tools to explore the relationship between any numeric variable of a data file and the content of alphanumeric ones.

QDA Miner - The text management and qualitative analysis program allows one to create and edit data files, import documents, and perform manual coding of those documents. Several analysis tools are also available to look at the frequency of manually assigned codes and the relationship between those codes and other categorical or numeric variables. When used with QDA Miner, WordStat can perform content analysis on whole documents or selected segments of those documents tagged with specific user defined codes.

Which should I use? - If you are primarily working with just textual data then QDA Miner provides the most powerful text manipulation and organization tools. If you need to also analyze associated numerical data then Simstat provides a wide range of statistical anlyses. If you need to do both then you can get both Simstat and QDA Miner. They easily coexist and work with Wordstat, giving you the widest range of options.

Download

You can download a demo version of the latest Wordstat from here.

Prices and ordering

For prices, on-line ordering and other purchasing information please go to our ordering page.

System Requirements

Requires Windows 98 or later and either Simstat for Windows version 1.21d or later or QDA Miner.

Copyright © 2008 Kovach Computing Services, Anglesey, Wales. All Rights Reserved. Portions copyright Addinsoft, Provalis Research, and Data Description Inc.

Last modified 29 October, 2008