WordStat 5.1

Computer Assisted Text Analysis

Designed and written by Normand Péladeau

“For those who have ever needed to find themes or relationships in verbatim responses, focus group transcripts, or other text sources, WordStat is very attractive indeed.” Marketing Research Magazine, Spring 2006

Do you need to extract information from large numbers of documents, customer feedback, interview transcripts or open-ended responses?

Whether you need a text mining tool for fast extraction of themes and trends, or careful and precise measurement with a state-of-the-art quantitative content analysis method, WordStat combines both approaches in a flexible and easy-to-use text analysis package.

Its seamless integration with Simstat, a statistical data analysis tool, and QDA Miner, a qualitative data analysis program, gives you unprecedented flexibility for analyzing text and relating its content to structured information, including numerical and categorical data.

What is it used for?

  • Content analysis of open-ended responses, interview or focus group transcripts.
  • Business intelligence and competitive analysis of web sites.
  • Information extraction and knowledge discovery from incident reports, customer complaints, messages.
  • Analysis of news coverage or scientific literature.
  • Automatic tagging and classification of documents.
  • Taxonomy development and validation.
  • Fraud detection, authorship attribution, patent analysis.
  • And more…

[Figure: Text Mining Process]

Main Benefits

WordStat offers, in a single software environment…

  • Powerful text mining and content analysis tools to analyze large amounts of unstructured information;
  • Integrated charting, visualization and reporting features;
  • Total control over the analysis process, with enough closeness to the data to balance analytical efficiency against precision in the results;
  • Analytical support for important decisions and timely answers to your information needs.

Key features

  • Integrated text mining analysis and visualization tools (clustering, multidimensional scaling, heatmaps, correspondence analysis).
  • Hierarchical categorization dictionary or taxonomy supporting words, word patterns, phrases and proximity rules.
  • Vocabulary and phrase finder for extraction of technical terms, recurring ideas and themes.
  • Keyword-in-context and keyword retrieval tools for easy identification of relevant text segments.
  • Machine learning algorithms for automatic document classification (Naive Bayes and k-nearest neighbors) with automatic feature selection and validation tools.
  • Import of documents and export of data, tables and graphs in industry-standard formats.
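
For readers unfamiliar with the term, a keyword-in-context (KWIC) listing shows each occurrence of a word together with the words around it. The short Python sketch below illustrates the general idea only. It is not WordStat's implementation, and the function name `keyword_in_context`, the `width` parameter and the sample sentence are all hypothetical, chosen purely for illustration.

```python
import re

def keyword_in_context(text, keyword, width=3):
    """Return (left, match, right) tuples for each whole-word hit.

    Illustrative helper only: splits the text into word tokens and
    keeps up to `width` words of context on each side of a match.
    """
    tokens = re.findall(r"\w+", text)
    target = keyword.lower()
    hits = []
    for i, tok in enumerate(tokens):
        if tok.lower() == target:  # case-insensitive whole-word match
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            hits.append((left, tok, right))
    return hits

# Hypothetical customer comment used as sample input.
sample = "The delivery was late. Late delivery is my main complaint about the service."
hits = keyword_in_context(sample, "delivery")
for left, word, right in hits:
    print(f"{left:>20} [{word}] {right}")
```

Aligning the keyword in a central column, as the `print` call does, is what makes such listings quick to scan for relevant text segments.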

Reviews of WordStat

THE POLITICAL METHODOLOGIST, vol 15(1), Summer 2007
JOURNAL OF MIXED METHODS RESEARCH, April 2007
MARKETING RESEARCH, Spring 2006
OR/MS Today, October 2005
RESEARCH, August 2005
AMERICAN STATISTICIAN, February 2005
LINGUIST, April 2004
SOCIAL SCIENCE COMPUTER REVIEW, vol 18(3), Fall 2000
FIELD METHODS, vol 11(2), 1999

Which base module?

WordStat is a module that must be run from either of the following base products:

QDA Miner - This text management and qualitative analysis program lets you create and edit data files, import documents, and manually code those documents. Several analysis tools are also available to examine the frequency of manually assigned codes and the relationship between those codes and other categorical or numeric variables. One feature that has become popular is "Query by Example", which retrieves text similar to a starting example and can "learn" from the user's relevance feedback. When used with QDA Miner, WordStat can perform content analysis on whole documents or on selected segments of those documents tagged with specific user-defined codes.

Simstat - This statistical software provides a wide range of statistical procedures for the analysis of quantitative data. It offers advanced data file management tools, such as the ability to merge data files, aggregate cases, compute new variables and transform existing ones. When used with Simstat, WordStat can analyze textual information stored in any alphanumeric, plain-text or rich-text memo variable (or field). It includes various tools to explore the relationship between any numeric variable of a data file and the content of alphanumeric ones.

Which should I use? - If you are working primarily with textual data, then QDA Miner provides the most powerful text manipulation and organization tools. QDA Miner is document oriented and has some unique tools to handle documents, search them, and tag them.

If you also need to analyze numerical data associated with your textual data, then Simstat provides a wide range of statistical analyses. It offers advanced statistical routines such as multiple regression, multi-way ANOVA/ANCOVA, factor analysis and reliability analysis.

If you need to do both, you can get both Simstat and QDA Miner. They coexist easily and both work with WordStat, giving you the widest range of options.

Download

You can download a demo version of the latest WordStat from here.

Prices and ordering

For prices, on-line ordering and other purchasing information please go to our ordering page.

System Requirements

Requires Windows 98 or later and either QDA Miner or Simstat for Windows version 1.21d or later.

Copyright © 2009 Kovach Computing Services, Anglesey, Wales. All Rights Reserved. Portions copyright Addinsoft, Provalis Research, and Data Description Inc.

Last modified 1 March, 2010