Free Concordance / Keyword Frequency Text Analysis Tools

Doing qualitative research that involves text analysis, whether keyword frequency counts or reviewing keywords in their concordance context? I compiled a list of a few free, basic software packages that might help you with that. You should be able to do a simple keyword frequency lookup, a keyword search, and concordance viewing of occurrences in context, with basic import and export.

 

Check out the following:

Simple Concordance Program

This FREE program lets you create word lists and search natural language text files for words, phrases, and patterns. SCP is a concordance and word listing program that is able to read texts written in many languages.

Features

  • You can generate concordances, and search for words or phrases. You can produce both KWIC and Line-Based concordances. The words used can be highlighted in context or contracted. Left and right context sorting can be achieved. The concordances can be printed or saved to files.
  • You can save concordances in html to display on the web.
  • You can create word lists, which you can sort, print, or save. Sorting can be done in ascending or descending alphabetic or frequency order.
  • You can specify words and phrases using a prefix, a suffix, specific words, or using a pattern. You can select a group of words from which to produce your concordances and word lists.
  • You can exclude words using a stoplist.
  • You can display statistics of letter and word-frequency usage.
  • You can define the method of reading text, the types of symbols used, together with their collating order and style of embedded text references.

Many text files already exist in forms that SCP can handle. Several sample files are included in the SCP download.
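To make the KWIC (keyword-in-context) idea from the feature list concrete, here is a minimal Python sketch. It is not how SCP works internally; the function names `kwic`, `format_kwic`, and `sort_by_right_context`, the sample text, and the fixed character-width context window are all assumptions made for illustration.

```python
import re


def kwic(text, keyword, width=30):
    """Return (left, match, right) tuples for each whole-word hit of keyword."""
    pattern = re.compile(r"\b" + re.escape(keyword) + r"\b", re.IGNORECASE)
    flat = text.replace("\n", " ")  # keep each concordance row on one line
    rows = []
    for m in pattern.finditer(flat):
        left = flat[max(0, m.start() - width):m.start()]
        right = flat[m.end():m.end() + width]
        rows.append((left, m.group(0), right))
    return rows


def sort_by_right_context(rows):
    """Sort concordance rows alphabetically by the text after the keyword."""
    return sorted(rows, key=lambda row: row[2].strip().lower())


def format_kwic(rows, width=30):
    """Align rows so the keyword sits in a fixed column."""
    return [f"{left:>{width}} [{kw}] {right}" for left, kw, right in rows]


# Hypothetical sample text, just for the demonstration.
sample = "The cat sat on the mat. A cat is not a dog. The dog chased the cat away."
rows = kwic(sample, "cat", width=15)
for line in format_kwic(sort_by_right_context(rows), width=15):
    print(line)
```

Sorting on the left context instead would only need a different sort key (for example, the reversed left string), which is roughly what "left and right context sorting" refers to.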

 

TextSTAT – Simple Text Analysis Tool

TextSTAT is a simple programme for the analysis of texts. It reads plain text files (in different encodings) and HTML files (directly from the internet) and it produces word frequency lists and concordances from these files. This version includes a web-spider which reads as many pages as you want from a particular website and puts them in a TextSTAT-corpus. The new news-reader, too, puts news messages in a TextSTAT-readable corpus file.
TextSTAT reads MS Word and OpenOffice files. No conversion needed, just add the files to your corpus…
In TextSTAT you can use regular expressions, which give you powerful search possibilities. The programme is multilingual. Because it uses Unicode internally, TextSTAT can cope with many different languages and file encodings.
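As a rough illustration of what a word frequency list and a regular expression search look like in practice, here is a short Python sketch. It does not reproduce TextSTAT's behaviour; the helper names `word_frequencies` and `regex_matches`, the stoplist, and the sample sentence are assumptions chosen for the example.

```python
import re
from collections import Counter


def word_frequencies(text, stoplist=()):
    """Count lowercase word tokens, skipping any word in the stoplist."""
    stop = {w.lower() for w in stoplist}
    # \w is Unicode-aware for Python 3 strings, so accented letters are kept.
    words = re.findall(r"\w+", text.lower())
    return Counter(w for w in words if w not in stop)


def regex_matches(text, pattern):
    """Count every distinct string matched by a regular expression."""
    return Counter(
        m.group(0).lower() for m in re.finditer(pattern, text, re.IGNORECASE)
    )


# Hypothetical sample sentence, just for the demonstration.
sample = "Analysing texts: the analyst analysed a text, then analysed more texts."
freq = word_frequencies(sample, stoplist=["the", "a", "then"])
print(freq.most_common(3))

# All word forms starting with "analy".
print(regex_matches(sample, r"\banaly\w*"))
```

A pattern such as `\banaly\w*` groups related word forms in one search, which is the kind of thing a plain keyword lookup cannot do.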

 

AntConc

AntConc started out as a relatively simple concordance program, but has been slowly progressing to become a rather useful text analysis tool. It is written in Perl 5.8 using ActiveState’s excellent Komodo development environment.

 

WordSmith

The current version is 5.0 and is available from Lexical Analysis Software Ltd. The tools include a concordancer, word-listing facilities, a tool for computing the keywords of a text or genre, and a series of other utilities, e.g. to convert text files from one format to another, to make multiple changes in many text files, to examine the format of text files, or to download texts from the web.

 

AdTAT – the Adelaide Text Analysis Tool

A free concordance tool by The University of Adelaide.

 

[Update 2013/04/08] You might wanna check out the widely popular LIWC. I ended up writing a Python script that counts keywords in CSV files.
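The script itself is not reproduced here, so the following is only a minimal sketch of what counting keywords in a CSV file could look like, not the original script. The function name `count_keywords_in_csv`, the `response` column, the keywords, and the in-memory demo data are all assumptions.

```python
import csv
import io
import re
from collections import Counter


def count_keywords_in_csv(csv_file, column, keywords):
    """Count whole-word, case-insensitive keyword hits in one CSV column."""
    patterns = {
        kw: re.compile(r"\b" + re.escape(kw) + r"\b", re.IGNORECASE)
        for kw in keywords
    }
    totals = Counter({kw: 0 for kw in keywords})
    for row in csv.DictReader(csv_file):
        cell = row.get(column) or ""  # tolerate missing or empty cells
        for kw, pattern in patterns.items():
            totals[kw] += len(pattern.findall(cell))
    return totals


# Hypothetical data; with a real file, pass open("data.csv", newline="") instead.
demo = io.StringIO(
    "id,response\n"
    "1,I trust my team and my team trusts me\n"
    "2,Trust takes time\n"
    "3,No comment\n"
)
counts = count_keywords_in_csv(demo, "response", ["trust", "team"])
print(dict(counts))
```

Because the match is on whole words, "trusts" is not counted as "trust"; dropping the `\b` boundaries would count it as a substring hit instead.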

[Update 2014/09/16] You might also want to check Wmatrix corpus analysis. Looks promising.

 

See this list for more related software packages for various qualitative research methods. If you know of any others, let me know.

7 Comments on "Free Concordance / Keyword Frequency Text Analysis Tools"

Mona
Guest

i need some help; who can help me with a free concordancer i want to use it for my research in the field of applied linguistics. discourse analysis

Gilad Feldman
Guest

Mona – I suggest contacting the people behind the app. I have very little experience with this.

Philocalist
Guest

Me too Mona, I am going to work on academic writing using concordancer. I have some questions, can I contact you please?
Thanks

MariusDejess
Guest
Thanks for your listing of concordancers, I hope I will find one that is free and will read internet websites as the corpus, and easy, quick to use. Some years back I found one but now I don’t have it anymore because of flooding in my place, and I lost my computer or it got damaged by the flood, beyond repair. That was a very simple and quick to use concordancer of internet texts, I just enter the word(s), and it produced lines after lines of text with word(s) sought for represented as a short line like this _____ .…
MariusDejess
Guest

Well, I thought that this was the one I used to have:

Download (binary version for MS Windows XP/Win7):

TextSTAT 2.9c for Windows (ZIP file, approx. 8 MB, Feb 20, 2014)

But it would not download although I clicked download twice and nothing happened, no indication of any downloading taking place, and the box appearing on my screen has the message to open with or to save to, I clicked to save to.

Now I will try to open with.

Hermetic Systems
Guest

Since WordSmith is not (or no longer) free, perhaps I may be permitted to mention our (not free but inexpensive) text analysis software:

Hermetic Word Frequency Counter http://www.hermetic.ch/wfc/wfc.htm
This scans a text file or a Word docx file and counts the occurrences of different words. What is a ‘word’ is user-definable. The words found and displayed can be ordered alphabetically or by frequency.

Hermetic Word Frequency Counter Advanced Version http://www.hermetic.ch/wfca/wfca.htm
As above, and can scan multiple files at once, also counts phrases, and supports pattern-matching.
Other features too numerous to list.

Hermetic Systems
Guest

Since our last comment we have released an actual concordance program. It scans a text file or an MS Word docx file, ignoring stop words, to generate a KWIC concordance which can then be searched, including searches for word patterns and for two word (or word patterns) in close proximity. Description and screenshots at http://www.hermetic.ch/skc/skc.htm
