Freely Available Automatic Text Analysis Tools:
ARTE is a tool that will automatically calculate a variety of readability formulas for texts. The readability formulas include classic formulas like Flesch-Kincaid Grade Level and new formulas like the Crowdsourced Algorithm of Reading Comprehension (CAREC)
Click here to learn more CLA is a simple but powerful text analysis tool. One can use CLA to analyze texts using very large custom dictionaries. In addition to words, custom dictionaries can include n-grams and wildcards.
Click here to learn more |
CRAT is an easy to use tool that includes over 700 indices related to lexical sophistication, cohesion and source text/summary text overlap. CRAT is particularly well suited for the exploration of writing quality as it relates to summary writing. Click here to learn more
|
GAMET is an easy to use tool that provides incidence counts for structural and mechanics errors in texts including grammar, spelling, punctuation, white space, and repetition errors. The tool also provides line output for the errors flagged in the text. Click here to learn more.
|
SEANCE is an easy to use tool that includes 254 core indices and 20 component indices based on recent advances in sentiment analysis. In addition to the core indices, SEANCE allows for a number of customized indices including filtering for particular parts of speech and controlling for instances of negation. Click here to learn more
|
SiNLP is a simple tool that allows users to analyze texts with regard to the number of words, number of types, TTR, letters per word, number of paragraphs, number of sentences, and number of words per sentence for each text. In addition, users can analyze texts with regard to their own custom dictionaries. Click here to learn more
|
TAACO is an easy to use tool that calculates 150 indices of both local and global cohesion, including a number of type-token ratio indices, adjacent overlap indices, and connectives indices. The tool also measures text overlap between two texts (intertextual cohesion). (TAACO 2.0 now available!) Click here to learn more
|
TAALED is an analysis tool designed to calculate a wide variety of lexical diversity indices. Homographs are disambiguated using part of speech tags, and indices are calculated using lemma forms. Indices can also be calculated using all lemmas, content lemmas, or function lemmas. Click here to learn more
TAALES is a tool that measures over 400 classic and new indices of lexical sophistication, and includes indices related to a wide range of sub-constructs. Included are indices for both single words and n-grams. Starting with version 2.2, TAALES also provides comprehensive index diagnostics. (TAALES 2.2 now available!) Click here to learn more
|
TAASSC is an advanced syntactic analysis tool that measures fine-grained indices of clausal and phrasal complexity, classic indices of syntactic complexity, and frequency-based verb argument construction indices. Click here to learn more
TAMMI calculates measures related to basic morpheme counts, morphological variety, morphological complexity, morpheme type-token counts, and variables found in the MorphoLex database (Sánchez-Gutiérrez et al., 2017) including morpheme frequency/length, morpheme family size counts and frequency, and morpheme hapax counts. Click here to learn more
|