Explore Your Text Corpus

0 / 3000
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

Benford Law

Benford is an observation about the frequency distribution of leading digits in many real-life data. This graph combines Benford law with the custom probability function from (First Letter of Word) paper to show Benford's magic for text corpus.

1: represent the most-used first letter frequency,
2: the 2nd most-used first letter, etc.

Benford law diagram

Word Cloud

Data visualization technique to represent the most used words from your corpse (not-including stopwords) in a handy graphical image.

Word cloud diagram


Generate a frequency distribution histogram that represents all words (not-including stopwords) from your text. This tool useful to understand the text before processing and compare between different corpora.

Words Frequency Diagram
Text Length Total Sentences Total Tokens

Without Punct.

- - -
Unique Tokens Tokens Size Mean Total Words
- - -
Stopwords Counts Punctuations Punctuations %
- - -
Noun Count Verb Count Adjective Count
- - -


Statistics summary represents your text; these insights are useful before applying text processing or analysis.

Note that sentence counter works best with paragraphs that include punctuations.

TextMiner uses pos tagging to understand the relationship between words and sentences.

Get More Insights
For sentiment analysis, custom entity extraction, automatic text generation and more
TextMiner Copyright © 2020 by AhmadAI