Free Text Concordance — KWIC, n-grams, Frequency
Build a keyword-in-context concordance and unigram/bigram/trigram frequency tables for any text. Runs in your browser.
runs locally — nothing leaves your browser
What is Free Text Concordance — KWIC, n-grams, Frequency?
Text Concordance gives researchers a quick, browser-native view of how words behave inside a text. It computes frequencies for unigrams, bigrams, and trigrams, supports a stopword filter so the top of the list isn't 'the, of, and', and produces a keyword-in-context (KWIC) view — every occurrence of a search term with a window of surrounding context. Concordance views were the original tool of corpus linguistics and qualitative coding, and they remain useful any time you want to understand how a term is actually used rather than how often it appears.
When to use this
- →Qualitative coding of interview transcripts or open-ended survey responses
- →Checking how an author uses a specific term across a chapter or paper
- →Quick literary or stylistic analysis of a text corpus
- →Spotting overused phrases or filler patterns in your own writing
How it works
The text is tokenised with a Unicode-aware regex that captures letter and digit sequences. N-gram frequencies are computed by sliding a window of size 1, 2, or 3 across the token list. The stopword filter is a built-in English list applied before n-gram counting. KWIC search runs a regex over the original text (optionally case-sensitive and whole-word) and slices a window of characters on each side of every match.
Example use cases
Interview coding
Paste a transcript, find every mention of a theme with surrounding context, and copy quotes for analysis.
Style audit
Surface your own most-used bigrams to spot crutches and overused phrasing.
Term tracking
Find every appearance of a key concept across a chapter to verify consistent usage.
Free Text Concordance — KWIC, n-grams, Frequency
Interactive ToolHow to use
- 1
Paste your corpus
Drop in an article, transcript, chapter, or pile of research notes.
- 2
Inspect frequencies
Switch between unigram, bigram, and trigram views; toggle stopword filtering.
- 3
Click a term for KWIC
Click any term to jump to a keyword-in-context view with adjustable window size.
Why use this tool?
- Unigram, bigram, and trigram frequency tables in one view
- Optional stopword filter for English so frequencies surface real terms
- Keyword-in-context (KWIC) view with case-sensitivity and whole-word options
- Type-token ratio and token counts for quick lexical diversity checks
Frequently asked questions
- What is KWIC?
- KWIC stands for Keyword In Context — a concordance format that shows each occurrence of a search term with a window of surrounding text, so you can see how the word is used rather than just how often.
- Which stopwords are filtered?
- A built-in English list of the most common function words (the, of, and, is, etc.). Turn the toggle off if you want the raw frequencies including stopwords.
- How big a text can it handle?
- It runs comfortably on chapter-length text. Very large corpora (millions of tokens) will be slow because everything runs in a single render pass.
- Is my text sent anywhere?
- No. All tokenisation, frequency counting, and KWIC search run in your browser.
Related tools
Count words, characters, sentences, and estimate reading time instantly in your browser. No sign-up required.
Remove duplicate lines from any text instantly in your browser. Supports case-sensitive and case-insensitive modes.
Pull DOIs, arXiv IDs, PMIDs, ISBNs, and URLs out of any prose, paper, or references list. Runs in your browser.
Popular right now
Format, validate, and minify JSON instantly in your browser. Your data never leaves your device.
Decode JWT tokens and inspect header and payload instantly in your browser. Your tokens never leave your device.
Count words, characters, sentences, and estimate reading time instantly in your browser. No sign-up required.