Utilora

Free Text Concordance — KWIC, n-grams, Frequency

Build a keyword-in-context concordance and unigram/bigram/trigram frequency tables for any text. Runs in your browser.

runs locally — nothing leaves your browser

What is Free Text Concordance — KWIC, n-grams, Frequency?

Text Concordance gives researchers a quick, browser-native view of how words behave inside a text. It computes frequencies for unigrams, bigrams, and trigrams, supports a stopword filter so the top of the list isn't 'the, of, and', and produces a keyword-in-context (KWIC) view — every occurrence of a search term with a window of surrounding context. Concordance views were the original tool of corpus linguistics and qualitative coding, and they remain useful any time you want to understand how a term is actually used rather than how often it appears.

When to use this

  • Qualitative coding of interview transcripts or open-ended survey responses
  • Checking how an author uses a specific term across a chapter or paper
  • Quick literary or stylistic analysis of a text corpus
  • Spotting overused phrases or filler patterns in your own writing

How it works

The text is tokenised with a Unicode-aware regex that captures letter and digit sequences. N-gram frequencies are computed by sliding a window of size 1, 2, or 3 across the token list. The stopword filter is a built-in English list applied before n-gram counting. KWIC search runs a regex over the original text (optionally case-sensitive and whole-word) and slices a window of characters on each side of every match.

Example use cases

Interview coding

Paste a transcript, find every mention of a theme with surrounding context, and copy quotes for analysis.

Style audit

Surface your own most-used bigrams to spot crutches and overused phrasing.

Term tracking

Find every appearance of a key concept across a chapter to verify consistent usage.

Free Text Concordance — KWIC, n-grams, Frequency

Interactive Tool

How to use

  1. 1

    Paste your corpus

    Drop in an article, transcript, chapter, or pile of research notes.

  2. 2

    Inspect frequencies

    Switch between unigram, bigram, and trigram views; toggle stopword filtering.

  3. 3

    Click a term for KWIC

    Click any term to jump to a keyword-in-context view with adjustable window size.

Why use this tool?

  • Unigram, bigram, and trigram frequency tables in one view
  • Optional stopword filter for English so frequencies surface real terms
  • Keyword-in-context (KWIC) view with case-sensitivity and whole-word options
  • Type-token ratio and token counts for quick lexical diversity checks

Frequently asked questions

What is KWIC?
KWIC stands for Keyword In Context — a concordance format that shows each occurrence of a search term with a window of surrounding text, so you can see how the word is used rather than just how often.
Which stopwords are filtered?
A built-in English list of the most common function words (the, of, and, is, etc.). Turn the toggle off if you want the raw frequencies including stopwords.
How big a text can it handle?
It runs comfortably on chapter-length text. Very large corpora (millions of tokens) will be slow because everything runs in a single render pass.
Is my text sent anywhere?
No. All tokenisation, frequency counting, and KWIC search run in your browser.

Related tools

Popular right now