Free Text Concordance — KWIC, n-grams, Frequency

Text Tools

What is Free Text Concordance — KWIC, n-grams, Frequency?

Text Concordance gives researchers a quick, browser-native view of how words behave inside a text. It computes frequencies for unigrams, bigrams, and trigrams, supports a stopword filter so the top of the list isn't 'the, of, and', and produces a keyword-in-context (KWIC) view — every occurrence of a search term with a window of surrounding context. Concordance views were the original tool of corpus linguistics and qualitative coding, and they remain useful any time you want to understand how a term is actually used rather than how often it appears.

How it works

The text is tokenised with a Unicode-aware regex that captures letter and digit sequences. N-gram frequencies are computed by sliding a window of size 1, 2, or 3 across the token list. The stopword filter is a built-in English list applied before n-gram counting. KWIC search runs a regex over the original text (optionally case-sensitive and whole-word) and slices a window of characters on each side of every match.

Features & Benefits

Unigram, bigram, and trigram frequency tables in one view
Optional stopword filter for English so frequencies surface real terms
Keyword-in-context (KWIC) view with case-sensitivity and whole-word options
Type-token ratio and token counts for quick lexical diversity checks

Frequently Asked Questions

What is KWIC?

KWIC stands for Keyword In Context — a concordance format that shows each occurrence of a search term with a window of surrounding text, so you can see how the word is used rather than just how often.

Which stopwords are filtered?

A built-in English list of the most common function words (the, of, and, is, etc.). Turn the toggle off if you want the raw frequencies including stopwords.

How big a text can it handle?

It runs comfortably on chapter-length text. Very large corpora (millions of tokens) will be slow because everything runs in a single render pass.

Is my text sent anywhere?

No. All tokenisation, frequency counting, and KWIC search run in your browser.

Related Tools

Word Counter

Count words, characters, sentences, and estimate reading time instantly in your browser. No sign-up required.

Remove Duplicate Lines Tool

Remove duplicate lines from any text instantly in your browser. Supports case-sensitive and case-insensitive modes.

Citation Extractor — DOI, arXiv, PMID, ISBN, URL

Pull DOIs, arXiv IDs, PMIDs, ISBNs, and URLs out of any prose, paper, or references list. Runs in your browser.

Popular Utilities

JSON Formatter & Validator

Format, validate, and minify JSON instantly in your browser. Your data never leaves your device.

JWT Decoder

Decode JWT tokens and inspect header and payload instantly in your browser. Your tokens never leave your device.

Word Counter

Count words, characters, sentences, and estimate reading time instantly in your browser. No sign-up required.