

New #blog post: Stylometric fingerprinting redux.

Avoid de-anonymization through analysis of your writing style. Defend against machine- and human-driven stylometric identification.

This is an expanded version of a previous microblog about stylometric fingerprinting. Feedback welcome, esp. from anyone with a stylometry or linguistic close-reading background. Excerpt:
To paint with a broad brush, we can divide most stylometric fingerprinting into machine- and human-driven techniques.

Machine-driven techniques: These techniques involve analysis of reading-level metrics, unusual words, machine-identifiable grammatical and spelling errors, and statistical analysis of writing style. Much recent research focuses on statistical analysis of writing style; it’s a rapidly evolving field.

Human-driven techniques: There are some areas in which manual analysis still beats computers. For example, someone you know may recognize your writing style.
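To make the machine-driven side concrete, here is a rough sketch of the kind of features such tools might extract. It is an illustration under my own assumptions, not any particular tool's method: the function name `style_features` and the short `FUNCTION_WORDS` list are hypothetical, the sentence and word splitting is deliberately naive, and the reading-level metric shown is the Automated Readability Index (ARI), picked because it needs only character, word, and sentence counts.

```python
import re
from collections import Counter

# Hypothetical, deliberately tiny list of function words for illustration.
# Real stylometric tools typically track far more of them.
FUNCTION_WORDS = ("the", "of", "and", "to", "a", "in", "that", "but", "which", "however")


def style_features(text):
    """Return a few simple stylometric features for a writing sample."""
    # Naive sentence split on runs of terminal punctuation.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    # Lowercased words; internal apostrophes are kept (e.g. "don't").
    words = re.findall(r"[a-z]+(?:'[a-z]+)*", text.lower())
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")

    # Count letters only, so apostrophes do not inflate word length.
    chars = sum(len(w.replace("'", "")) for w in words)
    counts = Counter(words)

    words_per_sentence = len(words) / len(sentences)
    chars_per_word = chars / len(words)

    features = {
        "words_per_sentence": words_per_sentence,
        "chars_per_word": chars_per_word,
        # Vocabulary richness: distinct words divided by total words.
        "type_token_ratio": len(counts) / len(words),
        # Automated Readability Index (reading-level metric).
        "ari": 4.71 * chars_per_word + 0.5 * words_per_sentence - 21.43,
    }
    # Relative frequency of each function word in the sample.
    for fw in FUNCTION_WORDS:
        features["freq_" + fw] = counts[fw] / len(words)
    return features


if __name__ == "__main__":
    sample = "The cat sat. The dog ran!"
    for name, value in style_features(sample).items():
        print(f"{name}: {value:.3f}")
```

Comparing feature vectors like these across many samples is, roughly, what the statistical approaches build on; individually the numbers say little, but together and over enough text they can become distinctive.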
#blog