This debt we pay to human guile; with torn and bleeding hearts we smile, and mouth with myriad subtleties.
February 22, 2012 5:33 AM
"We study techniques for identifying an anonymous author via linguistic stylometry, i.e., comparing the writing style against a corpus of texts of known authorship. We experimentally demonstrate the effectiveness of our techniques with as many as 100,000 candidate authors. [...] In over 20% of cases, our classifiers can correctly identify an anonymous author given a corpus of texts from 100,000 authors; in about 35% of cases the correct author is one of the top 20 guesses."

On the Feasibility of Internet-Scale Author Identification [pdf] is a draft of a paper for the IEEE Symposium on Security and Privacy.
Covered yesterday by Cory Doctorow, via the blog of Narayanan, one of the paper's authors. If you want to build a tool to obfuscate your text, you might start by reading Obfuscating Document Stylometry to Preserve Author Anonymity [pdf] by Gary Kacmarcik & Michael Gamon, and Practical Attacks Against Authorship Recognition Techniques [pdf] by Michael Brennan, who presented at the 26th Chaos Communication Congress. Unfortunately, the Automouth tool does not appear to be out of beta.
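
For anyone curious what "comparing the writing style against a corpus of texts of known authorship" looks like in its most stripped-down form, here is a toy sketch. It is not the paper's method: the ten-word feature list, the cosine-similarity ranking, and the names `FUNCTION_WORDS`, `style_vector`, and `rank_authors` are all assumptions made up for illustration, and a real system would use far richer features and a trained classifier.

```python
import math
import re
from collections import Counter

# Hypothetical, deliberately tiny feature set: a few English function words.
FUNCTION_WORDS = ["the", "of", "and", "to", "a", "in", "that", "is", "it", "but"]


def style_vector(text):
    """Relative frequency of each function word in the text."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    total = max(len(words), 1)  # avoid dividing by zero on empty text
    return [counts[w] / total for w in FUNCTION_WORDS]


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def rank_authors(anonymous_text, corpus):
    """Return candidate authors ordered from most to least similar in style.

    `corpus` maps an author name to a sample of that author's known writing.
    """
    target = style_vector(anonymous_text)
    scores = {
        author: cosine(target, style_vector(sample))
        for author, sample in corpus.items()
    }
    return sorted(scores, key=scores.get, reverse=True)
```

Given a dict of known writing samples, `rank_authors(text, corpus)[:20]` would be the toy analogue of the "top 20 guesses" in the abstract. An obfuscation tool would work in the opposite direction, nudging a text's feature vector away from its author's usual profile.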