Authorship Detection with Authors from Project Gutenberg



Folders containing raw text files for use in authorship detection.


There are 12 folders, each associated with a famous author. Each folder contains text files written by the identified author, as well as one text file written by a mystery author. Using text analysis tools, one can try to determine whether the "mystery author" is the same as the identified author.
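
As one possible starting point, the comparison can be sketched with a simple stylometric baseline: build a function-word frequency profile for the known author's texts and for the mystery text, then compare the two with cosine similarity. This is only an illustrative sketch, not a method prescribed by the dataset. The word list, the function names (`profile`, `cosine_similarity`, `mystery_similarity`), and the choice of cosine similarity are all assumptions made for this example.

```python
import math
import re
from collections import Counter

# Small illustrative list of common English function words.
# This list is an assumption for the sketch, not part of the dataset.
FUNCTION_WORDS = [
    "the", "and", "of", "to", "a", "in", "that", "it", "is", "was",
    "he", "for", "with", "as", "his", "on", "be", "at", "by", "i",
]


def profile(text):
    """Return the relative frequency of each function word in `text`."""
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    total = len(words) or 1  # avoid division by zero on empty input
    return [counts[w] / total for w in FUNCTION_WORDS]


def cosine_similarity(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def mystery_similarity(known_texts, mystery_text):
    """Compare the pooled known-author texts against the mystery text."""
    known_profile = profile(" ".join(known_texts))
    return cosine_similarity(known_profile, profile(mystery_text))
```

To apply this to one folder, read the known-author files and the mystery file into strings (for example with `pathlib.Path.read_text`) and pass them to `mystery_similarity`. A higher score suggests a closer stylistic match, but any decision threshold would need to be calibrated, for instance by comparing scores across all 12 folders.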

