
The Gutenberg Auf! development project arose out of a perceived
need for a computational aid in the production of
solid, empirical research in the biology
of language1, drawing on a large body of extant historical
language data. The bulk
of the corpus has been drawn from Project
Gutenberg, a collection of public domain literary texts, and indexed by
title,
authour, authour lifespan, publication date, and other variables, to allow,
for example, the graphing of temporal correlations.
Searches are specified in terms of regular expressions, and results can be
examined in a temporal hierarchy or per document.
See the manual for labeled screenshots and more
details.
Gutenberg Auf! version 1.1.3, released March 27, 2006 (self-installing archive)
Full Self-Extracting Executable with Entire Corpus (Warning: 1.2GB, email me for a DVD)
Gutenberg Auf! PDF Reference Manual (including more detailed project description)
Slides from my June 3, 2006 presentation on Calvin, Gutenberg et al at the SFU Open House