Not to take away from anything he did--that concordance is a monumental undertaking--but that obituary link is a little too laudatory. Father Busa didn't really have much to do with the invention of computer language, the web, or hypertext.
The idea of even a semi-automated lemmatization of a 13-million-word corpus in 1949 was pretty freaking revolutionary, though. posted by Mr. Bad Example at 6:18 PM on August 10, 2011
Deus ex machina should apply for electronics just as well.
I began, in 1949, with only electro-countable machines with punched cards. My goal was to have a file of 13 million of these cards, one for each word, with a context of 12 lines stamped on the back. The file would have been 90 meters long, 1.20 m in height, 1 m in depth, and would have weighed 500 tonnes.
In His mercy, around 1955, God led men to invent magnetic tapes. The first were the steel ones by Remington, closely followed by the plastic ones of IBM. Until 1980, I was working on 1,800 tapes, each one 2,400 feet long, and their combined length was 1,500 km, the distance from Paris to Lisbon, or from Milan to Palermo. I used all the generations of the dinosaur computers of IBM at that time. I finished in 1980 (before personal computers came in) with 20 final and conclusive tapes, and with these and the automatic photocompositor of IBM, I prepared for offset the 20 million lines which filled the 65,000 pages of the 56 volumes in encyclopedia format which make up the Index Thomisticus on paper.
Something something medieval scribes still among us something something. (See also the St. John's Bible, an illuminated manuscript produced a few years ago at the abbey at St. John's University in Minnesota.)
Truly this was an amazing feat, given the great physical bulk of the materials: My goal was to have a file of 13 million of these cards, one for each word, with a context of 12 lines stamped on the back. I mean, if someone dropped just a fraction of his punchcards, he'd've been re-sorting the things for a long, long time. posted by wenestvedt at 6:42 AM on August 11, 2011
The idea of even a semi-automated lemmatization of a 13-million-word corpus in 1949 was pretty freaking revolutionary, though.
posted by Mr. Bad Example at 6:18 PM on August 10, 2011