"This is the largest dataset of its kind ever produced."
September 21, 2020 8:37 AM Subscribe
Newspaper Navigator is a project being carried out by Ben Lee (his announcement on Twitter), Innovator in Residence at the Library of Congress. It extracts visual content from 16+ million pages of sixty years of public domain digitized American newspapers and helps people learn to search the visual content using machine learning techniques. Read the FAQ to learn more about how its creator tried to manage algorithmic bias. Fun search terms are offered if you're not feeling creative: national park, giraffe, blimp, hats, stunts. The dataset is publicly available, the code is available and here's a white paper about the process of building it.
This thread has been archived and is closed to new comments