80 Million Tiny Images
January 15, 2009 5:46 PM Subscribe
A visualization of all the nouns in the English language arranged by semantic meaning. [NSFW words included!]
Currently computers have difficult recognizing objects in images. While practical solutions exist for a few simple classes such as human faces or cars, the more general problem of recognizing all different classes of objects in the world (e.g. guitars, bottles, telephones) remains unsolved. Computer Vision researchers are currently investigating methods that can recognize and localize thousands of different object categories in complex scenes. A key component of these algorithms is the data used to train the computers' model of each object. Current approaches use collections of images gathered by hand. Our research explores how the billions of images available on the Internet can be used to train models for object recognition. With overwhelming amounts of data, many problems can be tackled with simple algorithms. We gathered from the web 79 million images. We are using this massive dataset to train a computer to recognize objects within an image and to understand the scenes depicted in photographs.WordNet previously
You can help... get better training data for computer vision algorithms by labeling some of the images.
This thread has been archived and is closed to new comments