Each week, the Internet Archive's tumblr account is completely transformed by a digital resident along a theme of their choosing. [more inside]
Mining books to map emotions through a century. Emotion words aren't consistently used through time, it seems. Things got scary in the 80's.
Mining the Mother of all Data Dumps We now have a relatively massive haul of digital data from the OBL strike. There are several forensic toolkits in use by the private (commercially available) and public sector as well as open-source. Best practices include inventorying all the sources, cloning the sources so as to not damage pristine data, recovering any partial or damaged content, making the cloned sources read-only, adhering to legally-admissible tools standards, and documenting everything. There is an excellent source titled Digital Forensics and Born-Digital Content from the Council on Library and Information Resources [pdf, Resource Shelf]. But what to do next*? [more inside]
It has applications in health care, pharmaceuticals, facial recognition, economics/related areas, and of course, much much more. Previously, MeFi discussed controversial homeland security applications, and the nexus between social networking and mobile devices that further contributes to the pool. With plenty to dig into, let's talk Data Mining in more detail. [more inside]
Why is Miss Congeniality the most frequently rated DVD on Netflix? Database magic reveals the most contentious movies ever.