PLOS’ New Data Policy: Public Access to Data
"PLOS has always required that authors make their data available to other academic researchers who wish to replicate, reanalyze, or build upon the findings published in our journals.
In an effort to increase access to this data, we are now revising our data-sharing policy for all PLOS journals: authors must make all data publicly available, without restriction, immediately upon publication of the article. Beginning March 3rd, 2014, all authors who submit to a PLOS journal will be asked to provide a Data Availability Statement, describing where and how others can access each dataset that underlies the findings." Openscience.org also have a primer on why open science data is important.
posted by jaduncan
on Feb 25, 2014 -
You don’t want your privacy: Disney and the meat space data race
The bands are even uniquely colored and monogrammed with your family members’ names so that they won’t get switched up. Why? Because they don’t want their database to get confused and think that you, a 45-year-old man, rode the teacups instead of your little son Timmy. This is one of the first examples I’ve seen of physical design (e.g., monogramming and coloring) for the sake of digital data purity.
If ever there was a testimony to the importance big data has achieved in business it’s this: We will now shape our physical world to create better streams of digital information.
posted by frimble
on Jan 27, 2014 -
If you use Netflix, you've probably wondered about the specific genres that it suggests to you. Some of them just seem so specific that it's absurd. Emotional Fight-the-System Documentaries? Period Pieces About Royalty Based on Real Life? Foreign Satanic Stories from the 1980s? ... Through a combination of elbow grease and spam-level repetition, we discovered that Netflix possesses not several hundred genres, or even several thousand, but 76,897 unique ways to describe types of movies.
posted by Horace Rumpole
on Jan 2, 2014 -
A few months ago there was a list of links to classic video game emulators posted.
Very recently, I'm pleased to report, those links all came true
. The Internet Archive bespoke upon aforementioned consoles, computers, and mileposts on our way to the tech utopia of today, (seriously, where's my flying car?) and they asked us to do something: Imagine every computer that ever existed, literally, in your browser
. And it was so.
I have absolutely no affiliation with jscott
, btw. Thought I should disclose that.
posted by jdaura
on Oct 25, 2013 -
is an application of BioGraph
-based data mining to music, which helps you get recommendations for other musicians. Based on 140K user-defined tags from last.fm
that are collected for over 400K artists, results are sorted by the "nearest" or most probable matches for your artist of interest (algorithm described here
). [more inside]
posted by Blazecock Pileon
on Oct 2, 2013 -
"We live in a world where digital information is exploding
. Some 90% of the world’s data was generated in the past two years. The obvious question is: how can we store it all? In Nature Communications today
, we, along with Richard Evans from CSIRO, show how we developed a new technique to enable the data capacity of a single DVD to increase from 4.7 gigabytes up to one petabyte (1,000 terabytes). This is equivalent of 10.6 years of compressed high-definition video or 50,000 full high-definition movies."
posted by Blazecock Pileon
on Jun 20, 2013 -
Every Noise At Once.
A map of musical genres, built by Glenn McDonald of The War Against Silence and the Echo Nest. Click on a genre name to hear a sound sample, or pop it open to see a map of bands within that genre.
posted by escabeche
on Apr 30, 2013 -
Is Psychometric g a Myth?
- "As an online discussion about IQ or general intelligence grows longer, the probability of someone linking to statistician Cosma Shalizi's essay g, a Statistical Myth
approaches 1. Usually the link is accompanied by an assertion to the effect that Shalizi offers a definitive refutation of the concept of general mental ability, or psychometric g
." [more inside]
posted by kliuless
on Apr 11, 2013 -
is a publicly–accessible database of food. It serves as a definitive resource for snack enthusiasts throughout the world.
posted by cthuljew
on Apr 9, 2013 -
"While playing around with the Nmap Scripting Engine (NSE) we discovered an amazing number of open embedded devices on the Internet. " After completing the scan of roughly one hundred thousand IP addresses, we realized the number of insecure devices must be at least one hundred thousand. Starting with one device and assuming a scan speed of ten IP addresses per second, it should find the next open device within one hour. The scan rate would be doubled if we deployed a scanner to the newly found device. After doubling the scan rate in this way about 16.5 times, all unprotected devices would be found; this would take only 16.5 hours. Additionally, with one hundred thousand devices scanning at ten probes per second we would have a distributed port scanner to port scan the entire IPv4 Internet within one hour. [more inside]
posted by jquinby
on Mar 18, 2013 -
, the latest project of the nonprofit Sunlight Foundation
, is an easy to navigate comprehensive database of activities from all state capitols that makes it easy to find your state lawmaker, review their votes, search for legislation, track bills and much more.
posted by joedan
on Mar 2, 2013 -
To understand what is at stake we need to make our way through the rhetorical smog. For months prior to the WCIT, the Euro-American press trumpeted warnings that this was to be an epochal clash between upholders of an open Internet and would-be government usurpers, led by authoritarian states like Russia, Iran and China. The terms of reference were set so rigidly that one European telecom company executive called it a campaign of “propaganda warfare” (2).
~ Masters of the Internet, Le Monde Diplomatique
posted by infini
on Feb 12, 2013 -
is a new body of artwork aimed at exploring the gap between data and information. Consisting of a set of images
, this project stems from our interest in glitches, code-breaking, and translation. our custom script encodes text files as images, making it possible to visualize both the size and architecture of large-scale data sets through an aesthetic lens. So if you ever wanted to see hamlet as a jpeg
and find artistic merit hiding within its code, here's your chance. [more inside]
posted by legweak
on Jan 7, 2013 -