facts and figures about the animals we eat
"On September 19th, the Census Bureau released the American Community Survey
of poverty and income
. Based on a much larger survey sample than the older Current Population Survey, the ACS affords a closer
look at state
, and local
and education spending
). It is not a pretty picture.
" --Neat Data visualizations
of the survey info from Dissent Magazine
Top 10 most iconic data graphs of the last decade
FastCoLabs enlisted three data visualization experts to compile this list to answer
posed in Simply Statistics
, a blog from three Johns Hopkins biostatistics professors. via [more inside]
Estimated US Energy Use in 2012: 95.1 Quads
- "Energy flow charts show the relative size of primary energy resources and end uses in the United States, with fuels compared on a common energy unit basis." (via
) [more inside]
Interactive map of pronunciation and use of various words and phrases differs by region in the US.
Based on Bert Vaux's online survey
of English dialects, the program allows you to see results for individual cities, as well as nationwide (though inexplicably it does not include Alaska or Hawaii).
Is Psychometric g a Myth?
- "As an online discussion about IQ or general intelligence grows longer, the probability of someone linking to statistician Cosma Shalizi's essay g, a Statistical Myth
approaches 1. Usually the link is accompanied by an assertion to the effect that Shalizi offers a definitive refutation of the concept of general mental ability, or psychometric g
." [more inside]
U.S. Poverty Rate, 1 in 6, at Highest Level in Years (NYT) -
An additional 2.6 million people slipped below
the poverty line in 2010, census officials said, making 46.2 million people in poverty in the United States
, the highest number in the 52 years the Census Bureau has been tracking it, said Trudi Renwick, chief of the Poverty Statistic Branch
. That represented
15.1 percent of the country. The poverty line in 2010 was at $22,113 for a family of four
A corpus analysis of rock harmony
[PDF] - The analyses were encoded using a recursive notation, similar to a context-free grammar, allowing repeating sections to be encoded succinctly. The aggregate data was then subjected to a variety of statistical analyses. We examined the frequency of different chords
and chord transitions ... Other results concern the frequency of different root motions, patterns of
co-occurrence between chords, and changes in harmonic practice across time.
More information, analysis, and explanation here
Stanford's Visualization Group
has produced a data cleanup web app called Wrangler
that works like straight up magic
give their hopes and dreams for data, data tools and data science
Already, Google has provided Google Refine
) to help clean your datasets. While great visualizations
can be created with online tools
or by combining R (great posts previously
), with ggplot2
, and even Google Motion Charts With R
(already built into Google Spreadsheets
Need data? Needlebase
, helps non-programmers scrape, harvest, merge, and data from the web. Or if you’re introspective, Your Flowing Data
provide tools to measure and chart details of your own life.
The New York Times presents an interactive map of America's population
separated by race, income, and education, according to census data from 2005 to 2009. One dot for every 50 people. (Previously
) [more inside]
hosts competitions to glean information from massive data sets, a la the Netflix Prize
. Competitors can enter free, while companies with vast stores of impenetrable data pay Kaggle to outsource their difficulties to the world population of freelance data-miners. Kaggle contestants have already developed dozens of chess rating systems which outperform the Elo rating currently in use
, and identified genetic markers in HIV associated with a rise in viral load
. Right now, you can compete to forecast tourism statistics
or predict unknown edges in a social network
. Teachers who want to pit their students against each other can host a Kaggle contest free of charge
is World Statistics Day, so help yourself to a metric (haha sorry)
ton of publicly available data at UNdata
, ICSPR (registration required to download data sets)
, and data.gov
). You can also explore, visualize and animate a variety of publicly available data sets with Google Labs' Public Data Explorer
A Tour through the Visualization Zoo
. A survey of powerful visualization techniques, from the obvious to the obscure.
OK Cupid statistics fun:
We collected 552,000 example user pictures.
We paired them up and asked people to make snap judgments.
Here's what we found.
R is quickly becoming the
programming language for data analysis and statistics. R
) is free, open-source, and has hundreds
of packages available. You can use it on the command-line, through a GUI, or in your favorite text editor. Use it with Python
, or Java
R code into LaTeX documents
for reproducible research. [more inside]
Mercenary Epidemiology: Data Reanalysis and Reinterpretation for Sponsors With Financial Interest in the Outcome.
(.pdf link) When should scientists be required to release their raw data for (potentially hostile) re-analysis? A letter to the editors of Annals of Epidemiology from David Michaels, Ph.D., MPH, public health blogger
, author of the book Doubt Is Their Product
, and, as of December 2009, the Assistant Secretary of Labor for OSHA,
unanimously confirmed by the Senate despite the dismay of some
. Michaels interviewed at Science Progress
about Doubt Is Their Product
(podcast, with transcript.)
"Death Risk Rankings
calculates your risk of dying in the next year and allows you to compare that risk to others in the world." Fun with mortality data and statistics from Carnegie Mellon University.
The fine folks at OkCupid
, the dating site, have begun to analyze aggregate data from the questions their users answer to form dating profiles, revealing, among other things, that users in Nevada are more open to rape fanstasies than those from Michigan
. [more inside]
- a network of online data libraries on topics including census data, economic data, health data, income and unemployment data, population data, labor data, cancer data, crime and transportation data, family dynamics, vital statistics data
US Census Bureau Facts & Figures: Holiday Edition
says that more than 20 billion letters, packages and cards
will be delivered this holiday season and 12 million packages a day through to Christmas Eve. Also check out the Special Edition
for comparison data from 1915, 1967 and 2006, the African-American History Month Facts & Features
and more data going back to 2000
statistics on the death penalty
Statistics which suggest that the death penalty does not accomplish what we expected of it. Unless, perhaps, we like revenge.