Still incomplete. See vega#15
Application icons from open-source software projects.
A raster grid of global annual precipitation for the year 2016 at a resolution of 1 degree of lon/lat per cell, from CFSv2.
Graphs in Statistical Analysis, F. J. Anscombe, The American Statistician
The result of a 1930s agricultural experiment in Minnesota, this dataset contains yields for 10 different varieties of barley at six different sites. It was first published by agronomists F.R. Immer, H.K. Hayes, and L. Powers in the 1934 paper "Statistical Determination of Barley Varietal Adaptation." R.A. Fisher popularized its use in the field of statistics when he included it in his book "The Design of Experiments." Since then it has been used to demonstrate new statistical techniques, including the trellis charts developed by Richard Becker, William Cleveland and others in the 1990s.
http://lib.stat.cmu.edu/datasets/
http://scrippsco2.ucsd.edu/data/atmospheric_co2/primary_mlo_co2_record but modified to only include date and CO2 for months with valid data.
https://ourworldindata.org/natural-catastrophes
https://archive.nytimes.com/www.nytimes.com/imagepages/2010/05/02/business/02metrics.html
https://earthquake.usgs.gov/earthquakes/feed/v1.0/summary/all_week.geojson (Feb 6, 2018)
Flight delay statistics from U.S. Bureau of Transportation Statistics, https://www.transtats.bts.gov/OT_Delay/OT_DelayCause1.asp.
Transformed using /scripts/flights.js
Generated using /scripts/github.py
Data about engineers from https://www.bls.gov/oes/tables.htm. Hurricane data from http://www.nhc.noaa.gov/paststate.shtml. Income data from https://factfinder.census.gov/faces/tableservices/jsf/pages/productview.xhtml?pid=ACS_07_3YR_S1901&prodType=table.
Generated using the -graticule console option of http://mapshaper.org
The state of Iowa has dramatically increased its production of renewable wind power in recent years. This file contains the annual net generation of electricity in the state by source in thousand megawatthours. The dataset was compiled by the U.S. Energy Information Administration and downloaded on May 6, 2018. It is useful for illustrating stacked area charts.
More than 60 people lost their lives amid the looting and fires that ravaged Los Angeles for five days starting on April 29, 1992. This file contains metadata about each person, including the geographic coordinates of their death. It was compiled and published by the Los Angeles Times Data Desk.
Boundaries of London boroughs reprojected and simplified from London_Borough_Excluding_MHW shapefile held at https://data.london.gov.uk/dataset/statistical-gis-boundary-files-london. Original data "contains National Statistics data © Crown copyright and database right (2015)" and "Contains Ordnance Survey data © Crown copyright and database right [2015]."
Calculated from londonBoroughs.json using d3.geoCentroid.
Selected rail lines simplified from tfl_lines.json at https://github.com/oobrien/vis/tree/master/tube/data
The dataset has well-known, intentionally included errors. It is used for instructional purposes, including demonstrating the need to reckon with dirty data.
Data from NOAA.
Transformed using /scripts/weather.py
We synthesized the categorical "weather" field from multiple fields in the original dataset. This data is intended for instructional purposes.
In the late 2000s the global economy was hit by a crippling recession. One result: massive job losses across the United States. The downturn in employment, and the slow recovery in hiring that followed, was tracked each month by the Current Employment Statistics program at the U.S. Bureau of Labor Statistics.
This file contains the monthly employment total in a variety of job categories from January 2006 through December 2015. The numbers are seasonally adjusted and reported in thousands. The data were downloaded on Nov. 11, 2018, and reformatted for use in this library.
Totals are included for the 22 "supersectors" tracked by the BLS. The "nonfarm" total is the category typically used by economists and journalists as a stand-in for the country's employment total.
A calculated "nonfarm_change" column has been appended with the month-to-month change in that supersector's employment. It is useful for illustrating how to make bar charts that report both negative and positive values.
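A month-to-month change column of this kind can be sketched as follows. This is only an illustration, not the script used to build the file: the function name and the sample totals are hypothetical, and reporting the first month's change as 0 is an assumption, since the description does not say how the first row is handled.

```python
def month_to_month_change(totals):
    """Return each month's change from the previous month.

    The first month has no predecessor, so its change is reported as 0.
    That choice is an assumption, not something stated for the dataset.
    """
    if not totals:
        return []
    changes = [0]
    for previous, current in zip(totals, totals[1:]):
        changes.append(current - previous)
    return changes


# Hypothetical monthly totals in thousands (not real BLS figures).
nonfarm = [135000, 135250, 135100, 134900]
nonfarm_change = month_to_month_change(nonfarm)
print(nonfarm_change)  # [0, 250, -150, -200]
```

The mix of positive and negative values in the result is what makes such a column suitable for bar charts that extend on both sides of a zero baseline.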
Maunga Whau (Mt Eden) is one of about 50 volcanos in the Auckland volcanic field. This data set gives topographic information for Maunga Whau on a 10m by 10m grid. Digitized from a topographic map by Ross Ihaka, adapted from R datasets. These data should not be regarded as accurate.
NOAA
In an 1822 letter to Parliament, William Playfair, a Scottish engineer who is often credited as the founder of statistical graphics, published an elegant chart on the price of wheat. It plots 250 years of prices alongside weekly wages and the reigning monarch. He intended to demonstrate that “never at any former period was wheat so cheap, in proportion to mechanical labour, as it is at the present time.”
Simulated wind patterns over northwestern Europe.
GeoNames.org