Skip to content

Commit cb43688

Browse files
authored
docs: one-off source URLs adjustments in dataset metadata (#728)
* fix: update broken source URLs in dataset metadata Updates several broken or outdated source URLs in datapackage_additions.toml as a one-off: NOTE: this one-off adjustment does not imply a policy to maintain current links in the example gallery metadata - airports.csv: Update to aviation-facilities1 (old dataset ID removed) - londonBoroughs.json: Update to current data.london.gov.uk URL - population_engineers_hurricanes.csv: Remove outdated FactFinder source (deprecated) - us-10m.json: Fix LICENSE URL (remove incorrect .md extension) - world-110m.json: Fix LICENSE URL (remove incorrect .md extension) - penguins.json: Update LTER URL to palmerpenguins R package site Also fixes documentation URLs in CONTRIBUTING.md: - Update datapackage.org/standard URL All URLs verified accessible. Broken URLs discovered during link checking with lychee. Regenerated datapackage.json and datapackage.md via npm build. * docs: fix broken LICENSE link in CONTRIBUTING.md Update LICENSE reference to point to GitHub README.md#license section instead of local ./LICENSE file which doesn't exist in the repository. * fix: correct Palmer Station LTER URL for penguins dataset Update Palmer Station source URL to the correct LTER site at Rutgers (https://pallter.marine.rutgers.edu/) instead of the palmerpenguins R package site. * Revise/correct resource descriptions and update sources for population_engineers_hurricanes.csv Updated resource descriptions and sources in the datapackage_additions TOML file for data/population_engineers_hurricanes.csv. Corrects year and ACS dataset for population and employment data and confirms ratio denominator. Identifies likely source, a NOAA FAQ, of state-level hurricane count aggregation that specifies the methodology used (e.g. 'direct hits') as well as link to disaggregated data table. * Run build checks and update datapackage timestamps * fix: update airports.csv description and correct source information * chore: regenerate datapackage with updated timestamps
1 parent 198ab81 commit cb43688

File tree

4 files changed

+59
-55
lines changed

4 files changed

+59
-55
lines changed

CONTRIBUTING.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ Thank you for your interest in contributing to Vega Datasets! This repository se
44

55
We welcome contributions that enhance metadata, improve documentation, fix dataset issues, enhance infrastructure, or add new datasets that demonstrate visualization techniques. Each dataset should serve a clear purpose in showcasing visualization capabilities.
66

7-
All contributions are accepted under the [Project's license](./LICENSE). The Project abides by the Vega Organization's [code of conduct](https://github.com/vega/.github/blob/main/CODE_OF_CONDUCT.md) and [governance](https://github.com/vega/.github/blob/main/project-docs/GOVERNANCE.md).
7+
All contributions are accepted under the [Project's license](https://github.com/vega/vega-datasets/blob/main/README.md#license). The Project abides by the Vega Organization's [code of conduct](https://github.com/vega/.github/blob/main/CODE_OF_CONDUCT.md) and [governance](https://github.com/vega/.github/blob/main/project-docs/GOVERNANCE.md).
88

99
## Dataset Contribution Guidelines
1010

@@ -52,7 +52,7 @@ For datasets requiring processing:
5252

5353
## Metadata and Documentation
5454

55-
We follow the [Data Package Standard 2.0](https://datapackage.org/standard/) with:
55+
We follow the [Data Package Standard 2.0](https://datapackage.org/) with:
5656

5757
- Auto-generated `datapackage.json` and `datapackage.md` files
5858
- Datasets in the `data/` directory

_data/datapackage_additions.toml

Lines changed: 17 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -28,13 +28,15 @@ path = "https://www.7-zip.org/license.txt"
2828

2929
[[resources]] # Path: airports.csv
3030
path = "airports.csv"
31-
description = """Airports in the United States and its territories, including major commercial, regional,
31+
description = """Airports in the United States and its territories, including major commercial, regional,
3232
and municipal airports. Contains information about each airport's location (latitude/longitude
33-
coordinates), identification codes, name, city, state, and country."""
33+
coordinates), identification codes, name, city, state, and country. While the exact generation
34+
source of this file is unknown, this data is consistent with files provided on a monthly
35+
frequency by the FAA's [National Airspace System Resource](https://www.faa.gov/air_traffic/flight_info/aeronav/aero_data/NASR_Subscription/)."""
3436

3537
[[resources.sources]]
36-
title = "Airports (Data.gov)"
37-
path = "https://catalog.data.gov/dataset/airports-5e97a"
38+
title = "Federal Aviation Administration"
39+
path = "https://www.faa.gov/air_traffic/flight_info/aeronav/aero_data/NASR_Subscription/"
3840

3941
[[resources.licenses]]
4042
name = "other-open"
@@ -891,7 +893,7 @@ and \"Contains Ordnance Survey data © Crown copyright and database right [2015]
891893

892894
[[resources.sources]]
893895
title = "Statistical GIS Boundary Files, London Datastore"
894-
path = "https://data.london.gov.uk/dataset/statistical-gis-boundary-files-london"
896+
path = "https://data.london.gov.uk/dataset/statistical-gis-boundary-files-for-london-20od9/"
895897

896898
[[resources.licenses]]
897899
title = "UK Open Government License"
@@ -1125,7 +1127,7 @@ description = "Sex of the penguin (MALE, FEMALE or null)"
11251127

11261128
[[resources.sources]]
11271129
title = "Palmer Station Antarctica LTER"
1128-
path = "https://pal.lternet.edu/"
1130+
path = "https://pallter.marine.rutgers.edu/"
11291131
[[resources.sources]]
11301132
title = "Allison Horst's Penguins Repository"
11311133
path = "https://github.com/allisonhorst/penguins"
@@ -1220,18 +1222,18 @@ path = "https://www.ipums.org/about/terms"
12201222

12211223
[[resources]] # Path: population_engineers_hurricanes.csv
12221224
path = "population_engineers_hurricanes.csv"
1223-
description = """Per-state population, number of engineers, and hurricanes. Used in Vega-Lite example,
1225+
description = """Per-state population (2016 ACS 1-Year), ratio of engineers to total civilian employed population (2016 ACS 1-Year), and total hurricane landfalls (possibly 1851-2015). Used in Vega-Lite example,
12241226
[Three Choropleths Representing Disjoint Data from the Same Table](https://vega.github.io/vega-lite/examples/geo_repeat.html)"""
12251227

12261228
[[resources.sources]]
1227-
title = "Bureau of Labor Statistics"
1228-
path = "https://www.bls.gov/oes/tables.htm"
1229+
title = "U.S. Census Bureau, 2016 ACS 1-Year Estimates: Total Population (B01001) and Occupation (S2401)"
1230+
path = "https://www.census.gov/data/developers/data-sets/acs-1year/2016.html"
12291231
[[resources.sources]]
1230-
title = "American Community Survey"
1231-
path = "https://factfinder.census.gov/faces/tableservices/jsf/pages/productview.xhtml?pid=ACS_07_3YR_S1901&prodType=table"
1232+
title = "Continental United States Hurricane Impacts/Landfalls"
1233+
path = "https://www.aoml.noaa.gov/hrd/hurdat/All_U.S._Hurricanes.html"
12321234
[[resources.sources]]
1233-
title = "NOAA National Climatic Data Center"
1234-
path = "https://www.ncdc.noaa.gov/cdo-web/datatools/records"
1235+
title = "NOAA FAQ: How Many Landfalling Hurricanes Have Hit Eact State?"
1236+
path = "https://www.aoml.noaa.gov/hrd-faq/#landfalls-by-state"
12351237

12361238
[[resources.licenses]]
12371239
title = "U.S. Government Dataset"
@@ -1574,7 +1576,7 @@ path = "https://www.census.gov/geographies/mapping-files/time-series/geo/cartog
15741576

15751577
[[resources.licenses]]
15761578
title = "TopoJSON US Atlas ISC License"
1577-
path = "https://github.com/topojson/us-atlas/blob/master/LICENSE.md"
1579+
path = "https://github.com/topojson/us-atlas/blob/master/LICENSE"
15781580

15791581
[[resources]] # Path: us-employment.csv
15801582
path = "us-employment.csv"
@@ -1739,7 +1741,7 @@ path = "https://www.naturalearthdata.com/downloads/110m-cultural-vectors/110m-a
17391741

17401742
[[resources.licenses]]
17411743
title = "TopoJSON World Atlas ISC License"
1742-
path = "https://github.com/topojson/world-atlas/blob/master/LICENSE.md"
1744+
path = "https://github.com/topojson/world-atlas/blob/master/LICENSE"
17431745

17441746
[[resources.licenses]]
17451747
title = "Natural Earth Data Public Domain"

datapackage.json

Lines changed: 15 additions & 15 deletions
Original file line numberDiff line numberDiff line change
@@ -20,7 +20,7 @@
2020
}
2121
],
2222
"version": "3.2.1",
23-
"created": "2025-08-21T23:04:37.691343+00:00",
23+
"created": "2025-10-31T08:34:52.699272+00:00",
2424
"resources": [
2525
{
2626
"name": "icon_7zip",
@@ -49,7 +49,7 @@
4949
{
5050
"name": "airports",
5151
"type": "table",
52-
"description": "Airports in the United States and its territories, including major commercial, regional, \nand municipal airports. Contains information about each airport's location (latitude/longitude \ncoordinates), identification codes, name, city, state, and country.",
52+
"description": "Airports in the United States and its territories, including major commercial, regional, \nand municipal airports. Contains information about each airport's location (latitude/longitude \ncoordinates), identification codes, name, city, state, and country. While the exact generation \nsource of this file is unknown, this data is consistent with files provided on a monthly \nfrequency by the FAA's [National Airspace System Resource](https://www.faa.gov/air_traffic/flight_info/aeronav/aero_data/NASR_Subscription/).",
5353
"licenses": [
5454
{
5555
"name": "other-open",
@@ -58,8 +58,8 @@
5858
],
5959
"sources": [
6060
{
61-
"title": "Airports (Data.gov)",
62-
"path": "https://catalog.data.gov/dataset/airports-5e97a"
61+
"title": "Federal Aviation Administration",
62+
"path": "https://www.faa.gov/air_traffic/flight_info/aeronav/aero_data/NASR_Subscription/"
6363
}
6464
],
6565
"path": "airports.csv",
@@ -2147,7 +2147,7 @@
21472147
"sources": [
21482148
{
21492149
"title": "Statistical GIS Boundary Files, London Datastore",
2150-
"path": "https://data.london.gov.uk/dataset/statistical-gis-boundary-files-london"
2150+
"path": "https://data.london.gov.uk/dataset/statistical-gis-boundary-files-for-london-20od9/"
21512151
}
21522152
],
21532153
"path": "londonBoroughs.json",
@@ -2622,7 +2622,7 @@
26222622
"sources": [
26232623
{
26242624
"title": "Palmer Station Antarctica LTER",
2625-
"path": "https://pal.lternet.edu/"
2625+
"path": "https://pallter.marine.rutgers.edu/"
26262626
},
26272627
{
26282628
"title": "Allison Horst's Penguins Repository",
@@ -2933,7 +2933,7 @@
29332933
{
29342934
"name": "population_engineers_hurricanes",
29352935
"type": "table",
2936-
"description": "Per-state population, number of engineers, and hurricanes. Used in Vega-Lite example,\n[Three Choropleths Representing Disjoint Data from the Same Table](https://vega.github.io/vega-lite/examples/geo_repeat.html)",
2936+
"description": "Per-state population (2016 ACS 1-Year), ratio of engineers to total civilian employed population (2016 ACS 1-Year), and total hurricane landfalls (possibly 1851-2015). Used in Vega-Lite example,\n[Three Choropleths Representing Disjoint Data from the Same Table](https://vega.github.io/vega-lite/examples/geo_repeat.html)",
29372937
"licenses": [
29382938
{
29392939
"title": "U.S. Government Dataset",
@@ -2942,16 +2942,16 @@
29422942
],
29432943
"sources": [
29442944
{
2945-
"title": "Bureau of Labor Statistics",
2946-
"path": "https://www.bls.gov/oes/tables.htm"
2945+
"title": "U.S. Census Bureau, 2016 ACS 1-Year Estimates: Total Population (B01001) and Occupation (S2401)",
2946+
"path": "https://www.census.gov/data/developers/data-sets/acs-1year/2016.html"
29472947
},
29482948
{
2949-
"title": "American Community Survey",
2950-
"path": "https://factfinder.census.gov/faces/tableservices/jsf/pages/productview.xhtml?pid=ACS_07_3YR_S1901&prodType=table"
2949+
"title": "Continental United States Hurricane Impacts/Landfalls",
2950+
"path": "https://www.aoml.noaa.gov/hrd/hurdat/All_U.S._Hurricanes.html"
29512951
},
29522952
{
2953-
"title": "NOAA National Climatic Data Center",
2954-
"path": "https://www.ncdc.noaa.gov/cdo-web/datatools/records"
2953+
"title": "NOAA FAQ: How Many Landfalling Hurricanes Have Hit Eact State?",
2954+
"path": "https://www.aoml.noaa.gov/hrd-faq/#landfalls-by-state"
29552955
}
29562956
],
29572957
"path": "population_engineers_hurricanes.csv",
@@ -3456,7 +3456,7 @@
34563456
"licenses": [
34573457
{
34583458
"title": "TopoJSON US Atlas ISC License",
3459-
"path": "https://github.com/topojson/us-atlas/blob/master/LICENSE.md"
3459+
"path": "https://github.com/topojson/us-atlas/blob/master/LICENSE"
34603460
}
34613461
],
34623462
"sources": [
@@ -3842,7 +3842,7 @@
38423842
"licenses": [
38433843
{
38443844
"title": "TopoJSON World Atlas ISC License",
3845-
"path": "https://github.com/topojson/world-atlas/blob/master/LICENSE.md"
3845+
"path": "https://github.com/topojson/world-atlas/blob/master/LICENSE"
38463846
},
38473847
{
38483848
"title": "Natural Earth Data Public Domain",

datapackage.md

Lines changed: 25 additions & 23 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,5 @@
11
# vega-datasets
2-
`3.2.1` | [GitHub](git+http://github.com/vega/vega-datasets.git) | 2025-08-21 23:04:37 [UTC]
2+
`3.2.1` | [GitHub](git+http://github.com/vega/vega-datasets.git) | 2025-10-31 08:34:52 [UTC]
33

44
Common repository for example datasets used by Vega related projects.
55
BSD-3-Clause license applies only to package code and infrastructure. Users should verify their use of datasets
@@ -36,9 +36,11 @@ Application icon from open-source software project. Used in [Image-based Scatter
3636
### path
3737
airports.csv
3838
### description
39-
Airports in the United States and its territories, including major commercial, regional,
39+
Airports in the United States and its territories, including major commercial, regional,
4040
and municipal airports. Contains information about each airport's location (latitude/longitude
41-
coordinates), identification codes, name, city, state, and country.
41+
coordinates), identification codes, name, city, state, and country. While the exact generation
42+
source of this file is unknown, this data is consistent with files provided on a monthly
43+
frequency by the FAA's [National Airspace System Resource](https://www.faa.gov/air_traffic/flight_info/aeronav/aero_data/NASR_Subscription/).
4244
### schema
4345

4446
| name | type |
@@ -51,9 +53,9 @@ coordinates), identification codes, name, city, state, and country.
5153
| latitude | number |
5254
| longitude | number |
5355
### sources
54-
| title | path |
55-
|:--------------------|:------------------------------------------------|
56-
| Airports (Data.gov) | https://catalog.data.gov/dataset/airports-5e97a |
56+
| title | path |
57+
|:--------------------------------|:---------------------------------------------------------------------------------|
58+
| Federal Aviation Administration | https://www.faa.gov/air_traffic/flight_info/aeronav/aero_data/NASR_Subscription/ |
5759
### licenses
5860
| name | title |
5961
|:-----------|:-------------------------------------|
@@ -1020,9 +1022,9 @@ Boundaries of London boroughs reprojected and simplified from `London_Borough_Ex
10201022
Original data "contains National Statistics data © Crown copyright and database right (2015)"
10211023
and "Contains Ordnance Survey data © Crown copyright and database right [2015].
10221024
### sources
1023-
| title | path |
1024-
|:-------------------------------------------------|:-------------------------------------------------------------------------|
1025-
| Statistical GIS Boundary Files, London Datastore | https://data.london.gov.uk/dataset/statistical-gis-boundary-files-london |
1025+
| title | path |
1026+
|:-------------------------------------------------|:------------------------------------------------------------------------------------|
1027+
| Statistical GIS Boundary Files, London Datastore | https://data.london.gov.uk/dataset/statistical-gis-boundary-files-for-london-20od9/ |
10261028
### licenses
10271029
| title | path |
10281030
|:---------------------------|:---------------------------------------------------------------------------|
@@ -1287,7 +1289,7 @@ variations between species and sexual dimorphism in Antarctic penguins.
12871289
### sources
12881290
| title | path |
12891291
|:------------------------------------|:-----------------------------------------|
1290-
| Palmer Station Antarctica LTER | https://pal.lternet.edu/ |
1292+
| Palmer Station Antarctica LTER | https://pallter.marine.rutgers.edu/ |
12911293
| Allison Horst's Penguins Repository | https://github.com/allisonhorst/penguins |
12921294
### licenses
12931295
| name | path |
@@ -1409,7 +1411,7 @@ http://doi.org/10.18128/D010.V6.0
14091411
### path
14101412
population_engineers_hurricanes.csv
14111413
### description
1412-
Per-state population, number of engineers, and hurricanes. Used in Vega-Lite example,
1414+
Per-state population (2016 ACS 1-Year), ratio of engineers to total civilian employed population (2016 ACS 1-Year), and total hurricane landfalls (possibly 1851-2015). Used in Vega-Lite example,
14131415
[Three Choropleths Representing Disjoint Data from the Same Table](https://vega.github.io/vega-lite/examples/geo_repeat.html)
14141416
### schema
14151417

@@ -1421,11 +1423,11 @@ Per-state population, number of engineers, and hurricanes. Used in Vega-Lite exa
14211423
| engineers | number |
14221424
| hurricanes | integer |
14231425
### sources
1424-
| title | path |
1425-
|:-----------------------------------|:------------------------------------------------------------------------------------------------------------------|
1426-
| Bureau of Labor Statistics | https://www.bls.gov/oes/tables.htm |
1427-
| American Community Survey | https://factfinder.census.gov/faces/tableservices/jsf/pages/productview.xhtml?pid=ACS_07_3YR_S1901&prodType=table |
1428-
| NOAA National Climatic Data Center | https://www.ncdc.noaa.gov/cdo-web/datatools/records |
1426+
| title | path |
1427+
|:------------------------------------------------------------------------------------------------|:---------------------------------------------------------------------|
1428+
| U.S. Census Bureau, 2016 ACS 1-Year Estimates: Total Population (B01001) and Occupation (S2401) | https://www.census.gov/data/developers/data-sets/acs-1year/2016.html |
1429+
| Continental United States Hurricane Impacts/Landfalls | https://www.aoml.noaa.gov/hrd/hurdat/All_U.S._Hurricanes.html |
1430+
| NOAA FAQ: How Many Landfalling Hurricanes Have Hit Eact State? | https://www.aoml.noaa.gov/hrd-faq/#landfalls-by-state |
14291431
### licenses
14301432
| title | path |
14311433
|:------------------------|:-------------------------------------|
@@ -1698,9 +1700,9 @@ in turn is a redistribution of the Census Bureau's cartographic boundary shapefi
16981700
| TopoJSON US Atlas | https://github.com/topojson/us-atlas |
16991701
| US Census Bureau Cartographic Boundary FIles | https://www.census.gov/geographies/mapping-files/time-series/geo/cartographic-boundary.html |
17001702
### licenses
1701-
| title | path |
1702-
|:------------------------------|:------------------------------------------------------------|
1703-
| TopoJSON US Atlas ISC License | https://github.com/topojson/us-atlas/blob/master/LICENSE.md |
1703+
| title | path |
1704+
|:------------------------------|:---------------------------------------------------------|
1705+
| TopoJSON US Atlas ISC License | https://github.com/topojson/us-atlas/blob/master/LICENSE |
17041706
## `us_employment`
17051707
### path
17061708
us-employment.csv
@@ -1897,10 +1899,10 @@ resource for creating web-based world maps where precise boundary detail isn't r
18971899
| TopoJSON World Atlas (Likely original source, processed from Natural Earth data) | https://github.com/topojson/world-atlas |
18981900
| Natural Earth Data - Admin 0 Countries (1:110m) | https://www.naturalearthdata.com/downloads/110m-cultural-vectors/110m-admin-0-countries/ |
18991901
### licenses
1900-
| title | path |
1901-
|:---------------------------------|:---------------------------------------------------------------|
1902-
| TopoJSON World Atlas ISC License | https://github.com/topojson/world-atlas/blob/master/LICENSE.md |
1903-
| Natural Earth Data Public Domain | https://www.naturalearthdata.com/about/terms-of-use/ |
1902+
| title | path |
1903+
|:---------------------------------|:------------------------------------------------------------|
1904+
| TopoJSON World Atlas ISC License | https://github.com/topojson/world-atlas/blob/master/LICENSE |
1905+
| Natural Earth Data Public Domain | https://www.naturalearthdata.com/about/terms-of-use/ |
19041906
## `zipcodes`
19051907
### path
19061908
zipcodes.csv

0 commit comments

Comments
 (0)