| Title: | Spatial Analysis Datasets for Teaching | 
| Version: | 0.1.0 | 
| Description: | Stores small spatial datasets used to teach basic spatial analysis concepts. Datasets are based off of the 'GeoDa' software workbook and data site https://geodacenter.github.io/data-and-lab/ developed by Luc Anselin and team at the University of Chicago. Datasets are stored as 'sf' objects. | 
| Depends: | R (≥ 3.3.0) | 
| License: | CC0 | 
| URL: | https://github.com/spatialanalysis/geodaData | 
| BugReports: | https://github.com/spatialanalysis/geodaData/issues | 
| Encoding: | UTF-8 | 
| LazyData: | true | 
| RoxygenNote: | 7.0.2 | 
| Suggests: | sf | 
| NeedsCompilation: | no | 
| Packaged: | 2020-05-20 01:07:05 UTC; angela | 
| Author: | Angela Li | 
| Maintainer: | Angela Li <ali6@uchicago.edu> | 
| Repository: | CRAN | 
| Date/Publication: | 2020-05-27 09:20:02 UTC | 
geodaData: Spatial Analysis Datasets for Teaching
Description
Stores small spatial datasets used to teach basic spatial analysis concepts. Datasets are based off of the 'GeoDa' software workbook and data site <https://geodacenter.github.io/data-and-lab/> developed by Luc Anselin and team at the University of Chicago. Datasets are stored as 'sf' objects.
Author(s)
Maintainer: Angela Li ali6@uchicago.edu (ORCID)
Other contributors:
- Luc Anselin (Creator of original spatial datasets) [contributor] 
See Also
Useful links:
- Report bugs at https://github.com/spatialanalysis/geodaData/issues 
Chicago Community Areas (2010).
Description
Population in Chicago community areas in 2010.
Usage
chicago_comm
Format
An sf data frame with 77 rows, 4 variables, and a geometry column:
- community
- Community name 
- area_num_1
- Community ID 
- NID
- Community ID (repeated) 
- POP2010
- Population in 2010 
- geometry
- MULTIPOLYGON 
Details
Sf object, unprojected. EPSG 4326: WGS84.
Source
Examples
if (requireNamespace("sf", quietly = TRUE)) {
  library(sf)
  data(chicago_comm)
  plot(chicago_comm["community"])
}
Cleveland Home Sales (2015).
Description
Location and sales price of home sales in a core area of Cleveland, OH for the fourth quarter of 2015.
Usage
clev_pts
Format
An sf data frame with 205 rows, 9 variables, and a geometry column:
- unique_id
- unique parcel id 
- parcel
- unique parcel number 
- x
- point latitude 
- y
- point longitude 
- sale_price
- price paid for the house ($) 
- tract10int
- License plate number and sometimes a description (state, color). Some entries did not include a plate number. 
- quarter
- quarter of sale (4th for all) 
- year1
- year of sale (2015 for all) 
- yrquarter
- year and quarter of sale (4th quarter of 2015 for all) 
- geometry
- POINT 
Details
Sf object, units in ft. EPSG 3734: NAD83 / Ohio North (ftUS).
Source
Cuyahoga County Fiscal Office. https://geodacenter.github.io/data-and-lab//clev_sls_154_core/
Examples
if (requireNamespace("sf", quietly = TRUE)) {
  library(sf)
  data(clev_pts)
  plot(clev_pts["unique_id"])
}
Chicago Population Change (2000-2010).
Description
Change in population in Chicago community areas from 2000 to 2010.
Usage
commpop
Format
An sf data frame with 77 rows, 8 variables, and a geometry column:
- community
- Community name 
- NID
- Community ID 
- POP2010
- Population in 2010 
- POP2000
- Population in 2000 
- POPCH
- Population change, count 
- POPPERCH
- Population percent change 
- popplus
- 1 if area has positive population change (17 observations) 
- popneg
- 1 if area has negative population change (60 observations) 
- geometry
- MULTIPOLYGON 
Details
Sf object, unprojected. EPSG 4326: WGS84.
Source
Examples
if (requireNamespace("sf", quietly = TRUE)) {
  library(sf)
  data(commpop)
  plot(commpop["community"])
}
Guerry "Moral Statistics" (1830s).
Description
Classic social science foundational study by Andre-Michel Guerry on crime, suicide, literacy and other “moral statistics” in 1830s France. Data from the R package Guerry (Michael Friendly and Stephane Dray).
Usage
guerry
Format
An sf data frame with 85 rows, 23 variables, and a geometry column:
- variable
- Description 
- dept, code_de
- Department ID: Standard numbers for the departments 
- region
- Region of France (‘N’=’North’, ‘S’=’South’, ‘E’=’East’, ‘W’=’West’, ‘C’=’Central’). Corsica is coded as NA. 
- dprtmnt
- Department name: Departments are named according to usage in 1830, but without accents. A factor with levels Ain Aisne Allier … Vosges Yonne 
- crm_prs
- Population per Crime against persons. 
- crm_prp
- Population per Crime against property. 
- litercy
- Percent of military conscripts who can read and write. 
- donatns
- Donations to the poor. 
- infants
- Population per illegitimate birth. 
- suicids
- Population per suicide. 
- maincty
- Size of principal city (‘1:Sm’, ‘2:Med’, ‘3:Lg’), used as a surrogate for population density. Large refers to the top 10, small to the bottom 10; all the rest are classed Medium. 
- wealth
- Per capita tax on personal property. A ranked index based on taxes on personal and movable property per inhabitant. 
- commerc
- Commerce and Industry, measured by the rank of the number of patents / population. 
- clergy
- Distribution of clergy, measured by the rank of the number of Catholic priests in active service population. 
- crim_prn
- Crimes against parents, measured by the rank of the ratio of crimes against parents to all crimes – Average for the years 1825-1830. 
- infntcd
- Infanticides per capita. A ranked ratio of number of infanticides to population – Average for the years 1825-1830. 
- dntn_cl
- Donations to the clergy. A ranked ratio of the number of bequests and donations inter vivios to population – Average for the years 1815-1824. 
- lottery
- Per capita wager on Royal Lottery. Ranked ratio of the proceeds bet on the royal lottery to population — Average for the years 1822-1826. 
- desertn
- Military desertion, ratio of number of young soldiers accused of desertion to the force of the military contingent, minus the deficit produced by the insufficiency of available billets – Average of the years 1825-1827. 
- instrct
- Instruction. Ranks recorded from Guerry’s map of Instruction. Note: this is inversely related to Literacy. 
- prsttts
- Number of prostitutes registered in Paris from 1816 to 1834, classified by the department of their birth 
- distanc
- Distance to Paris (km). Distance of each department centroid to the centroid of the Seine (Paris). 
- area
- Area (1000 km^2). 
- pop1831
- Population in 1831, in 1000s. 
- geometry
- MULTIPOLYGON 
Details
Sf object, units in m. EPSG 27572: NTF (Paris) / Lambert zone II.
Source
- Angeville, A. (1836). Essai sur la Statistique de la Population française Paris: F. Doufour. 
- Guerry, A.-M. (1833). Essai sur la statistique morale de la France Paris: Crochard. English translation: Hugh P. Whitt and Victor W. Reinking, Lewiston, N.Y. : Edwin Mellen Press, 2002. 
- Parent-Duchatelet, A. (1836). De la prostitution dans la ville de Paris, 3rd ed, 1857, p. 32, 36 
https://geodacenter.github.io/data-and-lab/Guerry/
Examples
if (requireNamespace("sf", quietly = TRUE)) {
  library(sf)
  data(guerry)
  plot(guerry["CODE_DE"])
}
Homicides & Socio-Economics (1960-90).
Description
Homicides and selected socio-economic characteristics for continental U.S. counties. Data for four decennial census years: 1960, 1970, 1980 and 1990.
Usage
ncovr
Format
An sf data frame with 3085 rows, 69 variables, and a geometry column:
- name
- county name 
- state_name
- state name 
- state_fips
- state fips code (character) 
- cnty_fips
- county fips code (character) 
- fips
- combined state and county fips code (character) 
- stfips
- state fips code (numeric) 
- cofips
- county fips code (numeric) 
- fipsno
- fips code as numeric variable 
- south
- dummy variable for Southern counties (South = 1) 
- hr
- homicide rate per 100,000 (1960, 1970, 1980, 1990) 
- hc
- homicide count, three year average centered on 1960, 1970, 1980, 1990 
- po
- county population, 1960, 1970, 1980, 1990 
- rd
- resource deprivation 1960, 1970, 1980, 1990 (principal component, see Codebook for details) 
- ps
- population structure 1960, 1970, 1980, 1990 (principal component, see Codebook for details) 
- ue
- unemployment rate 1960, 1970, 1980, 1990 
- dv
- divorce rate 1960, 1970, 1980, 1990 (percent males over 14 divorced) 
- ma
- median age 1960, 1970, 1980, 1990 
- pol
- log of population 1960, 1970, 1980, 1990 
- dnl
- log of population density 1960, 1970, 1980, 1990 
- mfil
- log of median family income 1960, 1970, 1980, 1990 
- fp
- percent families below poverty 1960, 1970, 1980, 1990 (see Codebook for details) 
- blk
- percent black 1960, 1970, 1980, 1990 
- gi
- Gini index of family income inequality 1960, 1970, 1980, 1990 
- fh
- percent female headed households 1960, 1970, 1980, 1990 
- geometry
- MULTIPOLYGON 
Details
Sf object, unprojected. EPSG 4326: WGS84.
Source
S. Messner, L. Anselin, D. Hawkins, G. Deane, S. Tolnay, R. Baller (2000). An Atlas of the Spatial Patterning of County-Level Homicide, 1960-1990. Pittsburgh, PA, National Consortium on Violence Research (NCOVR). https://geodacenter.github.io/data-and-lab/ncovr/
Examples
if (requireNamespace("sf", quietly = TRUE)) {
  library(sf)
  data(ncovr)
  plot(ncovr["NAME"])
}
Rental Housing and Demographics in NYC (2000s), non-spatial.
Description
Demographic and housing data for New York City’s 55 sub-boroughs (2000s).
Usage
nyc
Format
A data frame with 55 rows and 34 variables:
- CODE
- sub-borough code, 1XX Bronx, 2XX Brooklyn, 3XX Manhattan, 4XX Queens, 5XX Staten Island 
- FORHIS06
- percentage of hispanic population, not born in US, 2006 
- FORHIS07
- percentage of hispanic population, not born in US, 2007 
- FORHIS08
- percentage of hispanic population, not born in US, 2008 
- FORHIS09
- percentage of hispanic population, not born in US, 2009 
- FORWH06
- percentage of white population, not born in US, 2006 
- FORWH07
- percentage of white population, not born in US, 2007 
- FORWH08
- percentage of white population, not born in US, 2008 
- FORWH09
- percentage of white population, not born in US, 2009 
- HHSIZ1990
- average number of people per household, 1990 
- HHSIZ00
- average number of people per household, 2000 
- HHSIZ02
- average number of people per household, 2002 
- HHSIZ05
- average number of people per household, 2005 
- HHSIZ08
- average number of people per household, 2008 
- KIDS2000
- percentage households w kids under 18, 2000 
- KIDS2005
- percentage households w kids under 18, 2005 
- KIDS2006
- percentage households w kids under 18, 2006 
- KIDS2007
- percentage households w kids under 18, 2007 
- KIDS2008
- percentage households w kids under 18, 2008 
- KIDS2009
- percentage households w kids under 18, 2009 
- NAME
- name of borough, one of five 
- RENT2002
- median monthly contract rent, 2002 
- RENT2005
- median monthly contract rent, 2005 
- RENT2008
- median monthly contract rent, 2008 
- RENTPCT02
- percentage of housing stock that is market rate rental units, 2002 
- RENTPCT05
- percentage of housing stock that is market rate rental units, 2005 
- RENTPCT08
- percentage of housing stock that is market rate rental units, 2008 
- SUBBOROUGH
- name of sub-borough 
- PUBAST90
- percentage of households receiving public assistance, 1990 
- PUBAST00
- percentage of households receiving public assistance, 2000 
- YRHOM02
- average number of years living in current residence, 2002 
- YRHOM05
- average number of years living in current residence, 2005 
- YRHOM08
- average number of years living in current residence, 2008 
- bor_subb
- sub-borough code, repeated 
Details
Dataframe, no spatial components.
Source
https://geodacenter.github.io/data-and-lab/nyc/
Rental Housing and Demographics in NYC (2000s).
Description
Demographic and housing data for New York City’s 55 sub-boroughs (2000s).
Usage
nyc_sf
Format
An sf data frame with 55 rows, 34 variables, and a geometry column:
- forhis06-09
- percentage of hispanic population, not born in US 
- forwh06-09
- percentage of white population, not born in US 
- hhsiz1990
- average number of people per household 
- hhsiz00
- average number of people per household 
- hhsiz02-05-08
- average number of people per household 
- kids2000, kids2005-2009
- percentage households w kids under 18 
- rent2002,2005,2008
- median monthly contract rent 
- rentpct02,05,08
- percentage of housing stock that is market rate rental units 
- pubast90,00
- percentage of households receiving public assistance 
- yrhom02,05,08
- average number of years living in current residence 
- geometry
- MULTIPOLYGON 
Details
Sf object, units in ft. EPSG 2263: NAD83 / New York Long Island (ftUS).
Source
https://geodacenter.github.io/data-and-lab/nyc/
Examples
if (requireNamespace("sf", quietly = TRUE)) {
  library(sf)
  data(nyc_sf)
  plot(nyc_sf["bor_subb"])
}
Ohio Lung Cancer Mortality (1960s-80s).
Description
Ohio lung cancer data for 1968, 1978 and 1988.
Usage
ohio_lung
Format
An sf data frame with 88 rows, 42 variables, and a geometry column:
- county_id
- Sequential county ID (alphabetic order) 
- name
- County name 
- fipsno
- Fips code as numeric 
- lg_ryy
- Lung cancer cases for gender G (M or F) and race R (W or B) in year yy (1968, 1978, 1988) 
- popg_ryy
- Population at risk for gender G (M or F) and race R (W or B) in year yy (1968, 1978, 1988) 
- l_gyy
- Total male and female lung cancer cases for each year 
- pop_gyy
- Total population at risk by gender 
- geometry
- POLYGON 
Details
Sf object, units in m. EPSG 32617: WGS 84 / UTM Zone 17N.
Source
https://geodacenter.github.io/data-and-lab/ohiolung/
Examples
if (requireNamespace("sf", quietly = TRUE)) {
  library(sf)
  data(ohio_lung)
  plot(ohio_lung["FIPSNO"])
}
Abandoned Vehicles (2016).
Description
Point locations of abandoned vehicles in Chicago in September 2016.
Usage
vehicle_pts
Format
An sf data frame with 2635 rows, 10 variables, and a geometry column:
- CreationDt
- Date created 
- Address
- Address of abandoned vehicle 
- ZIPCode
- Zip code of abandoned vehicle 
- X
- Projected X, EPSG 32616 
- Y
- Projected Y, EPSG 32616 
- Ward
- Ward ID 
- PoliceD
- Police district ID 
- Comm
- Community area ID 
- Latitude
- Latitude of vehicle 
- Longitude
- Longitude of vehicle 
- geometry
- POINT 
Details
Sf object, unprojected. EPSG 4326: WGS84.
Source
https://data.cityofchicago.org/Service-Requests/311-Service-Requests-Abandoned-Vehicles/3c9v-pnva
Examples
if (requireNamespace("sf", quietly = TRUE)) {
  library(sf)
  data(vehicle_pts)
  plot(vehicle_pts["CreationDt"])
}