Urban SDK provides data to streamline projects revolving around spatial analysis, transportation, people, and planning with live and historical data.

The English Proficiency datasets are reported once per year by the United States Census. The data reflect persons' abilities to speak English. The dataset is available to download through the Urban SDK Data Hub for the years 2015 - 2020. The data has been aggregated to the census tract.

Census English Proficiency Definition

Data reported by the US census, the measures for English speaking ability are in reference to the diversity of languages spoken within the United States. This data counts the population of English speaking sorted out by ability.

English Proficiency Categories

Across the United States, persons may speak many more languages outside of English. Collecting data on the ability level of speakers of English provides insights into the diversity of the United States.

Metrics used to report on English language speaking abilities:

  • Very Well

  • Well

  • Not Well

  • Not at All

  • Native Speaker

Census English Proficiency Parameters: Census.gov

Contents:

Census English Proficiency Schema

The dataset is available in JSON, Shapefile, GEOJSON, and CSV file types.

Description

Type

Example

geo_id

14-digit code relating the data to the correct geolocation

String

16981048

census_tract_code

The unique identifier provided by the US Census for each census tract

String

14401

sub_category

Level of English speaking

String

Well

unit

Total count of persons

String

Total Count

value

Number of persons fitting within survey criteria

Float

692

year

Year data was collected

String

2019

county_name

Name of the county that the census tract resides in

String

Duval

state_name

Name of the state that the census tract resides in

String

Florida

geometry

The GIS information required for the computer to read the mapping file

String

MULTILINESTRING ((-81.59791 30.24124, -81.59801 30.24135))

Detailed Calculations

geo_id

The “GEO.ID” field contains 14-digit codes that identify the summary level of data, the geographic component of the data, and FIPS codes that uniquely identify the data. For example, the 14-digit “GEO.ID” for Harris County, TX is “0500000US48201” where “050” represents the summary level of the data, “0000” represents the 2-digit geographic variant, and the 2-digit geographic component, “US” represents the United States, “48” represents the state of Texas and “201” represents Harris County.

Did this answer your question?