Exposing Undercounts in the Census through Regression Modeling

Authors

  • Tarun Shah Lane Tech High School

DOI:

https://doi.org/10.47611/jsrhs.v12i3.4662

Keywords:

Census, Undercounts, Limited English Profficiency, Regression, Language, Machine Learning, Statistical Modeling, Population Survey

Abstract

Undercounts in the Census are notoriously difficult to measure but are necessary to understanding the extent of systemic divides within America. Although many community leaders have proposed that language barriers pose significant obstacles to Census outreach, this paper explores the viability of using predictive models to quantify the extent the role language plays. By using a multivariable regression model trained on student data from the Civil Rights Data Collection, we concluded that Limited English Proficient (LEP) Asian populations were undercounted by over 98,000 people, LEP Native Hawaiian/Pacific Islander populations were undercounted by over 73,000 people, and LEP White populations were undercounted by over 164,000 people.  However, biases in the dataset made the results for other ethnic groups unreliable, indicating that regression modeling should be used as a tool for identifying areas of improvement in the Census rather than producing an exact estimate of the population. Our data suggests a strong correlation between language proficiency and Census undercounts, and the Census Bureau could ideally use the results to create targeted language outreach programs for specific states.

Downloads

Download data is not yet available.

References or Bibliography

Abdul-Hakim, Gabriella, and MaryAlice Parks. “Pandemic Shows Need for Native Hawaiians, Pacific Islanders Participation in Census.” ABC News, ABC News Network, 26 May 2020, https://abcnews.go.com/Politics/pandemic-shows-native-hawaiians-pacific-islanders-participation-census/story?id=70873566.

The American Academy of Political and Social Science, 2018, Accurately Counting Asian Americans Is a Civil Rights Issue, https://journals.sagepub.com/doi/abs/10.1177/0002716218765432. Accessed 25 Jan. 2023.

“Census Bureau Releases Estimates of Undercount and Overcount in the 2020 Census.” United States Census Bureau, Census Bureau, 10 Mar. 2022, https://www.census.gov/newsroom/press-releases/2022/2020-census-estimates-of-undercount-and-overcount.html. Accessed 25 Jan. 2023.

Civil Rights Data Collection. “Entire Database of the Civil Rights Data Collection from 2010-2016.” https://ocrdata.ed.gov/resources/downloaddatafile

Dekker, L.M., Krou, C.A., Wright, T.D. & Smith, D.M. (April, 2002). Effective strategies for reducing the overrepresentation of minorities in special education. Paper presented at the Annual Classroom Action Research Conference, South Bend, IN. EDRS Reproduction No. ED464076.

Institute of Education Sciences, 2022,. English Language Development among American Indian English Learner Students in New Mexico, https://ies.ed.gov/ncee/rel/regions/southwest/pdf/REL_2022135.pdf.

Johnson, Kenneth, and Daniel Lichter. “Growing Racial Diversity in Rural America: Results from the 2020 Census.” UNH, University of New Hampshire, 31 May 2022, https://carsey.unh.edu/publication/growing-racial-diversity-in-rural-america#:~:text=Rural%20America%20remains%20predominately%20non,according%20to%20the%202020%20Census.

Núñez, Anna. “Census Data Has Been Misused before - in WWII and after 9/11.” America’s Voice, 3 Apr. 2018, americasvoice.org/blog/census-data-misuse/.

NBC, “Asian Americans Were Overcounted on the Last Census. Experts Say That Fact Is Misleading.” NBCNews.com, NBCUniversal News Group, 29 Apr. 2022, https://www.nbcnews.com/news/asian-america/asian-americans-overcounted-last-census-experts-say-fact-misleading-rcna26509.

O' Hare, William. “2020 Census Faces Challenges in Rural America.” UNH, University of New Hampshire, 21 Oct. 2019, https://carsey.unh.edu/publication/2020-census.

Pew Research Center, 2016, Rise in English Proficiency among U.S. Hispanics Is Driven by the Young, https://www.pewresearch.org/fact-tank/2016/04/20/rise-in-english-proficiency-among-u-s-hispanics-is-driven-by-the-young/. Accessed 25 Jan. 2023.

Pew Research Center, 2013, Second Generation Americans, https://www.pewresearch.org/social-trends/2013/02/07/second-generation-americans/ Accessed 25 Jan. 2023.

U.S. Department of Education, 2007, Status of Education in Rural America, https://nces.ed.gov/pubs2007/2007040.pdf. Accessed 25 Jan. 2023.

Published

08-31-2023

How to Cite

Shah, T. (2023). Exposing Undercounts in the Census through Regression Modeling. Journal of Student Research, 12(3). https://doi.org/10.47611/jsrhs.v12i3.4662

Issue

Section

HS Research Projects