IPUMS Announces 2022 Research Award Recipients

We are excited to announce the winners of our annual IPUMS Research Awards competition. These awards honor outstanding published research and graduate student papers from 2022 that use IPUMS data to advance or deepen our understanding of social and demographic processes.

The 2022 competition awarded prizes for the best published and best graduate student research in eight categories:

  • IPUMS USA, providing data from the U.S. decennial censuses and the American Community Survey, including full count data, from 1850 to the present.
  • IPUMS CPS, providing data from the monthly U.S. labor force survey, the Current Population Survey (CPS), from 1962 to the present.
  • IPUMS International, providing harmonized data contributed by more than 100 international statistical office partners for over 500 censuses and surveys from around the world for 1960 forward as well as full count historical (NAPP) data.
  • IPUMS Health Surveys, which makes available the U.S. National Health Interview Survey (NHIS) and the Medical Expenditure Panel Survey (MEPS).
  • IPUMS Spatial, covering IPUMS NHGIS, IPUMS IHGIS, and IPUMS Terra. NHGIS includes GIS boundary files from 1790 to the present; IHGIS provides data tables from population and housing censuses as well as agricultural censuses from around the world; Terra provides data on population and the environment from 1960 to the present.
  • IPUMS Global Health, providing harmonized data from the Demographic and Health Surveys and the Performance Monitoring and Accountability surveys for low- and middle-income countries from the 1980s to the present.
  • IPUMS Time Use, providing time diary data from the U.S. and around the world from 1965 to the present.
  • IPUMS Excellence in Research, an award that highlights and rewards outstanding work using any of the IPUMS data collections by authors who are underrepresented in social science research.

The awards committee reviewed hundreds of submissions; from these, we selected the 2022 winners.

Continue reading…

IPUMS FAQ: What are Virtual Office Hours?

by Amy Grotsun and Kari Williams

While scrolling through an IPUMS email or checking up on the latest IPUMS tweets, you may have read about IPUMS Virtual Office Hours. You also probably wondered, “What even are virtual office hours?” Today we are here to answer that question.

In the wake of canceled conferences during spring of 2020, IPUMS wanted to create an opportunity to connect with data users about questions they might have when using the data, the kind of thing you might pop by the exhibit hall booth to ask. We found that we connected not only with virtual conference attendees, as expected, but with many other IPUMS data users as well. As in-person conferences resume, we will continue to host virtual office hours quarterly so we can keep connecting with a broad range of IPUMS users.

Continue reading…

IPUMS FAQs: How do I open IPUMS microdata files in my stats package?

By Kari Williams

As part of the IPUMS mission to democratize data, our user support team strives to answer your questions about the data. Over time, some questions are repeated. This blog post is an extension of an earlier series addressing frequently asked questions. Maybe you’ll learn something. Perhaps you’ll just find the information interesting. Regardless, we hope you enjoy it!

Here’s one of those questions:

How do I open IPUMS microdata files in my stats package?

You have honed your research question and analytical approach, identified an IPUMS data collection that suits your needs, learned to navigate the IPUMS interface to create a custom data extract, and just received an email notification that your data file is ready to download. You put your favorite song on the stereo and open your data file in Stata (or whatever statistical software package makes your data analysis dreams come true), and…

record scratch! You see a “file not Stata format” error.

Continue reading…

Malaria Transmission in Context: Linking Health, Census, and Ecological Data

by Yara Ghazal, Ilyana Hohenkirk, Tracy Kugler, and Kelly Searle

Malaria, like many vector-borne diseases, impacts health, economic growth, and society. The burden of malaria incidence and death is concentrated in Sub-Saharan Africa; in 2020, 95% of all malaria cases and 96% of all malaria deaths occurred there (WHO, 2022). Malaria impacts not only population health but also the economic growth of these 32 countries. It is estimated that malaria slows economic growth in this region of Africa by up to 1.3% each year (CCP-JHU, 2015). Understanding malaria transmission is essential to ending its spread and creating a healthier and more prosperous future for developing nations.

The literature on malaria transmission patterns has shown that several environmental factors impact mosquito and parasite vital rates, and thus affect the transmission intensity, seasonality, and geographical distribution of malaria (Castro, 2017). Temperature and precipitation are the primary climate-based factors that influence malaria transmission patterns. Temperature creates geographical constraints for vector and parasite development. Increasing temperatures have been found to shorten mosquito maturation time and increase feeding frequency. However, areas of extremely high temperatures usually yield smaller, less fecund mosquitoes. In parallel, because mosquitoes often breed in pools formed by rainfall and flooding, the frequency, duration, and intensity of precipitation have a significant influence on mosquito populations.

Continue reading…


by Devon Kristiansen

IPUMS was proud to take part in the International Conference on Family Planning in Pattaya City, Thailand. We participated by hosting a pre-conference workshop, sponsoring the conference, staffing an exhibit booth, and presenting research as part of the conference program. The conference, held November 14th through 17th, 2022, had 3,500 in-person attendees as well as many virtual participants.

Research staff representing IPUMS PMA, IPUMS DHS, IPUMS MICS, and IPUMS International conducted a 2-hour pre-conference workshop, providing participants with an overview of each of the IPUMS data collections featuring international data as well as a website and data analysis demonstration.

Continue reading…

Guidance for Pooling Multiple Years of NHIS Data

By Julia A. Rivera Drew


Depending on their research question, analysts will commonly pool multiple years of the National Health Interview Survey (NHIS) data together in order to increase sample sizes of particular subpopulations of interest, such as bisexual adults, immigrants, or pregnant women. The complex design of the NHIS, however, requires analysts to take additional steps to correctly construct and analyze pooled NHIS datasets. Moreover, planned changes to the NHIS design implemented in 2019, as well as changes made in response to the COVID-19 pandemic, require additional special handling to correctly analyze datasets combining multiple years of NHIS data. The objectives of this blog post are to: (1) share tips to correctly construct and analyze pooled NHIS datasets and (2) identify resources for more information.

Tips to Correctly Construct and Analyze Pooled NHIS Datasets

1. Create a pooled sampling weight to use with your pooled dataset.

In general, when pooling multiple years of NHIS data together, you will need to create a new sampling weight to use with the pooled sample. To create this new sampling weight, divide the appropriate sampling weight by the number of years within each distinct sample design period. For example, if one wished to estimate the number of children living in families with low or very low food security (FSSTAT) using pooled 2020-2021 NHIS data (e.g., similar to this report), one would need to create a new sampling weight by dividing the sampling weight identified under the “weights” tab for FSSTAT, SAMPWEIGHT, by the number of years pooled together from the same sampling design period (in this case, two). The sum of the pooled weights would then represent the average annual population size for the pooled time period, rather than the total cumulative population size for the pooled time period. For any given combination of variables, refer to information under the “weights” tab for the variables included in your analysis to help select the appropriate sampling weight. The distinct NHIS sample design periods are 1963-1974, 1975-1984, 1985-1994, 1995-2005, 2006-2015, 2016-2018, and 2019-present.
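The weight adjustment described above can be sketched in a few lines of Python. This is a minimal illustration, not official IPUMS code: the records and weight values are made up, the helper names `design_period` and `pooled_weights` are hypothetical, and the sketch assumes each record carries a `YEAR` field alongside `SAMPWEIGHT`. It also assumes that "the number of years" means the number of pooled survey years falling within the same sample design period, as in the 2020-2021 example.

```python
# Sample design periods as listed in the post; None marks the open-ended
# "2019-present" period.
DESIGN_PERIODS = [
    (1963, 1974), (1975, 1984), (1985, 1994), (1995, 2005),
    (2006, 2015), (2016, 2018), (2019, None),
]


def design_period(year):
    """Return the (start, end) design period containing a survey year."""
    for start, end in DESIGN_PERIODS:
        if year >= start and (end is None or year <= end):
            return (start, end)
    raise ValueError(f"No NHIS sample design period covers {year}")


def pooled_weights(records, weight_var="SAMPWEIGHT"):
    """Divide each record's weight by the number of pooled years that
    share its sample design period (hypothetical helper)."""
    years_by_period = {}
    for rec in records:
        period = design_period(rec["YEAR"])
        years_by_period.setdefault(period, set()).add(rec["YEAR"])
    return [
        rec[weight_var] / len(years_by_period[design_period(rec["YEAR"])])
        for rec in records
    ]


# Made-up records for illustration only (not real NHIS weights).
records = [
    {"YEAR": 2020, "SAMPWEIGHT": 1000.0},
    {"YEAR": 2020, "SAMPWEIGHT": 3000.0},
    {"YEAR": 2021, "SAMPWEIGHT": 4200.0},
]
pooled = pooled_weights(records)
```

Because both years fall in the 2019-present design period, every weight is divided by two, so the pooled weights sum to an average annual population size rather than a cumulative total. Variance estimation for the pooled sample involves further steps that this sketch does not cover.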

Continue reading…

IPUMS Founder Steven Ruggles Awarded MacArthur Fellowship

By Stacy Nordstrom

Steven Ruggles, Historical Demographer, 2022 MacArthur Fellow, Minneapolis, MN

Dr. Steven Ruggles, Regents Professor of History and Population Studies and Director of the Institute for Social Research and Data Innovation at the University of Minnesota, has been honored by the John D. and Catherine T. MacArthur Foundation as one of this year’s MacArthur Fellows. Commonly known as the “genius grant”, the fellowship is regarded as one of the most prestigious awards in the United States for intellectual and artistic achievement.

A historical demographer, Dr. Ruggles is renowned for building IPUMS, the world’s largest publicly available database of population statistics, and an invaluable tool for comparative research across time and space.

“I first met Professor Ruggles when I was working at the National Science Foundation. We have since served on working groups together, and I have been repeatedly impressed by the intellectual rigor and human caring he brings to any problem,” said University of Minnesota Executive Vice President and Provost Rachel T.A. Croson. “His dedicated work on IPUMS has significantly advanced our scientific understanding of the human experience, and has provided data for untold numbers of scholars. This recognition is well-deserved and I am proud that Professor Ruggles is a member of our academic community.”

Continue reading…