Even More IPUMS Data Available in the SDA Online Data Analysis Tool

By Daniel Backman

Beyond offering the ability to create and download customized datasets from the IPUMS microdata collections, we also support web-based analysis of the data through the SDA (Survey Documentation and Analysis) online data analysis tool. SDA empowers users to analyze IPUMS data directly from their web browsers without the need for additional software or advanced programming skills. Whether you’re a seasoned researcher or a student exploring data for the first time, the SDA tool makes it easier than ever to unlock insights from our datasets. If you’re a current SDA user and ready to get started, check out the new datasets from IPUMS CPS and IPUMS MEPS. Otherwise, read on to learn more about SDA and how to use this tool to analyze IPUMS data.

About IPUMS & SDA

What is SDA?

The SDA tool is a web-based interface that allows you to generate frequency tables, cross-tabulations, and summary statistics; create customized data visualizations, including bar charts, line graphs, and scatter plots; perform regression analysis; and export results as a CSV file for presentations or further analysis.

SDA increases the accessibility of data by allowing users to analyze data through a web-interface without needing to use (or purchase!) statistical software. There is detailed guidance on how to use the tool for analyses and how to manipulate variables. Additionally, it provides exceptionally fast real-time processing of data, making it ideal for use in the classroom or other interactive settings. See our data training exercises page for exercises that will guide you through using SDA to analyze IPUMS data.

Continue reading…

Introducing the MEPS Variable Builder!

By Julia A. Rivera Drew

Earlier this year, IPUMS MEPS launched a new feature – the MEPS Variable Builder – to make it dramatically easier to create customized person-level variables that summarize information from the medical event and condition records and add them to your IPUMS extract. If you have ever thought about using the MEPS event and condition data but didn’t know where to begin because of the complexity of the data, the MEPS Variable Builder is for you!

The Medical Expenditure Panel Survey Household Component (MEPS-HC, referred to MEPS here) provides comprehensive information on characteristics of people residing in responding households, as well as information about their medical encounters during the calendar year – e.g., office-based provider visits, emergency room (ER) visits, and hospitalizations – and medical conditions associated with those medical encounters. This unique combination of information makes the MEPS data ideal for research questions that need detailed health care utilization and/or expenditure data alongside individual-level correlates of health. However, these rich data can be difficult to work with, creating barriers for researchers who wish to use the MEPS data.

IPUMS MEPS created the MEPS Variable Builder to enable users to easily build person-level variables summarizing information from the MEPS-HC event and medical condition records, also known as “event summary variables.” Using a point-and-click interface, researchers can create custom event summary variables that count the number of events or sum expenditures across event records, filtered on selected characteristics of events and/or medical conditions. Users can then include these custom event summary variables in their IPUMS extract. At this time, the variable builder does not include prescribed medicines data.

In this blog post, we run through an example where we create a variable that is the sum of all expenditures paid for by Workers’ Compensation for medical visits due to a workplace injury.

Continue reading…

Introducing the MEPS Prescribed Medicines Data

By Julia A. Rivera Drew

The Household Component of the Medical Expenditure Panel Survey (MEPS), administered by the Agency for Healthcare Research and Quality (AHRQ), is a short panel survey collecting information for a nationally representative sample of the civilian, noninstitutionalized population. Since 1996, the MEPS has collected information on demographic and socioeconomic characteristics; health status; medical conditions; and health care access, utilization, and expenditures.

Based on information provided by a family respondent about each family member at each interview, AHRQ produces a dataset of all reported fills of prescribed medicines purchased by family members during the calendar year (including refills). For example, if a prescription was filled monthly, there would be 12 records for that specific prescribed medicine (DRUGID) in the annual file. The prescribed medicines data includes information such as the medication name (RXNAME), national drug code (RXNDC), therapeutic classification (MULTC1), when the person began taking the medication (RXBEGMM and RXBEGYR), amounts paid (RXFEXPTOT), and source of payment (RXFEXPSRC).

IPUMS MEPS provides a harmonized and integrated version of the MEPS Household Component data, including data from the prescribed medicines files.

Continue reading…

Guidance for Pooling Multiple Years of NHIS Data

By Julia A. Rivera Drew

Introduction

Depending on their research question, analysts will commonly pool multiple years of the National Health Interview Survey (NHIS) data together in order to increase sample sizes of particular subpopulations of interest, such as bisexual adults, immigrants, or pregnant women. The complex design of the NHIS, however, requires analysts to take additional steps to correctly construct and analyze pooled NHIS datasets. Moreover, planned changes to the NHIS design implemented in 2019, as well as changes made in response to the COVID-19 pandemic, require additional special handling to correctly analyze datasets combining multiple years of NHIS data. The objectives of this blog post are to: (1) share tips to correctly construct and analyze pooled NHIS datasets and (2) identify resources for more information.

Tips to Correctly Construct and Analyze Pooled NHIS Datasets

Create a pooled sampling weight to use with your pooled dataset.

In general, when pooling multiple years of NHIS data together, you will need to create a new sampling weight to use with the pooled sample. To create this new sampling weight, divide the appropriate sampling weight by the number of years within each distinct sample design period. For example, if one wished to estimate the number of children living in families with low or very low food security (FSSTAT) using pooled 2020-2021 NHIS data (e.g., similar to this report on children and food insecurity), one would need to create a new sampling weight by dividing the sampling weight identified under the “weights” tab for FSSTAT, SAMPWEIGHT, by the number of years pooled together from the same sampling design period (in this case, two). The sum of the pooled weights would then represent the average annual population size for the pooled time period, rather than the total cumulative population size for the pooled time period. For any given combination of variables, refer to information under the “weights” tab for the variables included in your analysis to help select the appropriate sampling weight. The distinct NHIS sample design periods are 1963-1974, 1975-1984, 1985-1994, 1995-2005, 2006-2015, 2016-2018, and 2019-present.

Continue reading…

Using the MEPS-HC to Study Change in Adult Mental Health

By Julia A. Rivera Drew and Natalie Del Ponte

The Medical Expenditure Panel Survey-Household Component, or MEPS-HC, data are an invaluable resource for studying short-term trajectories in health, including adult mental health. An integrated series of the MEPS-HC data is available at IPUMS MEPS. Collected on the Self-Administered Questionnaire and, starting in 2019, the Preventive Self-Administered Questionnaire, the MEPS-HC includes two validated adult mental health scales. The Kessler Psychological Distress Scale (K6) and the two-item Patient Health Questionnaire depression screener (PHQ-2) are asked twice per panel, during interview rounds 2 and 4 (see Table 1). There are also two validated scales measuring health-related quality of life (HRQOL) that capture several interrelated health domains, including mental health. These include the Short Form-12 (SF-12) in 2000-2016 and the Veterans RAND-12 (VR-12) starting in 2017 (see Table 2 for VR-12 measures). For more information on the SF-12, see the section on SF-12 scoring on MCS and for more information on the VR-12, see ADDAYA.

Continue reading…

An Age-Based Approach to Disability

By Solvejg Wastvedt and Yash Singh

Disability is dynamic: it evolves over time and interacts with environmental and societal factors. Due to the complex nature of disability, researchers conceptualize and measure disability differently depending on their research question and available data. For instance, an identical condition might evolve differently for a child facing food insecurity compared to one that has stable access to food. Similarly, a physical limitation for a worker in New York City may have a vastly different impact compared to a similar worker in rural Iowa. Disability can be viewed as the relational concept between individuals with physical or mental impairment and the environment.1 2 This complexity makes measuring disability a challenging task. The following post aims to help researchers understand and use disability measurements available in IPUMS data collections.

Continue reading…

IPUMS Announces 2020 Research Award Recipients

IPUMS research awardsIPUMS is excited to announce the winners of its annual IPUMS Research Awards. These awards honor the best-published research and nominated graduate student papers from 2020 that used IPUMS data to advance or deepen our understanding of social and demographic processes.

IPUMS, developed by and housed at the University of Minnesota, is the world’s largest individual-level population database, providing harmonized data on people in the U.S. and around the world to researchers at no cost.

There are six award categories, and each is tied to the following IPUMS projects:

  • IPUMS USA, providing data from the U.S. decennial censuses, the American Community Survey, and IPUMS CPS from 1850 to the present.
  • IPUMS International, providing harmonized data contributed by more than 100 international statistical office partners; it currently includes information on 500 million people in more than 200 censuses from around the world, from 1960 forward.
  • IPUMS Health Surveys, which makes available the U.S. National Health Interview Survey (NHIS) and the Medical Expenditure Panel Survey (MEPS).
  • IPUMS Spatial, covering IPUMS NHGIS and IPUMS Terra. NHGIS includes GIS boundary files from 1790 to the present; Terra provides data on population and the environment from 1960 to the present.
  • IPUMS Global Health: providing harmonized data from the Demographic and Health Surveys and the Performance Monitoring and Accountability surveys, for low and middle-income countries from the 1980s to the present.
  • IPUMS Time Use, providing time diary data from the U.S. and around the world from 1965 to the present.

Over 2,500 publications based on IPUMS data appeared in journals, magazines, and newspapers worldwide last year. From these publications and from nominated graduate student papers, the award committees selected the 2020 honorees.

Continue reading…