New Frontiers in Big Data

Figure showing growth of microdata records from IPUMS, 1970-2020. There is a steady increase of public-use data over the time period, with the addition of historical data beginning in the early 2000s and a sharp increase in restricted-use IPUMS-format records in Federal Statistical Research Data Centers in the 2010s

By 2020, MPC will make freely available to researchers worldwide 100% count U.S. Census microdata through 1940. This dataset will include over 650 million individual-level (1850-1940) and 7.5 million household-level records (1790-1840). The microdata represents the fruition of longstanding collaborations between MPC and the nation’s two largest genealogical organizations—Ancestry.com and FamilySearch—to leverage genealogical data for scientific purposes.

“The importance of this massive donation of census data would be difficult to overstate,” says MPC Director Steve Ruggles. “This is one of the largest-scale data-entry efforts ever undertaken.”

Continue reading…