The purpose of this page is to give you the tools you will need to explore and analyze the PIAAC dataset. Before working with PIAAC data, make sure to read our "What You Need to Consider" guide.
We also highly recommend that you watch modules 2 and 4 of the Distance Learning Data Training (DLDT) for PIAAC (beneath Assessments: International) before beginning your analysis. DLDT online training modules allow you to learn more about the statistical procedures and methods for proper analysis of PIAAC datasets. For further questions about using the PIAAC dataset, you may visit our Q&A page or contact us at piaac@air.org.
Introduction to PIAAC Data Files
Conduct Analyses
U.S. National Data Files and Resources
International Data Files and Resources
Trend Data Files and Resources
International Log Files
Introduction to PIAAC Data Files
There are two types of resources and databases available for PIAAC: national and international.
National databases contain:
Data files: PIAAC 2012, PIAAC 2012/2014 and PIAAC 2017 Household, 2014 Prison Study
Sample: 16-74 years of age (total U.S. sample)
Variables: International and national variables. International variables are those common across all countries, based on the internationally comparable questionnaire and derived variables. National variables are those collected only in the United States, such as race/ethnicity or English language self-reported proficiency. All cognitive assessment items (literacy, numeracy, digital problem solving).
Formats: SAS, SPSS, ASCII
Documentation: See National section below.
International databases contain:
Data files: PIAAC 2012, PIAAC 2012/2014 and PIAAC 2017 Household
Sample: 16-65 years of age (internationally comparable sample)
Variables: International variables. International variables are those common across all countries, based on the internationally comparable questionnaire and derived variables. All cognitive assessment items (literacy, numeracy, digital problem solving).
Formats: SAS, SPSS, CSV
Documentation: See International section below.
Conduct Analyses
International Data Explorer (IDE) is a user-friendly, online tool that can be used for basic analyses such as averages, percentage distributions, and percentiles as well a few more advanced functions such as gap analysis and regression analysis. It takes into account the complex sampling and assessment design of PIAAC There are two sources of PIAAC IDE: NCES IDE and OECD IDE – that differ in functionality, as well as the listing of countries and available data. The OECD IDE contains only 2012 data for the United States and provides data on international variables for Rounds 1, 2, 3 countries, excluding Cyprus and Indonesia. The NCES IDE contains 2012/14 and 2017 data for the United States, provides data on international variables for Rounds 1 and 2 countries, excluding Russia, Indonesia and Australia, and provides data on the national U.S. variables, as well as U.S. Prison Study. In addition, the NCES IDE has more expanded functions, such as gap analysis and regression analysis, that the OECD IDE does not offer. Please view the PIAAC IDE Training Video for comprehensive, step-by step instructions on how to use the IDE.
PIAAC Results Portal is and interactive online tool designed by NCES to facilitate further exploration of the data introduced by the published reports, such as Skills of U.S. Unemployed, Young, and Older Adults in Sharper Focus and Highlights from the U.S. PIAAC Survey of Incarcerated Adults. It allows users to access pre-made data tables and figures, create their own customizable data tables, and generate national or comparative international reports using a select number of PIAAC variables.
Data Tools
SAS and STATA users can use the macros for analysis, which takes into account the complex sampling and assessment design of PIAAC. You can download the macros and data analysis manual here. For additional information, please read the "What You Need to Consider" guide.
SPSS or SAS users can use the IDB Analyzer for analysis, which takes into account the complex sampling and assessment design of PIAAC. The IDB Analyzer provides a user-friendly interface to easily merge the micro-data files of the participating countries and create SPSS or SAS syntax to produce accurate statistical results. IDB Analyzer is a Windows program used in conjunction with SPSS or SAS for Windows version 15 or higher. The program is available from the International Association for the Evaluation of Educational Achievement (IEA) website. You can download the IDB Analyzer here. Please visit module 5 of the Distance Learning Dataset Training (DLDT) for PIAAC (in the section called Assessments: International) to review an instructional video on how to use the IDB Analyzer.
U.S. National Data Files and Resources
Public Data Files: The U.S. national PIAAC 2017 Household, PIAAC 2012/14 Household, and PIAAC 2014 Prison databases can be found under the "PIAAC Publications & Products" section of the NCES PIAAC website. The U.S. databases include all U.S. data, both international and U.S.-only variables. Resources available include:
U.S. PIAAC database (in SPSS, SAS and ASCII formats)
U.S. codebook and data compendia
U.S. national technical reportClick here to access the PIAAC U.S. Household 2017 public use data files ›
Click here to access the PIAAC U.S. Household 2012/14 public use data files ›
Click here to access the PIAAC U.S. Prison 2014 public use data files ›
Restricted Data Files: For data that was suppressed in the public-use dataset due to confidentiality concerns for respondents, a U.S. restricted-use database is also available. Restricted-use databases contain more detailed information, such as continuous age and earnings variables. To access the restricted-use data, the restricted-use license has to be applied for and obtained from NCES. More information on the process is available at: http://nces.ed.gov/pubsearch/licenses.asp
Click here to view information on the PIAAC U.S. Household 2017 restricted use data files >
Click here to view information on the PIAAC U.S. Household 2012/14 restricted use data files ›
Click here to view information on the PIAAC U.S. Prison 2014 restricted use data files ›
2012 Public and Restricted Household Data Files: The 2012 datasets only contain data from the 2012 Main Study sample and do not include data from the additional sample collected in 2014 U.S. National Supplement (household). The 2012/14 dataset includes data from the expanded national household sample, which supports more accurate and reliable national estimates, more fine-grained estimates for the subgroups oversampled in the household National Supplement (young adults and unemployed adults) and, in the case of older adults, estimates for new groups not represented in the first round of PIAAC. These original 2012 data have been updated, reweighted, and revised in the PIAAC 2012/14 household public-use and restricted-use datasets. The 2012/2014 dataset should be used for analyses instead of this original version unless you are seeking to reproduce historic analyses.
Click here to access the PIAAC U.S. 2012 public data files ›
Click here to view information on the PIAAC U.S. 2012 restricted data files page ›
Synthetic Restricted Data Files: The U.S. PIAAC synthetic restricted use file (S-RUF) is available to researchers outside the U.S. to prepare computer code for the analysis of PIAAC data on the U.S. household restricted use file (RUF). The U.S. household RUF contains some variables that are not included in the US international household PUF as well as data for some international variables at greater levels of disaggregation than available in the PUF. The structure of the S-RUF is similar to that of the actual household restricted use file (RUF) to allow researchers to develop and test computer code for analytical routines. As the data on the S-RUF is synthetic, no conclusions can be drawn from the output generated.
Researchers wishing to run analysis on the actual U.S. household RUF are able to submit their SAS, SPSS, or Stata code to piaac@air.org, where the requested analyses will be run using on the real U.S. household RUF. The output will undergo a confidentiality review and will be returned to the researcher after approval.
Click here to access the PIAAC U.S. synthetic restricted data files ›
International Data Files and Resources
Public Data Files: The international PIAAC database can be found on the "Public data and analysis" page of the international OECD PIAAC website . Resources available include:
PIAAC international database (in SPSS,SAS and CSV formats)
International codebook and data compendia
International technical report (Round 1,Round 2, Round 3)
Click here to access the PIAAC international data files ›
Australia Public Data Files: To access the Australian Public Use File, please write to microdata.access@abs.gov.au
Note to users: The OECD released updated public-use data files for Round 1 countries and the data for Round 2 countries (released June 28, 2016). The data updates include the rescaling of the skills use indices and the recalculation of the derived earning variables. A detailed list of the updated variables can be found here.
Trend Data Files and Resources
IALS (1994-1998), ALL (2003-2008)
Rescaled public-use micro data files from the International Adult Literacy Survey (IALS) and the Adult Literacy and Lifeskills Survey (ALL) are available from Statistics Canada. These data files have rescaled literacy scores (IALS and ALL) and rescaled numeracy scores (ALL) that can be used in trend analysis with PIAAC scores. The rescaled public-use files for IALS and ALL that include data from all of the countries that participated in these surveys can be requested from Statistics Canada by emailing fe-education@statcan.gc.ca. The request should specify "rescaled public-use data files for IALS (1994-1998) and ALL (2003-2008)". The files will be available for download after one signs a license.
International Log Files
Log Files (PIAAC 2012 Round 1 countries)
The Log Files contain a record of the interactions between respondents and the PIAAC computer testing application during the course of the assessment. They are useful for better understanding test-taking behavior and the strategies and processes followed by respondents in responding to test items. Individual records can be matched with corresponding background and cognitive response data available in the PIAAC PUFs using the SEQID variable. Resources available include:
Public use files containing log data from the PIAAC cognitive assessments
Full documentation of the contents of the files
Customized data analysis tool, the PIAAC LogDataAnalyzer
PIAAC log file data is available for 18 countries that participated in the first round of the PIAAC study (PIAAC 2012) including the United States.