Administrative Data Sets

Catalog of Administrative Data Sets

Administrative data are information collected, used, and stored primarily for administrative (i.e., operational), rather than research, purposes. These data can be an excellent source of information for use in research and impact evaluation. Specific requirements and processes for obtaining data vary by data source. To assist researchers in screening potential data sources, J-PAL North America has cataloged a number of key US data sets. The catalog documents procedures on how to access data based on information provided by the originating agencies.

All of J-PAL North America's current resources related to administrative data can be found here



American Medical Association (AMA), Division of Health Solutions Data Management (HSDM)

Current and historical data on all physicians in the US, Puerto Rico, Virgin Islands, and certain Pacific Islands.


California Office of Statewide Health Planning & Development (OSHPD)

Clinical care and patient records data from licensed general acute care hospitals, emergency departments, and ambulatory care surgery centers in California. Includes inpatient discharge and outpatient encounter information.


Chicago Public Schools (CPS) Office of Performance

Student- and staff-level information collected by CPS, including Illinois Standards Achievement Test (ISAT) scores, gender, and race.


Colorado Department of Public Health and Environment (CDPHE), Registries and Vital Statistics Branch (RVSB)

Individual-level vital records data from birth and death certificates in Colorado.

Crime, Violence, & Conflict

Illinois Criminal Justice Information Authority (ICJIA), Research & Analysis Unit, Illinois State Police (ISP)

Individual-level data on the arrest cycle, including misdemeanors and subsequent formal events in the criminal justice system, since the 1960s.

Labor Markets

Illinois Department of Employment Security (IDES)

Information collected from individuals, employers, and workforce service providers in the course of administering unemployment insurance programs and benefits in the state of Illinois.


LAUSD Office of Data and Accountability, Research and Reporting Branch

Student- and staff-level information collected by LAUSD, including standardized test scores, English Language Development level, school and classroom placement, and course enrollment.


Massachusetts Department of Elementary and Secondary Education (ESE), Office of Planning and Research (OPR)

To ensure compliance with state and federal education laws, ESE maintains a variety of student-level data sets covering all students who receive public funds for elementary and secondary education in Massachusetts. Data include student academic performance (including standardized test scores), English proficiency, behavioral records, attendance, and post-graduation plans, along with additional information on public school students and teachers.


Centers for Medicare and Medicaid Services (CMS), Office of Information Products and Data Analytics (OIPDA)

Enrollment and claims data for persons enrolled in Medicaid and the Children’s Health Insurance Program (CHIP) in all 50 states and DC. States submit data to CMS via the Medicaid Statistical Information System (MSIS), but these data are difficult to use for research because they contain a mixture of time periods. CMS therefore derives the MAX files from the MSIS data; these files are organized by calendar year and comprise both fee-for-service and managed care claims data.


Centers for Medicare and Medicaid Services (CMS)

Claims data from Medicare Parts A and B, prescription drug data from Part D, beneficiary information, and cost reports.