Allegheny County Data Warehouse

Allegheny County Department of Human Services (DHS) Allegheny County Jail Allegheny County Adult Probation Allegheny County Juvenile Probation Allegheny County Medical Examiner’s Office Housing Authority of the City of Pittsburgh Allegheny County Housing Authority Pittsburgh Public School District Clairton City School District Duquesne School District Penn Hills School District Propel Charter Schools Magisterial District Courts and the Court of Common Pleas in Allegheny County Pennsylvania Department of Human Services Pennsylvania Department of Labor and Industry

Client-level demographic, service, and provider information across multiple DHS service areas e.g., aging, child welfare, intellectual disabilities, mental health, and homelessness.

Unit of Observation:
Individual, provider, and facility-level
Personally Identifiable Information Available for Linking:
Allegheny County, Pennsylvania, United States of America
Years Available:
Depends on data source (refer to “Data From Date” in the data dictionary) but the general availability of data for the DHS service areas are: Aging: 2014 – present; Child welfare: 2002 – present; Criminal justice 2007 – present; Housing: 2013 – present
Frequency of Updates:
The frequency differs by data source, between 1 to 12 months for most while some are unknown; specifics can be found under “Data Lag” in the data dictionary

Individuals and families using services and supports provided within Allegheny County


Allegheny County Data Warehouse data are available to researchers upon request. The Allegheny County Data Warehouse pulls together client and service data from sources internal and external to the county and is managed by the Allegheny County Department of Human Services (DHS). Some data in the Data Warehouse from external sources (e.g., school districts, emergency services, and the Department of Labor and Industry) are not available to request through DHS. For data that DHS is not authorized to release, researchers would have to reach out to the source directly to obtain permission and receive the data from them.  The full list of data sources is presented in the appendix of the Data Warehouse Documentation.

For data that DHS is authorized to release, researchers may request de-identified data, limited data (contains two or fewer of the 18 Health Insurance Portability and Accountability Act (HIPAA)-defined identifiers), or identified data (contains more than two HIPAA-defined identifiers). Researchers must submit a research/data request form, as well as a copy of institutional review board (IRB) approval and protocol if requesting for identified data, to the DHS privacy board at the email [email protected]. Research requests without IRB approval, will require approval from the DHS’ Department Review Board. The DHS privacy board may grant exemptions to the IRB requirement on a case-by-case basis. Researchers can also choose to include a data extract template as an example of how they would like the requested data to be structured.

If the request is approved, researchers must complete and submit a data sharing confidentiality agreement, also known as a Data Use Agreement (DUA). Researchers will then be contacted about the data transfer mechanism and completion timeline.  Researchers may be asked to provide a write up about their study and findings for DHS. These write ups would be posted on DHS’ public website.

Timeline for Access

It usually takes 30 days for the review of the application, after which the DHS privacy board will respond by email with further questions or its decision. After the request has been approved and the data sharing agreement has been completed, the signature process at the county can take a few weeks up to a couple of months, depending on the time of the year. This process may take longer if the researcher’s institution provides their own language or agreement, which will need to be reviewed by the county’s legal department. The data will then be sent as soon as it is ready and the data sharing agreement is signed. Researchers may also choose the frequency at which to receive the data (ad-hoc, daily, weekly, or monthly).

Any data that came from the Allegheny County DHS must be destroyed or returned after 14 days of the data sharing confidentiality agreement ending, and DHS must be notified.

Lag Time

The lag time for records to be updated vary by data source. In-house DHS services, e.g., child welfare, are updated daily, while data from some external partners are updated on a quarterly basis. See the “Data Lag” column in the data dictionary for specific details.


There is generally no fee associated to access the data unless the data request is part of a grant or contract with the DHS which specifies a fee.


DHS will link a cohort of study participants provided by the researcher to the clients in the DHS’ Data Warehouse. Researchers can also request for identified data and for specific identifiers to be included. DHS generally allows researchers to link warehouse data to external datasets through this process. 

Identifiers Available for Linking

  • Name
  • Geographic subdivision (available in limited data set)
  • Date, such as date of birth (available in limited data set)
  • Telephone number
  • Fax number
  • Electronic mail address
  • Social Security number
  • Medical record number
  • Health plan beneficiary number
  • Internet Protocol (IP) address numbers
  • Biometric identifier
  • Full face photographic images
  • Other HIPAA identifiers such as account number, certificate / license number, vehicle identifier, device identifier, web Universal Resource Locators (URLs)

Data Contents

The Allegheny County Data Warehouse has data from other Allegheny County government departments (e.g., Housing Authority, Jail, Probation Office, Health Department, Medical Examiner), as well as non-Allegheny County government entities (e.g., emergency services, school districts, and the Department of Labor and Industry). The Data Warehouse contains data on clients, services, providers, and facilities across multiple DHS programmatic service areas: Aging, Children, Youth and Families, Homelessness/Housing, Drug/Alcohol, Early Intervention, Independent Living, Intellectual Disabilities, and Mental Health. See the data dictionary for more information. Details about the programs and program areas can be found on the Quick Count Tool page.

Partial List of Variables

Client information: demographic information, program involvement, services received

Provider information: name, location, type of providers, and services delivered

Facility information: services offered, services rendered, cost of services

Other Documentation

Klitzmiller, Erika M., 2013. “IDS Case Study: Allegheny County’s Data Warehouse: Leveraging Data to Enhance Human Service Programs and Policies.” Actionable Intelligence for Social Policy (AISP), University of Pennsylvania.

Last reviewed