| DXn - Diagnosis |
| Documentation Sections: |
| General Notes |
| Uniform Values |
| State Specific Notes |
| General Notes | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
In the HCUP inpatient databases, the first listed diagnosis (DX1) is the principal diagnosis. In the HCUP outpatient databases, the first listed diagnosis (DX1) may not be the principal diagnosis; it may just be the first listed diagnosis on the record. In practice, ICD-9-CM diagnoses are represented by 3- to 5-character codes with explicit decimals. In the HCUP data files, ICD-9-CM diagnoses are represented as 5-character alphanumeric codes with implicit decimals. Examples are given below (alphanumeric codes are enclosed in quotation marks):
For proper handling of diagnosis codes:
The original value of the first listed diagnosis (DX1), whether blank or coded, is retained in the first position of the diagnosis vector. Starting at the first secondary diagnosis (DX2), the diagnoses are shifted during HCUP processing to eliminate blank secondary diagnoses. For example, if DX2 and DX4 contain nonmissing diagnoses and DX3 is blank, then the value of DX4 is shifted into DX3. Secondary diagnoses are never shifted into the first listed position (DX1). Prior to 2003, E-codes are included in the diagnosis array (DXn). Beginning in 2003, any separately reported E-codes and any E-codes encountered in the diagnosis array are placed in a separate array specific to E codes (ECODEn). Diagnoses are compared to a list of ICD-9-CM codes valid for the discharge date. Anticipation of or lags in response to official ICD-9-CM coding changes are permitted for discharges occurring within a window of time around the official ICD-9-CM coding changes (usually October 1). In the data prior to 1998, a six months window (three months before and three months after) is allowed. Beginning in the 1998 data, a year window (six months before and six months after) is allowed. For example, the code for Single Liveborn changed from "V300 " to "V3000" as of October 1, 1989. Under HCUP validation procedures, "V300 " is classified as valid for discharges on December 31, 1989, and "V3000" is classified as valid for discharges on July 1, 1989. If the diagnosis is not left justified, contains intermittent blanks, or is zero filled, then the diagnosis will be invalid. Diagnoses are compared to the sex of the patient (EDX03 beginning in the 1998 data and ED1nn prior to 1998) and the patient's age (EAGE04 and EAGE05 beginning in the 1998 data and ED3nn and ED4nn prior to 1998) for checking the internal consistency of the record. How invalid and inconsistent codes are handled varies by data year.
The validity flags (DXVn) need to be used in connection with any analysis of the diagnoses (DXn). The maximum number of diagnoses reported varies by state. HCUP retains all diagnosis fields provided by the data source.
Since on the NIS the number of diagnoses coded on the discharge (NDX) can be greater than the number of diagnoses available on the inpatient record, caution needs to be taken when using NDX to loop through the diagnoses. A counter for the loop should not extend past 25 (15 prior to the 2009 NIS). Programming code such as the following example SAS statement is needed to take this into account:
DO I = 1 to MIN(25,NDX);Followed by code to process all diagnoses.END; |
||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Top | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Uniform Values | ||||||||||||||
|
||||||||||||||
| Top | ||||||||||||||
| State Specific Notes | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
|
Arizona Beginning in 2008, Arizona reports six "cause of injury" E-codes in separate fields. From 2004-2007 Arizona reports five "cause of injury" E-codes in separate variables. Arizona reports some diagnosis codes with an explicit decimal point. The decimal point was removed during HCUP processing. Beginning with 1995 discharges, Arizona reports two "cause of injury" E-codes in separate variables. During processing of the 1995 to 2002 data, these E-codes are placed after the last non-missing diagnosis code if they are not already recorded as a secondary diagnosis. Beginning in 2003, the separately reported E-codes, and any E-codes encountered in the diagnosis variables, are placed in a separate array specific to E codes (ECODEn). California HIV Test Result Diagnoses California law prohibits the release of HIV test results in patient-identifiable form to any outside party without the patient's consent. Therefore, records that include certain ICD-9-CM codes that indicate HIV test results were not included in the data supplied for HCUP. California eliminated all occurrences of these codes from the diagnosis fields and packed the diagnosis vectors to cover gaps from such removals. The following ICD-9-CM codes were affected:
HIV-related diagnoses 042.x and 043.x were unaffected. The number of such diagnoses eliminated from the principal diagnosis position will be smaller than it otherwise might have been due to a practice in California that actively discourages the reporting of codes for HIV test results (044.x, 795.8, 795.71, and V08) as a principal diagnosis. During data editing, California flags discharges reporting one of these codes in the principal diagnosis position and then calls the submitting hospital to ask if the principal diagnosis should be changed. Hospitals have the option of deleting the code, changing it, or leaving it in place. Shriner's Hospitals Shriner's hospitals do not report diagnoses, procedures or total charges. Psychiatric Diagnoses Prior to 1995, some hospitals reported psychiatric diagnoses in DSM III which California then converted into ICD-9-CM diagnosis codes. The ICD-9-CM diagnosis codes are included in the HCUP database. From 1995-1998, some psychiatric hospitals began submitting data for primary diagnosis according to DSM IV criteria. DSM IV codes are indistinguishable in appearance from ICD-9-CM codes but have substantially different meanings. Because of similarities in the coding structure, the source was unable to convert the DSM IV codes to ICD-9-CM codes. DSM IV codes may occur in the HCUP data. Psychiatric hospitals may be included in the California data; no documentation was available on the use of DSM IV codes in psychiatric units of acute care hospitals. Beginning in 1999, DSM psychiatric codes are not accepted by OSHPD and are not present in the HCUP databases. E-Codes Beginning with 1990 discharges, the source reports five "cause of injury" E-codes as separate variables. During processing of the 1990 to 2002 data, E-codes were placed after the last non-missing diagnosis code. Beginning in 2003, the five separately reported E-codes, and any E-codes encountered in the diagnosis variables, are placed in a separate array specific to E codes (ECODEn). California does not require the reporting of E-codes in the range E870-E879 (misadventures and abnormal reactions). Hawaii Hawaii reports one "cause of injury" E code as a separate data element. Prior to 2003, During HCUP processing, this E code was placed after the last non-missing diagnosis code during HCUP processing. Beginning with 2003 HCUP processing, this E code is placed at the beginning of the separate HCUP E-code array (ECODEn). Illinois Illinois supplied diagnosis codes in a field of length 6. Only the first five characters contained in the left-justified source field were used to assign the HCUP diagnosis codes. Iowa Beginning in data year 2001, the Iowa Hospital Association prohibits the release of two types of discharges:
These discharges were not included in the source file provided to HCUP and are therefore not included in the HCUP files. Beginning in 1994, Iowa reports one "cause of injury" E-code. Beginning in 1998, Iowa added one "place of injury" E-code. During processing of the 1994 to 2002 data, these separately reported E-code variables are placed at the end of the diagnosis vector; since the vector is packed during processing to remove blanks, the position of the E-code for a specific discharge depends on the number of diagnoses reported. Beginning in 2003, the two separately reported E-codes, and any E-codes encountered in the diagnosis variables, are placed in a separate array specific to E codes (ECODEn). Kentucky Prior to 2002, Kentucky reports one "cause of injury" E-codes as a separate variable. Beginning in June 2002, Kentucky reports two separate E-code fields. Prior to 2003, during HCUP processing, these separately reported E-codes were placed after the last non-missing secondary diagnosis. Beginning in 2003, the two separately reported E-codes, and any E-codes encountered in the diagnosis variables, are placed in a separate array specific to E codes (ECODEn). Beginning 2008, a third E-code was provided and placed into the E-cdoe arrays. Kentucky supplied diagnosis codes in a field length of 7. Only the first five characters contained in the left-justified source field were used to assign the HCUP diagnosis codes. Maryland Maryland reports one "cause of injury" E-code as a separate variable. Prior to 2003, during HCUP processing, this separately reported E-code was placed after the last non-missing secondary diagnosis. Beginning in 2003, the separately reported E-code, and any E-codes encountered in the diagnosis variables are placed in a separate array specific to E codes (ECODEn). Maryland supplied diagnosis codes in a field of length 7. Only the first five characters contained in the left-justified source field were used to assign the HCUP diagnosis codes. The last secondary diagnosis field on the source data was 9-filled instead of blank when no diagnosis was coded. During HCUP processing, the 9-filled diagnosis was set to blank. Massachusetts Beginning in 1993, Massachusetts reported one "cause of injury" E-code. Prior to 2003, during HCUP processing, the separately reported E-code was placed after the last non-missing secondary diagnosis. E-codes can appear in other secondary diagnosis codes. Beginning in 2003, the separately reported E-code, and any E-codes encountered in the diagnosis variables, are placed in a separate array specific to E codes (ECODEn). Minnesota Minnesota reports one "cause of injury" E-codes in a separate variable. Prior to 2003, during HCUP processing, this E-code was placed after the last non-missing diagnosis code. Beginning in 2003, the separately reported E-codes, and any E-code encountered in the diagnosis variables are placed in a separate array specific to E codes (ECODEn). Nebraska The Nebraska Hospital Association prohibits the release of discharge records for patients with HIV diagnoses. These discharges were not included in the source file provided to HCUP and are therefore not included in the HCUP files. Nebraska reports one "cause of injury" E-code in a separate variable. Prior to 2003, during HCUP processing, this E-code was placed after the last non-missing diagnosis code. Beginning in 2003, the separately reported E-code, and any E-code encountered in the diagnosis variables are placed in a separate array specific to E codes (ECODEn). New Hampshire New Hampshire reports one "cause of injury" E-code as a separate variable. The separately reported E-code and any E-codes encountered in the diagnosis variables are placed in a separate array specific to E codes (ECODEn). New Jersey Beginning with 1993 discharges, New Jersey reports one "cause of injury" E-codes as a separate variable. Prior to 2003, during HCUP processing, this E-code was placed after the last non-missing diagnosis code. Beginning in 2003, the separately reported E-code, and any E-codes encountered in the diagnosis variables are placed in a separate array specific to E codes (ECODEn). Before 1994, the diagnosis codes provided by the state were right-padded with zeros (e.g., the diagnosis code '436' was supplied as '43600'). For the HCUP database the following algorithm was used to validate the diagnosis codes: Check the five-digit code for validity (using a six-month window for coding changes, 3 months before and 3 months after October of each year when ICD-9-CM coding changes occur).
New Jersey In 1993 only. An error in HCUP processing caused invalid five-digit codes that ended in non-zeros, as well as zeros, to be processed by the above algorithm. If deleting the rightmost non-zero digits created a valid code, then
New York Beginning in 2008, New York suppressed the identifier for the hospital (DSHOSPID) on records with an indication of induced abortion. These records are retained in the HCUP SID with the DSHOSPID set to "BLNK". New York identifies an indication of induced abortion by ICD-9-CM diagnosis or procedure code:
Beginning in 1993, New York reports "cause of injury" and "place of injury" E-codes. Prior to 2003, during HCUP processing, these separately reported E-codes were placed after the last nonmissing secondary diagnosis. When a "cause of injury" E-code in the range of E850.0-E869.9 or E880.0-E928.9 was reported, then a "place of injury" E-code was also reported. If the hospital stay involved the possibility of classifying more than one situation or event, only the single cause of injury, poisoning, or adverse effect that was most severe was reported. Beginning in 2003, the separately reported E-codes, and any E-codes encountered in the diagnosis variables, are placed in a separate array specific to E codes (ECODEn). North Carolina North Carolina supplied diagnosis codes in a field length of 6. Only the first five characters contained in the left-justified source field were used to assign the HCUP diagnosis codes. North Carolina supplied diagnosis codes in a field length of 6. Only the first five characters contained in the left-justified source field were used to assign the HCUP diagnosis codes. North Carolina North Carolina supplied diagnosis codes in a field length of 6. Only the first five characters contained in the left-justified source field were used to assign the HCUP diagnosis codes. Oregon Prior to 1998, Oregon reports one "cause of injury" E-code as a separate variable. Between 1998 and 2002, Oregon reported two "cause of injury" E-codes. During HCUP processing, these separately reported E-codes are placed after the last non-missing secondary diagnosis. Beginning in 2003, the two separately reported E-codes, and any E-codes encountered in the diagnosis variables, are placed in a separate array specific to E codes (ECODEn). Oregon supplied diagnosis codes in a field of length 6; beginning in 2008, the field length became 7. Only the first five characters contained the diagnosis code and were used to assign the HCUP diagnosis codes. Pennsylvania Beginning with 1993 discharges, Pennsylvania reports one "cause of injury" E-code as a separate variable. Prior to 2003, during HCUP processing, this E-code was placed after the last non-missing diagnosis code. Beginning in 2003, the separately reported E-code, and any E-codes encountered in the diagnosis variables are placed in a separate array specific to E codes (ECODEn). Some of the diagnosis codes in the 1989 Pennsylvania data that were flagged as invalid (DXV=1) appear to be valid codes. These diagnosis fields have four digits followed by a fifth digit that is an unprintable null character. The presence of the null character invalidates these otherwise valid diagnosis codes. Only the 1989 Pennsylvania data are affected. The following list includes all diagnosis codes in the 1989 Pennsylvania data that are valid ICD-9-CM codes but are flagged as invalid because they include null characters.
Rhode Island Rhode Island reports one "cause of injury" E-code in a separate variable. Prior to 2003, during HCUP processing, this E-code was placed after the last non-missing diagnosis code. Beginning in 2003, the separately reported E-code, and any E-codes encountered in the diagnosis variables are placed in a separate array specific to E codes (ECODEn). South Carolina Prior to 2000 data, a small number of discharges explicitly included decimals in the diagnosis field, usually the decimal is implicit. This is problematic because South Carolina supplied diagnoses in a field of length 5. If decimals were included, then a valid 5-digit code would be truncated. For example, the diagnosis for unspecified sickle cell anemia "28260" would be incorrectly reported as "262.6". Prior to 1998, invalid diagnosis codes are marked by a validity flag (DXVn = 1). Beginning in 1998, invalid diagnosis codes are masked (Dxn = "invl"). Beginning in 2000 data this was no longer a problem; explicit decimals were not included in the diagnosis codes. South Carolina E-codes
South Carolina does not require the reporting of E-codes in the range E870-E879 (misadventures and abnormal reactions). South Dakota South Dakota separately reports one "cause of injury" E-code and one "place of injury" E-code. Prior to 2003, during HCUP processing, these E-codes were placed after the last non-missing diagnosis code. Beginning in 2003, the separately reported E-code, and any E-codes encountered in the diagnosis variables, are placed in a separate array specific to E codes (ECODEn). Tennessee Tennessee reports one "cause of injury" E-code as a separate variable. Prior to 2003, during HCUP processing, this E-code was placed after the last non-missing diagnosis code. Beginning in 2003, the separately reported E-code and any E-codes encountered in the diagnosis variables are placed in a separate array specific to E codes (ECODEn). Texas Texas provides five "cause of injury" E-code as a separate variable. Prior to 2003, during HCUP processing, this separately reported E-code was placed after the last non-missing secondary diagnosis. Beginning in 2003, the separately reported E-code, and any E-codes encountered in the diagnosis variables are placed in a separate array specific to E codes (ECODEn). Texas supplied diagnosis codes in a field length of 6. Only the first five characters contained in the left-justified source field were used to assign the HCUP diagnosis codes. Utah Utah reports one "cause of injury" E-code as a separate variable. Prior to 2003, during HCUP processing, this E-code was placed after the last non-missing diagnosis code. Beginning in 2003, the separately reported E-code, and any E-codes encountered in the diagnosis variables, are placed in a separate array specific to E codes (ECODEn). Virginia Virginia reports one "external injury code" E-code as a separate data element. Prior to 2003, during HCUP processing, this E-code was placed after the last non-missing diagnosis code. Beginning in 2003, the separately reported E-code, and any E-code encountered in the diagnosis variables are placed in a separate array specific to E codes (ECODEn). Vermont Vermont reports one "cause of injury" E-code as a separate variable. Prior to 2003, during HCUP processing, this separately reported E-code was placed after the last non-missing secondary diagnosis. Beginning in 2003, the separately reported E-code and any E-codes encountered in the diagnosis variables are placed in a separate array specific to E codes (ECODEn). Vermont supplied diagnosis codes in a field of length 6. Only the first five characters contained in the left-justified source field were used to assign the HCUP diagnosis codes. Vermont Due to an error in HCUP processing, the separate "cause of injury" E-code was not included in the 2001 Vermont HCUP databases. This affects the four Vermont hospitals in the 2001 NIS. The four Vermont hospitals have 15,379 discharges (0.2% of the 2001 NIS ). We estimate that 21% of the Vermont discharges (approximately 3,200 discharges) are missing an E-code. Beginning in 2002, the separate "cause of injury" E-code is included in the HCUP databases. Vermont In 2001, expect secondary diagnosis DX16 - DX21 to be blank. No more than 15 diagnoses were provided by the data source. It is possible that none of the discharges have all of the diagnosis fields coded. The Vermont inpatient and outpatient source files come to HCUP in the same layout. To simplify HCUP processing, the number of diagnosis fields on the Vermont HCUP SID and SASD is the same. Washington Washington reported diagnosis codes in a field of length 6 for 1988-1992 and, beginning in 1993, in a field of length 7. Only the first five characters contain the diagnosis code and were used to assign the HCUP diagnosis code. In 1988, Washington did not report "cause of injury" E-codes. From 1989-1992, Washington reports two "cause of injury" E-codes. Beginning in 1993, Washington reports only one "cause of injury" E-code. Prior to 2003, during HCUP processing, any separately reported E-code was placed after the last non-missing secondary diagnosis. Beginning in 2003, the separately reported E-code, and any E-codes encountered in the diagnosis variables, are placed in a separate array specific to E codes (ECODEn). West Virginia West Virginia reports one "cause of injury" E-codes as a separate variable. Prior to 2003, during HCUP processing, are placed in a separate array specific to E codes (ECODEn). Beginning in 2003, the separately reported E-codesand any E-codes encountered in the diagnosis variables are placed in a separate array specific to E codes (ECODEn). West Virginia supplied diagnosis codes in a field length of 6. Only the first five characters contained in the left-justified source field were used to assign the HCUP diagnosis codes. Wisconsin To comply with statutory requirements, Wisconsin modified diagnosis and procedure codes that explicitly referenced induced termination of pregnancy to eliminate distinctions between induced and spontaneous termination. The following codes were modified:
Wisconsin reports one "cause of injury" E-code. Prior to 2003, during HCUP processing, this separately reported E-code was placed after the last non-missing secondary diagnosis. Beginning in 2003, the separately reported E-code, and any E-codes encountered in the diagnosis variables, are placed in a separate array specific to E codes (ECODEn). |
|||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Top | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| Internet Citation: HCUP NIS Description of Data Elements. Healthcare Cost and Utilization Project (HCUP). September 2008. Agency for Healthcare Research and Quality, Rockville, MD. www.hcup-us.ahrq.gov/db/vars/dxn/nisnote.jsp. |
| Are you having problems viewing or printing pages on this Website? |
| If you have comments, suggestions, and/or questions, please contact hcup@ahrq.gov. |
| Last modified 9/17/08 |