Division of Health Examination Statistics
ROBERT S. MURPHY, Director
JEAN ROBERTS, Chief, Medical Statistics Branch
SIDNEY ABRAHAM, Chief, Nutritional Statistics Branch
KURT R. MAURER, Chief, Survey Planning and Development Branch

Division of Data Services
PHILLIP R. BEATTIE, Director
HENRY W. MILLER, Chief, Health Examination Field Operations Branch

Office of Research and Methodology
JAMES T. MASSEY, Ph.D., Chief, Survey Design Staff

Under the legislation establishing the National Health Survey, the Public Health Service is authorized to use, insofar as possible, the services or facilities of other Federal, State, or private agencies.

In accordance with specifications established by the National Center for Health Statistics, the U.S. Bureau of the Census participated in the design and selection of the sample and carried out the household interview stage of the data collection and certain parts of the statistical processing.

The Center for Disease Control acted as laboratory consultants and performed a series of biochemical, hematological, and serological assessments on blood specimens of persons participating in the survey.

Library of Congress Catalog Card Number 80-607914

Contents

Introduction
Planning process
Summary statement of data collection techniques
Questionnaires
Examination by physician
Special clinical procedures and tests
X-rays
Urine tests
Tests on blood samples
Nutritional status assessments
Detailed health examination
Major new target conditions
Other important target conditions
Sample design for NHANES II
Design specifications
Definition and stratification of primary sampling units
Formation of superstrata in NHANES II
Selection of sample locations
Selection of housing units within sample locations
Selection of sample persons
Operational plan
Stand sequencing and scheduling
Advance contacts and logistics
Household interviewing and appointment process
Examination center and staff
Examination process and medical reports
Quality control
Pilot testing
Plans for analysis and publication of data
References

An example of a sample person selection sheet used in the National Health and Nutrition Examination Survey, 1976-80
Mobile examination center

List of Text Tables

Number and population of National Health Interview Survey (NHIS) strata before and after subdivision of self-representing primary sampling units, by type of stratum and National Health and Nutrition Examination Survey region
Variables in final stepwise regression model, by region
Correlation matrix for health and sociodemographic variables
Variables used for stratification in the National Health and Nutrition Examination Survey, by region
Definition of control classes used for the selection of primary sampling units, by region: National Health and Nutrition Examination Survey, 1976-80
Expected and actual number of sample primary sampling units (PSU's) within control classes, by region and type of stratum
Primary sampling units, stand sites, and percent of persons examined, by region: National Health and Nutrition Examination Survey, 1976-80

Plan and Operation of the Second National Health and Nutrition Examination Survey,
1976-80

by Arthur McDowell, formerly with Division of Health Examination Statistics; Arnold Engel, M.D., Division of Health Examination Statistics; James T. Massey, Ph.D., Office of Research and Methodology; and Kurt Maurer, Division of Health Examination Statistics

Introduction

The second National Health and Nutrition Examination Survey is another in a series of related programs carried out over the past 20 years by the National Center for Health Statistics. These programs, authorized by Congress under the National Health Survey Act of 1956, are characteristically national in scope, based on probability sampling, and used to collect a broad range of morbidity data and related health information. The essential differentiating characteristic of the health examination surveys is their primary concern with those kinds of health-related data obtained only (or at least optimally) from specially standardized direct medical examinations, including tests and other procedures used in clinical practice. Such examinations given to persons selected in the scientific sample permit estimates of the prevalence of specifically defined diseases in the U.S. population, including cases not previously identified. They also permit estimation of the distribution within the population of a broad variety of health-related measurements, including not only physical measurements such as height, weight, and various skinfolds, but also physiological measurements, such as diastolic blood pressure and serum cholesterol level, and psychological measurements.

During the years 1959-76, the National Center for Health Statistics (NCHS) conducted four separate examination surveys. The first of these, the National Health Examination Survey, Cycle I (NHES I), focused on the prevalence of selected chronic diseases in civilian noninstitutionalized U.S.
adults aged 18-79.¹ The next two surveys, which were conducted from July 1963 through March 1970, were largely devoted to the growth and development of children 6-11 years of age (the National Health Examination Survey, Cycle II, or NHES II)² and youths 12-17 years of age (the National Health Examination Survey, Cycle III, or NHES III).³

The fourth survey introduced a new emphasis. In 1969 the Department of Health, Education, and Welfare established within NCHS a continuing activity to measure the nutritional status of the U.S. population and to monitor changes in status over time. After careful study by an NCHS task force, it was decided to combine the proposed national nutrition surveillance survey with the existing National Health Examination Survey in order to enhance the performance of each component and to permit relating nutritional variables to health measures. The resultant survey is known as the National Health and Nutrition Examination Survey, or NHANES.

The first segment of NHANES (the National Health and Nutrition Examination Survey, or NHANES I) was conducted from 1971 through 1974.⁴ An assessment of nutritional status was made on a representative sample of the U.S. population aged 1-74 years, and a detailed examination was given to a subsample aged 25-74 years. This segment of the NHANES I program was followed by a 14-month period (1974-75) in which an additional national sample of persons 25-74 years of age was given the detailed examination, to augment the size of the sample originally included in NHANES I (referred to as the National Health and Nutrition Examination Survey, Augmentation Cycle, or NHANES IA).⁵

Data collected in successive surveys have been published in more than 100 separate publications⁶ and have also been made available on computer tapes for further study.⁷ The reports serve a broad spectrum of uses:

• They provide estimates of the prevalence of characteristics or conditions.
• Normative or descriptive data permit the monitoring or measurement of changes in health and nutritional status over time through successive assessment surveys.
• Problems of possible public health importance can be identified.
• Study of the interrelationship of health and nutritional variables in the general population is made possible.

Planning process

The continuing responsibility for measuring and monitoring the nutritional health status of the U.S. population meant that the first assessment survey, NHANES I, would be followed by later assessment surveys. These would permit comparisons with the NHANES I baseline data and thereby allow measurements of changes over time. Thus, in a sense, the planning of the nutritional aspects of the second National Health and Nutrition Examination Survey, 1976-80 (NHANES II), began with NHANES I. Throughout the course of its operation there was an awareness of this. Constant consideration was given to procedures and content items in terms of whether they should be repeated in the succeeding survey. Then, too, the necessity for comparing NHANES II data with those from NHANES I required that some of the same measurements be made in the same way and on the same age segment of the U.S. population in both surveys. The complex process of planning the NHANES II program began in a systematic way, however, only in mid-1974, about a year and a half before the survey was to begin operation.

The planning phase of a national health examination survey is critically important. The planning process used in the NHANES and predecessor surveys has been described in more detail elsewhere, but part of that statement deserves repeating here:

One aspect of planning is of prime importance, namely, specifying the survey's specific goals or substantive purposes. . . .
With respect to each element to be considered for inclusion in a health examination survey (for example, information on diabetes) the following questions should be answered by the appropriate personnel:

(i) How and for what purposes will the information be used? (Outlines of proposed analyses are desirable.)
(ii) What specific data are needed?
(iii) How can those data be obtained? (What specific tests, measures, and questionnaire items are needed, and what level of skill is required of examining personnel?)
(iv) Is the health examination survey the appropriate mechanism to get these data?
(v) Is the expected prevalence level consonant with the ability of the planned survey to determine it within reasonable confidence limits?
(vi) Can the entire process of obtaining these data be adequately standardized?
(vii) What cost factors are involved in equipment, laboratory work, skilled personnel, and so on?
(viii) Finally, if questions (i)-(vii) all are answered satisfactorily: What is the place of this particular data need in an ordered priority listing with other potential needs?

The appropriate personnel vary with the question asked. For example, for question (i), the head of a health planning agency would qualify, while for (iii) it might be an expert in the medical specialty involved.

In the USA the process of determining the conditions to be included in each health examination survey has been a multi-stage effort involving hundreds of institutions, organizations, and individuals. At the beginning a wide net is cast and opinions are sought from hundreds of health planners, health researchers, medical care providers, and health educators as to the kinds of data, appropriate to this type of survey, that are most needed. Important in this stage is the input from Federal Government agencies, particularly the various elements of the Department of Health, Education, and Welfare.
Further follow-up contacts are made with respect to some of the suggested items which seem to be reasonable prospects for inclusion, and information is obtained in greater detail so as to answer each of the questions listed in the preceding paragraph. This leads to further stages of consultation and perhaps to convening ad hoc meetings of experts in a particular field to assist in determining feasibility and relative priorities. In the end, decisions must be made at the level of the NCHS, but these must be approved at successive Governmental levels up to the Office of Statistical Policy within the Office of Management and Budget in the Executive Office of the President.⁸

The processes described in the foregoing paragraphs were the general pattern of the planning process carried out in 1974 and 1975 to determine the content and data goals of the NHANES II program. During this same time many related determinations had to be made concerning sample size and design, method of operation in data collection, quality control procedures, field staff retraining, pilot testing and pretesting, and further resultant modifications.

Although it has not been unusual for NCHS to collaborate with other Federal agencies in the planning, data collection, and analysis of previous National Health Examination Surveys, the level of collaboration involved in NHANES II was unprecedented:

• The Bureau of Laboratories, Center for Disease Control, served as a technical consultant for the planning and quality control of NHANES laboratory efforts, in addition to performing most of the health- and nutrition-related biochemistry and providing some of the funding for this effort.
• The National Institute of Arthritis, Metabolism, and Digestive Diseases, National Institutes of Health, supported the serum creatinine testing, the development of a glucose tolerance testing protocol, plasma glucose determinations at the Center for Disease Control, and processing of the data to make them more quickly available for analysis.
• The National Heart, Lung, and Blood Institute, National Institutes of Health, developed plans for assessing cholesterol, triglyceride, and high density lipoprotein (HDL) levels through the Lipid Research Clinic Laboratory at George Washington University, with the results processed at the Coronary Patient Registry at the University of North Carolina.
• The Office of Pesticides and Toxic Substances, Environmental Protection Agency, served as a technical consultant in collecting blood and urine specimens suitable for processing for residues and metabolites of certain pesticides. It processed the samples, monitored the quality of the processing, and coded the data in machine-readable form.
• The Bureau of Foods, Food and Drug Administration, supported the development of a serum ferritin assessment as part of the characterization of anemia. It also supported the measurement of blood lead levels at the Center for Disease Control.
• The Department of Energy supported Dr. Edward Radford at the University of Pittsburgh in his assessment of carboxyhemoglobin levels in blood. Randomly selected blind samples both from Dr. Radford's laboratory and from NCHS mobile examination centers were analyzed by accepted gas chromatographic procedures at the Naval Medical Research Institute, ensuring quality control and providing a reference standard.
• The Bureau of State Services, Center for Disease Control, made arrangements in each sample area for supplies and testing for gonorrhea.

The remaining sections of this report present the outcome of the planning with respect to the objectives of NHANES II.
They describe in more detail some of the reasons for the selections and go into details of the sample design and operational plan. The appendixes of this report contain listings of the examination components; blood and urine assessments; pesticide residue and metabolite determinations; staff participation in the planning, development, and operation of NHANES II; and data collection forms.

Summary statement of data collection techniques

The plan developed with respect to the content of NHANES II called for the following items.

Questionnaires

Household questionnaire. For each household member, this questionnaire included the family relationships; certain demographic items such as age, sex, and race; selected housing information; items such as occupation, income, and veteran status; and an indication of participation in food stamp programs.

Medical history questionnaires. For each sample person at ages 6 months to 11 years a questionnaire included items on birth weight, prematurity, developmental congenital conditions, medication, neurological conditions, lead poisoning, accidents, hospital care, disability, diarrhea, pica, vision, and a variety of chronic conditions. In addition, there were major sections on allergies, kidney and bladder disease, anemia, speech and hearing, lung and chest conditions, and participation in food programs.

Two questionnaires for each sample person at ages 12-74 years included items on medication; hospital care and tuberculosis; nutrition; a variety of acute and chronic diseases; tobacco, tea, and coffee usage; physical activity; weight; height; vision disability; exposure to pesticides; gastrointestinal problems; and for females, a menstrual and pregnancy history.
In addition, there were major sections on anemia, diabetes, respiratory conditions, hearing and speech, liver and gallbladder conditions, kidney and bladder disease, allergies, hypertension, cardiovascular conditions, stroke, arthritis (stressing middle and upper back and neck problems), and participation in food programs.

Two dietary questionnaires. For each sample person, a dietitian recorded the quantity of every item of food or drink consumed during the previous day, so that after computer calculation, the data yielded measures of calories, cholesterol, fat, unsaturated fats, protein, carbohydrates, and specific vitamins and minerals consumed during the recall period. A food frequency interview ascertained the usual pattern of food consumption, recording whether or not it included any foods in various groupings, including milk, meat, fish, eggs, fats and oils, legumes and nuts, cereals, fruits, vegetables, and alcoholic beverages. It also showed the reported daily and/or weekly number of times each food was consumed and noted the use of salt and of vitamin and mineral supplements.

Medications and vitamin usage. This elicited a history of the preceding week's usage of any medicines, vitamins, or minerals, for all examined persons.

Dietary supplement interview form. This form recorded the history of special diets, prior medications, and barriers to purchasing groceries or eating foods for examined persons aged 12-74 years.

Behavior questionnaire. This questionnaire elicited data on behavior possibly associated with coronary heart disease for examined persons 25-74 years of age.

Examination by physician

A physician performed and recorded a medical examination giving special attention to specified findings related to nutrition; hearing; the thyroid gland; and the cardiovascular, respiratory, neurological, and musculoskeletal systems.
Special clinical procedures and tests

A specially trained health technician carried out the following tests and procedures on examined persons in the designated age ranges.

Spirometry trials. These were digitized and recorded on magnetic tape for examined persons 6-24 years of age for various pulmonary function indicators such as forced vital capacity (FVC), forced expiratory volume in 1 second (FEV1), and peak flow rate.

Electrocardiograms. Digitized and recorded on magnetic tape for examined persons 25-74 years of age, electrocardiograms provided normative data on amplitudes and durations and permitted diagnostic interpretations of heart disease according to the Minnesota code.

Body measurements. The measurements made on examinees included standing height, body weight, triceps and subscapular skinfolds, and several others.

Pure-tone audiometry. This test, carried out on examined persons between the ages of 4 and 19, permitted determination of threshold levels of hearing for frequencies of 500, 1000, 2000, and 4000 Hertz for right and left ears.

Speech recording. This involved the use of a tape recording of the subject's repetition of specially developed sentences. It was carried out on examined persons between the ages of 4 and 6, permitting interpretations as an indication of problems with articulation and language development.

Allergy tests. These involved skin tests (the prick test) with eight common allergens (house dust, alternaria, cat fur, dog fur, ragweed, oak, rye grass, and Bermuda grass). The tests were made on examined persons between the ages of 6 and 74 to obtain degrees of skin reaction.

X-rays

For examined persons 25-74 years of age two X-rays were made. No X-rays were done on pregnant women, and no lumbar X-rays were done on women under 50 years of age.

X-ray of cervical and lumbar spine. This provided evidence of osteoarthritis and degenerative disc disease.
X-ray of chest. The chest X-ray was used in the diagnosis of respiratory diseases and served as a measure of left ventricular enlargement.

Urine tests

The following tests were performed on casual samples of urine.

N-Multistix tests. These urinary dipstick tests for qualitative protein, glucose, ketones, bilirubin, blood, urobilinogen, pH, and bacteriuria (nitrite test) were done for examined persons 6-74 years of age.

Urinary sediments. Sediments including red cells, white cells, and casts were measured for a subsample of examined adults 20-74 years of age.

Gonorrhea cultures. Cultures of urinary sediments were performed for male and female examined persons 12-40 years of age. However, of those females who received the glucose tolerance test (GTT), only those 20-24 years of age had the gonorrhea test performed.

Analyses for pesticide levels. Urine samples from a subsample of examined persons 12-74 years of age were tested for the presence of alkyl phosphate residues and metabolites, carbamate residues, phenolic compound residues, and malathion metabolites. Appendix III has a complete listing of the pesticide residues and metabolites tested for.

Tests on blood samples

Samples of blood provided a broad range of information related to health and nutrition. The particular tests performed varied with the specific target condition and age group (appendix II). The discussion of the development of the plan for NHANES II later in this report specifies the age groups and, in some instances, the subsampling pattern followed for each of the following tests.

Glucose tolerance test. This test involved the collection of blood specimens from examined persons while they were in a fasting state as well as at 1 and 2 hours after glucose challenge. The test was performed on a specified subsample of examined adults to provide estimates of the prevalence of diabetes.
Tests related to liver function. The postprandial liver bile acid test measured the ability of the liver to remove bile acids from the blood following consumption of a food preparation that induced the eventual addition of bile acids to the blood via contraction of the gallbladder. Biochemical liver tests performed included bilirubin, SGOT, and alkaline phosphatase.

Anemia-related laboratory tests. The tests made to diagnose anemia consisted of protoporphyrin, iron, total iron binding capacity (TIBC), zinc, copper, red cell folates, serum folates, serum ferritin, vitamin B12, and the determination of abnormal hemoglobin.

Other biochemical nutritional tests. These tests included albumin, vitamin A, and vitamin C.

Serum lipids. Because of their important relevance to cardiovascular disease, determinations were made of cholesterol, triglycerides, and high density lipoprotein (HDL).

Biochemical tests for body burden from environmental exposures. Determinations were made of the levels of lead and organochlorine pesticide residues and metabolites. Tests were also performed for carboxyhemoglobin, which reflects environmental exposure to carbon monoxide and the individual's smoking habits.

Hematology. The hematology included determinations of hemoglobin, hematocrit, red blood cell count, white blood cell count and differential analysis, and red blood cell morphology.

Kidney function. The only test for kidney function performed on blood samples was the serum creatinine test.

Syphilis. The serology determinations for syphilis included qualitative and quantitative ART, an FTA-ABS, and MHA-TP.

The foregoing list summarizes the content finally decided upon for inclusion in NHANES II. However, the planning process almost always involves a great deal of effort in connection with proposals that, for a variety of reasons, are not included in the final plan.
A few of the important components considered in the process of planning but deleted from the final NHANES II plan deserve to be noted. Two of the proposals that were seriously considered had to be deleted because of staff limitations or examination time. One of these would have involved administering a tuberculin skin test at the examination site with subsequent reading at the household; the other would have involved administration of a psychological schedule used in NHANES I, the General Well-Being Test. A third proposal involved completion of a questionnaire at the school attended by children and youth who were sample persons. In that case, considerations related to confidentiality and privacy, and the related clearance process, required more time than was available for their resolution. Finally, in the early stages of planning, consideration was given to including an extensive neurological component based on computer analysis of tape-recorded electroencephalograms. The main purpose would have been the provision of normative data on the distributions of the electroencephalogram variables in the general population and of some data on the prevalence of brain damage and related brain pathology. It was finally decided to drop this from NHANES II, with the possibility of considering it in a later program. A major factor in this decision was the recommendation by the National Institutes of Health advisory committee that reviewed the plan. While approving the general concept of such data collection and analysis, this group believed that the methodology available at the time was not appropriate for use in NHANES II.

Certain other components considered in planning but finally omitted from NHANES II are noted later in the detailed description in this report.
Nutritional status assessments

The basic purpose of the NHANES II program with respect to nutritional status assessment required that the program continue to use, with some modification, the same or essentially the same format as NHANES I. In order to monitor the nutritional status of the population, the data to be collected needed to be not only comparable, at least in considerable part, but also collected, as in NHANES I, from a probability sample of the civilian noninstitutionalized population of the United States. Again as in NHANES I, emphasis needed to be placed on the segments of the population classified as at or below the poverty level, the young children, and the aged, since these were assumed to be at special risk of having nutritional problems. These groups then would again be sampled at rates substantially higher than their proportions in the general population.

It is necessary, in order to assess nutritional status, to obtain data of four different types. The fourfold approach used in NHANES I and NHANES II involved the collection of information on dietary intake patterns along with the results of various hematological and biochemical tests, anthropometric measurements, and clinical assessments. The experience gained in the NHANES I program, however, made possible certain modifications of NHANES II in order to make the data obtained more useful while continuing to provide a considerable amount of comparable data for monitoring purposes.

The NHANES I information indicated that vitamin A deficiencies were not a problem in the older age groups in the U.S. population, and as a result, collection of information on the biochemical findings of vitamin A was limited in NHANES II to the 3-11 years age group. (It was not recognized at the time that vitamin A levels in adults would be of considerable interest in cancer research.)
Technical problems in the collection of blood samples and their analysis for vitamin C during the NHANES I program had resulted in unsatisfactory data. These problems were solved, and vitamin C determinations were again made in NHANES II. The methods used in NHANES I for determining the iodine, thiamine, and riboflavin values in urine were found to be inadequate, however. Therefore, the decision was made to exclude those determinations from NHANES II. Some consideration was given to using the more sensitive enzyme analysis method to detect any riboflavin or thiamine deficiencies. Some of the investigations at the Center for Disease Control involved the spectrophotometric erythrocyte transketolase method as well as a spectrophotometric method for erythrocyte glutathione reductase. This work identified a number of compromises in basic enzyme assay principles and certain questions in the color development procedure that would require a considerable amount of additional time to evaluate fully. It was, therefore, decided not to include these in the NHANES II program. On the other hand, the serum albumin test used in NHANES I was continued in NHANES II as a monitor of protein deficiency in the U.S. population. The relationship of the serum albumin test to clinical health status was also an important factor in its retention, since as a whole there is little evidence of a gross pattern of protein deficiency in the U.S. population.

An important addition in NHANES II to the biochemical data obtained in NHANES I related to the investigation of the trace elements zinc and copper in blood. It was known in 1974 that there are more than 70 enzymes that need zinc for their proper function. Important factors in decreasing the absorption of dietary zinc are the fiber and phosphates in predominantly cereal-based diets. The consumption of alcohol increases urinary excretion.
A number of diseases such as steatorrhea, regional enteritis, liver cirrhosis, hemolytic anemia, psoriasis, thalassemia, and sickle cell disease may lead to zinc deficiency. Pregnancy may also predispose to zinc deficiency. Zinc is involved in the production of insulin, and zinc deficiency may impair wound healing. Copper deficiency is important for a number of reasons. The first sign of copper deficiency in humans is usually neutropenia. In advanced copper deficiency, iron is not absorbed. A copper-containing enzyme (ceruloplasmin) is necessary for the human body to use iron. Copper is essential in hematopoiesis and plays a key role in connective tissue metabolism.

Since in trace element surveys many factors can grossly interfere with the integrity of the specimens, a number of precautions were taken. A thorough investigation was made of various aspects of the collection, storage, stability, and possibilities of contamination of specimens. Special blood-drawing equipment and specimen storage containers were employed. A laminar flow table was used to prevent airborne contamination during specimen processing at the laboratory in the examination center.

As in the NHANES I program, the two principal means of obtaining data on dietary intake were the 24-hour recall and the food frequency questionnaire. In order to facilitate comparison of the various types of information, the schedules used were modified somewhat in NHANES II so that both of them used identical food groupings. This was done in a way that still permits the comparison of NHANES II with NHANES I data. Considerably increased amounts of information on vitamin and mineral supplements were obtained in NHANES II as compared with NHANES I. In NHANES II, information was obtained on participation in such food programs as food stamps, commodities, school lunches, home-delivery meals, and the like.
This information will permit comparisons between the measures of nutritional status of individuals participating in these programs and individuals of similar socioeconomic status who are not participating.

The body measurements obtained in NHANES II, the third part of the fourfold approach to assessing nutritional status, were the same as those used in NHANES I. They were as follows: standing height, sitting height, weight, bitrochanteric breadth, elbow breadth, upper arm girth, head circumference, triceps skinfold, and subscapular skinfold. The only change made was to obtain measures in 3-year-olds of both standing height and recumbent length, along with sitting height and a crown-rump measurement.

The fourth approach to assessing nutritional status, a physician’s examination, was also largely unchanged from the examination given in NHANES I. The examining physician’s clinical diagnostic impression was based on the physical examination and medical history along with the examining physician’s own reading of the electrocardiogram and X-ray and the results of some laboratory determinations immediately available at examination time (hematocrit, hemoglobin, white blood cell, red blood cell, urinary test tape, and microscopic urinalysis). The examining physician’s readings of the electrocardiogram and X-ray were not, of course, equivalent to the readings that were obtained later from medical specialists. The examining physician’s clinical diagnostic impression of many conditions was, in fact, based on much less than a complete workup. For many other conditions, however, the examining physician’s clinical diagnostic impression may have had a reasonable degree of accuracy. For their diagnostic impressions, the physicians entered the four-digit coding of the Eighth Revision International Classification of Diseases, Adapted for Use in the United States9 rather than the three-digit code used in NHANES I.
The most important change in the approach to nutritional assessment adopted for the NHANES II program was in relation to anemia. Since this condition had been revealed by NHANES I to be a significant health problem in the U.S. population, anemia was investigated in more detail in NHANES II. The approach used to characterize anemia was one that had been recommended by Dr. William Darby, President of the Nutritional Foundation, Inc., Center for Disease Control personnel, and others. It involved symptoms, signs, and causes of anemia gathered in medical history questionnaires and physicians’ examinations; and it involved laboratory assessments in blood as follows:

• A complete blood count: hematocrit, hemoglobin, white blood cell, red blood cell, cell differential, red cell morphology, and the determination of hemoglobinopathies.
• Iron, iron-binding capacity, serum ferritin, and red cell protoporphyrin to designate iron status.
• Serum folates, red cell folates, vitamin B12, zinc, copper, lead, and other indicators of anemia.

The folate, ferritin, and vitamin B12 determinations were done on anemic individuals and on a subsample of the entire group. This approach to characterizing anemia should make possible a better determination of the prevalence of anemia in the U.S. population than could be made from the NHANES I data and will enable the relationships among the various iron-related measures to be characterized. Such a determination is important for various public policy actions such as recommendations for enrichment of food products with iron.

Detailed health examination

Major new target conditions

The NHANES programs have been referred to as dual-purpose surveys, the purposes involving the assessment of both nutritional and health status. It might be more precise to refer to them as surveys to measure health status with special emphasis on one of the major determinants of health: nutrition.
Be that as it may, information about a number of health conditions regarded as target conditions was collected in NHANES I, and many of these same target conditions were included in NHANES II. The new target conditions included in NHANES II were diabetes, kidney pathology, liver function, and allergy.

Diabetes. Diabetes has long been recognized as an extremely serious disease affecting a significant proportion of the U.S. population. Despite this fact, there has been wide variation in the estimated prevalence of diabetes in the population. A problem arises as a result of the presence of unrecognized or undiagnosed cases of diabetes that need to be added to the recognized or diagnosed cases to obtain the total prevalence. A health examination survey is an ideal mechanism to obtain prevalence estimates that include both diagnosed and undiagnosed cases. The prevalence of known cases of diabetes has been monitored by another NCHS survey, the National Health Interview Survey, and unpublished data from that program appear to indicate an increase in the prevalence of diabetes. The apparent increase, however, may be due to the wider use of diabetes-detecting clinical tests in the U.S. population and not to a true increase in the prevalence of the disease. The first National Health Examination Survey (1960-62) provided some information on the prevalence of diabetes, based on a 1-hour glucose tolerance test,10-13 but a closer approximation to a standard glucose tolerance test than was then used14 would have been essential to provide an adequate estimate of the total prevalence of diabetes mellitus.

Increased attention to diabetes was mandated by the National Diabetes Mellitus Research and Education Act, enacted by Congress on July 23, 1974 (Public Law 93-354).
Its purpose was to (1) expand the authority of the National Institutes of Health to advance the national attack on diabetes mellitus; and (2) as part of that attack, to establish a long-range plan to (A) expand and coordinate the national research effort against diabetes mellitus; (B) advance activities of patient education, professional education, and public education which will alert the citizens of the United States to the early indications of diabetes mellitus; and (C) to emphasize the significance of early detection, proper control and complications which may evolve from the disease.

In planning NHANES II, NCHS worked closely with the National Commission on Diabetes (established under Public Law 93-354) and with the National Institute of Arthritis, Metabolism, and Digestive Diseases of the National Institutes of Health. Dr. G. Donald Whedon, Director of this Institute, specially requested that a diabetes component be included in NHANES II in order to determine both the prevalence of diabetes mellitus in the U.S. population and the ratio of previously diagnosed to undiagnosed cases. In addition, the distribution of diabetes within the population according to various demographic characteristics was of interest. Besides the assistance obtained from the National Institutes of Health directly, a number of consultants on the diabetes component were used in planning the NHANES II program. The principal ones were Drs. Peter Bennett, John O’Sullivan, Kelly West, and Harvey Knolls.

A number of questions arose during the detailed planning of the diabetes component. One of these was whether or not to require the consumption of a specific number of grams of carbohydrates during the 3 days before the examination.
The major drawback of such a procedure for NHANES was the elimination of the 24-hour recall diet history from the nutritional dietary survey for individuals undergoing the glucose tolerance test, since the diet preparation would have seriously altered the previous day’s food intake. Consideration was given to interviewing persons who were to receive the glucose tolerance test at home at a time other than the 3 days before the examination, but limitations of budget and personnel precluded this solution. The question of diet preparation was brought up at a session of the work group on epidemiology of the Committee on Scope and Impact, a subcommittee of the National Commission on Diabetes. The work group did not reach general agreement. The group’s final decision was that the consumption of a specific amount of carbohydrates prior to the test would not be required, but that data from the 24-hour recall and the presence of ketones found in the urine sample would serve as an indication of whether or not there had been an inadequate consumption of carbohydrates prior to the test.

Some consideration was also given to the collection of data reflecting levels of circulating insulin and glucagon. After due consideration, it was decided to omit determinations of insulin and glucagon, largely because of the lack of adequate resources.

The test finally decided upon for the diabetes component was as follows: a one-half sample of persons 20-74 years of age was scheduled for examination in the mornings. (Analysis of Cycle I glucose tolerance data indicated that sample variances for this reduced sample would be low enough to permit data analysis.) Three blood glucose specimens were collected: a fasting one, and specimens collected at 1- and 2-hour intervals after the glucose “challenge” had been drunk. Data could then be tabulated for each blood specimen, and some combination of the three values could be used to decide whether or not sample persons had diabetes.
Previous studies had indicated that a 3-hour value did not contribute significantly to the diagnosis of diabetes and that attempting to obtain it would only increase nonresponse and unduly lengthen the examination time. A 75-gram glucose challenge was selected. Available information suggested that data derived from larger loading doses were generally interchangeable with the 75-gram dose. The tests were done only in the morning because glucose tolerance decreases later in the day. In general, health conditions, such as pregnancy, that were known to alter carbohydrate metabolism were not grounds for exclusion from testing. The test was also given to those individuals who had been told by their physicians that they were diabetic and whose condition had been controlled by diet or by oral hypoglycemic medication. The test was not given to insulin-dependent diabetics.

The examinees were instructed not to eat anything after 11:00 p.m. on the evening before the test. On the morning of the examination, after a fasting venous blood specimen had been drawn and a urine specimen had been analyzed for glucose, the examinee was given 7 ounces of caffeine-free cola (Glucola) to drink, which contained an equivalent of 75 grams of glucose. Two more specimens of blood were drawn at 1- and 2-hour intervals. The blood was processed in the examination center laboratory, and the frozen plasma was shipped to the Center for Disease Control in Atlanta, Ga. There the plasma was analyzed by the hexokinase Glucose 6-Phosphate Dehydrogenase Procedure, using an automated modification of the National Glucose Reference Method developed at the Center for Disease Control.

Kidney pathology. A second major new target condition selected for inclusion in the NHANES II program was kidney pathology.
Very little data directly bearing on this had been collected in previous NHANES or NHES programs, and numerous requests to have a kidney component in the examination survey programs had been received over the years from the National Institutes of Health, the National Kidney Foundation, and several nephrologists in the NHANES professional inquiry groups. Malfunction of the kidneys is an important health condition, made more so by the very expensive and complex nature of the therapy that is provided by the artificial kidney. In planning this component, numerous people, including Dr. George Schreiner, Georgetown University Hospital, Dr. Nancy Cummings, National Institutes of Health, and Dr. James C. Hunt, Mayo Clinic, were consulted. A number of tests and procedures were considered in addition to an expanded medical history questionnaire, including a variety of questions related to urinary problems. Various modalities were investigated, some of which had to be rejected because of difficulties in the field situation. For example, because it was desirable to obtain a measure of bacteriuria, an indication of possible urinary infection, modifications of quantitative culture techniques and direct examination of urine for bacteria by gram stain were considered. However, to avoid the likelihood of false positive results, it is desirable to obtain at least three separate specimens in any procedure involving a bacterial culture. Previous examination survey experience had made apparent the difficult logistical problems encountered in requiring repeated visits. Given the constraints, it was finally decided to rely upon the simple nitrite test using a dipstick to test for bacteriuria. The test is highly specific but not highly sensitive.

The creatinine clearance test, a widely used test of kidney function that involves the collection of timed urine specimens and a blood specimen, was also carefully considered.
The original plans were to include a 2-hour creatinine clearance test with a water load of approximately 400 cubic centimeters at the start of the test. However, one of the major sources of error involved in 2-hour collection is inadequate emptying of the bladder. Since the amount of urine collected in this instance would be relatively small, any retained urine could cause considerable error in test results. Methods for measuring retention of urine, such as use of isotopes, were not regarded as feasible in the field survey. Pilot testing of the timed urine collection strongly suggested that a significant number of individuals did not empty their bladders adequately. For these reasons, it was decided not to use the 2-hour creatinine clearance test but to rely only on a serum creatinine test, a widely used but less sensitive indicator. Support for the laboratory work for this biochemical determination was provided by the National Institute of Arthritis, Metabolism, and Digestive Diseases.

Microscopic examination of urinary sediments was another of the procedures considered for inclusion in the survey. While consideration was given to an exact quantitative test of urinary sediments using an aliquot of a timed urine specimen (a highly accurate procedure according to some reports), it was decided after the recommendation of consultants to use a method more closely approximating that used in clinical laboratories. The procedure finally adopted was the one used for urinalysis in the Mayo Clinic. It consisted of centrifuging the urine specimen, decanting the supernatant fluid, and examining the sediment for the presence of red and white blood cells and cell casts. Ten microscopic fields were examined for each specimen, using 10-power and 40-power magnification. However, if the voided urine was dilute, the counts on urinary sediments would be much lower than if the urine sample had been highly concentrated.
For this reason it was decided to do the microscopic analysis only on the adult subsample of persons 20-74 years of age who were also to receive the diabetes test. This group would have had a sufficient number of hours of fluid deprivation immediately preceding the test, during the time spent sleeping, to produce sufficiently concentrated urine (specific gravity of 1.015 or greater) for the test. This particular procedure was also used in a study of kidney disease in the Scandinavian population.15 One finding from that study was an average of almost 60-percent lower frequency of pyuria in both men and women when midstream specimens were used. Therefore, a midstream collection procedure was used for women and a 2-glass procedure for men, with the sediment analysis carried out on the second specimen.

Dipstick tests for bilirubin, nitrite, urobilinogen, blood, glucose, and ketones were also included in the NHANES II program. Optical density, as read on a refractometer, was also determined to assist in interpreting the data, since it gives some indication of the concentration of urine. In addition, an osmolarity determination, another index of the concentration of urine, was made at the central laboratory where pesticide determinations in urine were made.

Liver disease. There is a lack of reliable epidemiological data on the prevalence of liver disease in the general population. Some information on the prevalence of hepatitis comes as a result of serological tests; and considerable evidence based on mortality data, including autopsy records, indicates that liver disease is fairly widespread. Experts, including Dr. Paul Beck, of the National Institutes of Health, and Dr. Norman Javitt, of Cornell Medical Center, were consulted. The problem was to decide on appropriate tests to use in a sample survey.
Unfortunately, the most commonly used test to detect liver disease (the BSP test), one both sensitive and specific, involves the intravenous injection of a material that may not be entirely safe. For this reason it was out of the question that it be used in the NHANES II program. Other tests that were considered, including various enzyme tests such as the SGOT, SGPT, alkaline phosphatase, and so on, are not as sensitive as the BSP test; nor are they specific, since results can be elevated when conditions other than liver disease are present.

In this situation, Dr. Javitt suggested that a test for elevated serum postprandial bile acids be used. Bile acids are removed by the liver from blood returning to the heart via the portal vein. The liver cells rapidly secrete the recirculated bile salts into canaliculi, where they pass down the ductal system to enter the gallbladder. Under the influence of gastrointestinal hormones, the bile is discharged into the intestine. The bile acids are then absorbed by the intestine and later enter the portal vein to start the cycle again. Because a diseased liver will not remove bile acids as efficiently as a healthy liver, bile acids will accumulate in the blood stream, and a measurement of bile acids in the serum is therefore relevant. A meal containing fat causes a contraction of the gallbladder and in effect results in a greater elevation of bile acids than that occurring under fasting conditions.

For the NHANES II survey it was decided that sufficient fat to elevate bile acids could be obtained by the sample person’s drinking an eggnog preparation. Peanut butter cups were substituted for eggnog for the occasional person who was allergic to eggs and egg products. Blood was collected 2 hours after administering the eggnog preparation or the substitute, and the test was given only to adults 35 years of age and over, since the cost of laboratory work was relatively high.
The results of the test were to be combined with information from special medical history questions related to liver disease. Since data on alcohol consumption were also collected in NHANES II, there is the possibility of relating such data to the findings with respect to liver disease.

Allergy. The need for better data on the epidemiology of allergic conditions in the U.S. population has long been known and was specifically pointed out to the National Center for Health Statistics by Dr. Sheldon C. Siegal, who at the time was president of the American Academy of Allergy. Dr. Siegal strongly recommended that an allergy component be included in the examination survey program. Data from other NCHS surveys and from other sources showed that the clinical manifestations of allergy were responsible for a large number of ambulatory care visits and widespread use of prescription and nonprescription drugs. Seasonality would be a problem in measuring the clinical manifestations of allergies in a survey with the NHANES design because of the scheduling of the examination sites. However, reactions to skin tests are closely related to the presence of various respiratory conditions, including asthma and allergic rhinitis.16

Further consultation on the possibility of including such a component was held with Dr. Phillip S. Norman, who succeeded Dr. Siegal as president of the Academy. It was recommended that data be collected, including an allergy history and the results of a skin test. At Dr. Siegal’s request, Drs. John Farghan, Charles Read, and Albert Schaeffer drew up a specific format and content for the allergy examination. The recommendation of the consultants was that the prick test be used, which, along with the scratch test, is considered to be among the safest procedures used for skin testing. The test involves pricking the skin through a drop of antigen placed on the skin.
Their recommendation was adopted, as was the recommendation to use eight separate aeroallergen extracts: house dust, alternaria, cat fur, dog fur, mixed long and short ragweed, oak, perennial rye grass, and Bermuda grass. In addition to the eight allergens, two controls, one containing the diluent used for the antigens and another consisting of a histamine phosphate solution, were used.

The allergy skin test was administered to examinees 6-74 years of age. The back, frequently considered the most uniform site for skin tests, was deemed impractical to use for testing because of lack of facilities for keeping examinees in a prone position for the required time. Therefore, the nonvascular area of the forearm was used. Special precautions were taken for individuals with a history of allergy to ragweed and even more particularly to cats or dogs, as revealed by the allergy history questions. After the administration of the allergens, readings were taken at both 10 and 20 minutes (the latter being the more commonly used standard measurement). Both the length and width of the wheal and its flare were measured, and standard clinical recordings were made of the allergic reaction. The consultants had originally recommended that lyophilized extracts of the allergen be used, but they were not commercially available, and standard scratch test antigens preserved in glycerin were used instead.

Other important target conditions

Osteoarthritis and disc degeneration. Osteoarthritis is one of the most common diseases in older Americans. The disease is an important cause of disability, causing limitation of activity and mobility. Osteoarthritis has two basic causes. A gene that is very common in the population produces a syndrome of hereditary osteoarthritis associated with Heberden’s Nodes. In this condition, severe disc degeneration and degeneration of the apophysial joint of the cervical spine are commonly seen.
The second type of osteoarthritis is due to mechanical wear and tear. There is little doubt that individuals who are exposed to high degrees of trauma develop severe disc degeneration of the cervical and lumbar spines. In addition to chronic pain, many syndromes may be noted. For example, severe involvement of the cervical spine may produce vertebral artery insufficiency and can cause severe dysphagia. Although findings from physical examination often lead to an inaccurate assessment of osteoarthritis, radiological methods are available for accurately assessing the severity of lesions. These methods were used in NHANES II. X-ray films taken in the survey include lateral views of the lumbar and the cervical spine. To avoid any possible X-ray damage to a fetus, lumbar spine X-rays of females were taken only at ages 50 and over.

As in previous cycles of the National Health Examination Surveys, certain aspects of the physical examination and medical history were included in the survey to give a picture of the functioning of the joints and the disabilities associated with joint pathology. Consultation on this aspect of the survey was mostly with Dr. William O’Brien of the University of Virginia and Dr. Peter Bennett, National Institute of Arthritis, Metabolism, and Digestive Diseases. The proposal was also reviewed by the Subcommittee of Epidemiology of the National Arthritis Commission.

Cardiovascular conditions. One part of the planned NHANES II cardiovascular component was an investigation of cardiac arrhythmia by means of Holter electrocardiogram recordings. Because cardiac arrhythmias are believed to be responsible for most sudden cardiac deaths, this study appeared to provide the opportunity for uncovering epidemiological data of major importance. In clinical practice, the Holter electrocardiogram recorders are attached to the patient, and recordings are made during a 10- or 24-hour period while the patient goes about usual daily activities.
To reduce the number of recorders and to lessen the operational complexities in NHANES II, the recordings were to be made over only a 2-hour period, while the examinee was engaged in other parts of the examination. A tryout of the procedure during the pilot test demonstrated that recordings of a good quality could be obtained. However, an expert committee assembled by NCHS and the National Heart, Lung, and Blood Institute to give advice on the proper processing of the tapes was of the opinion that certain parts of the examination, such as the glucose tolerance test, would affect the production of arrhythmias. Unfortunately, the committee recommendations would have necessitated a redesign of the examination that would have added more time to the length of the examination than was judged feasible. When this determination had been reached, there was not enough time left in the planning process to explore alternative proposals, and so the Holter electrocardiogram recordings had to be eliminated from the final NHANES II plan.

To record the electrocardiogram, equipment that would record three channels of data simultaneously (12-standard lead and 3-Frank lead), with immediate conversion from analog to digital format, was used. The electrocardiogram was taken with the examinee resting in a supine position. It should be noted that the computer program available for three-channel processing was much more accurate than those previously available for one-channel processing.

To obtain continuing information on hypertension and the status of related medical control efforts in the United States, blood pressures were taken and appropriate medical history questions were included in NHANES II, as they had been in the previous cycle of examinations (NHANES I). As is mentioned above, determinations were made of cholesterol, triglycerides, and high density lipoproteins (HDL).
Spirometry. To provide normative data on pulmonary function similar to that obtained in NHANES I for persons 25-74 years of age, spirometry was performed in NHANES II on individuals 6-24 years of age. As in NHANES I, the data were recorded on tape, using the same equipment as that used for the electrocardiogram recordings. A computer program was used for processing the data and converting it into the individual parameters that describe pulmonary function. The data can be analyzed in relation to the allergy component and the respiratory data obtained from the medical history and examination.

Speech pathology and hearing. The originally planned speech and hearing component of the survey was markedly shortened as a result of consultation and pilot testing. Impedance audiometry had been an important component of the original plan. This procedure was designed to give a measure of the prevalence of middle ear pathology in the United States. During the pilot test, however, difficulties were encountered in getting an adequate air seal; several examinees experienced discomfort; and the test took longer than expected. A decision to discontinue the procedure was made after the pilot test: although additional months of experience with the procedure might have reduced the problems encountered, the entire survey schedule would still have been disrupted. Although impedance audiometry was dropped from the survey, pure-tone audiometry was included for all sample persons 4-19 years of age.

It had originally been planned to obtain a speech sample from individuals 4-74 years of age for speech pathology testing, but the instrument finally selected for the speech test was the Stephens Oral Language Test,17 a test using standardized stimulus sentences that had been used to screen children from 4 through 6 years of age for deficiencies in syntax and articulation.
Although the test had been used extensively in the 4-6 age group, there was only very limited experience of its use in older age groups. In NHANES II only those 4-6 years of age were tested, since the test had received adequate validation only in that group. Because of substantial oversampling of this age group for the nutrition survey, there were enough children for the resulting data to be useful.

Since trained speech pathologists were not available for the survey team, speech recordings of the 15 sentences used in the test were made at the examination site. These recordings could be evaluated subsequently by a speech pathologist. Considerable effort was expended in designing a recording setup that would produce excellent high-fidelity recordings. In order to provide a standard stimulus for eliciting the speech sample, Dr. Irene Stephens, Associate Professor, Department of Communicative Disorders, Northern Illinois University, recorded a reading of the speech test on separate Language Master cards. Subsequent evaluation by Dr. Stephens of about 400 recordings taped by the survey demonstrated the feasibility of this approach.

Blood tests: carbon monoxide, lead and pesticide levels, and venereal disease. The increasing involvement of NHANES in studying environmental health factors has reflected the increasing interest in the effect of the environment on health. In NHANES I the major project in the environmental field was the collection and analysis of household water samples for various bulk elements and trace metals. New environmentally related tests were developed for NHANES II.

Air pollution or, specifically, carbon monoxide pollution is an often cited problem in many cities of the United States. Carbon monoxide is a colorless, odorless gas that is a product of incomplete combustion and is primarily produced from industrial plants, electric power plants, and automobile exhaust.
It has been suggested that carbon monoxide may act to precipitate cardiac symptomatology or episodes by reducing the supply of oxygen to a heart already compromised by coronary disease. Because of the lack of acceptable information on the body burden of carbon monoxide and the potential deleterious health effects due to carbon monoxide air pollution, it was thought to be an appropriate area of study for NHANES II. Since smoking also results in higher carbon monoxide levels, questions on smoking were included in the survey. Carboxyhemoglobin determinations were done on a half-sample of examinees 3-74 years of age. Special care was taken in quality control for the laboratory determinations, including the use of a reference laboratory. Analysis of data should indicate whether and where carbon monoxide pollution is a significant problem.

For many years lead poisoning has been considered an important public health problem, particularly in children. Some important causes of high body levels of lead are contaminated foods, automobile exhaust, and, in children, lead paint. Lead poisoning can produce many adverse effects, including anemia, anorexia, colic, parietitis, hypertension, arteriolar degeneration, permanent renal damage, encephalopathy, mental retardation, blindness, cerebral atrophy, glycosuria, visual disturbances, epilepsy, and palsy. In a meeting on trace elements, Dr. Katherine Mahaffey of the Food and Drug Administration gave the following rationale for a survey of lead levels in blood:

• Available data come either from populations where lead contamination is suspected to be high or from specific control groups where lead contamination is expected to be very low. There is no information about the distribution of lead levels in blood for the general U.S. population.
• The variability with age is not known.
• With expected large-scale changes in exposure of the population to lead, knowledge of present serum lead levels is needed as a baseline for future studies. Normative information is essential to substantiate regulatory decisions based upon knowledge of the biological meaning of high lead levels coupled with available data on lead levels at minimal lead exposure.

Blood lead determinations were made on all children through the age of 6 and on a half-sample of all examinees over that age. Because of the interest of the Food and Drug Administration in the lead determinations, the laboratory cost of the test was underwritten by the Bureau of Foods, Food and Drug Administration, and the determinations were made by the Bureau of Laboratories of the Center for Disease Control.

The Environmental Protection Agency is authorized under Public Law 92-516 to monitor not only the environment but human beings as well for evidence of pesticide exposure or contamination. The National Human Monitoring Program for Pesticides is operated by the Environmental Protection Agency in partial fulfillment of the legislative mandate. The program's goal is to determine on a national scale the amount of exposure of the general population to pesticides. It was considered by the Environmental Protection Agency that NHANES II could establish important baseline data on the body burdens of several types of pesticides through blood and urine analysis (appendix III). With the use of chlorinated hydrocarbon pesticides declining and that of organophosphate, carbamate, and phenoxy-type compounds increasing, the capacity to determine human exposure to these new, widely used pesticides has become imperative. In order to obtain this information, the Environmental Protection Agency offered to underwrite the laboratory cost of pesticide level determinations of a half-sample of NHANES II examinees 12-74 years of age.
A few questions relating to exposure to pesticides were added to the questionnaires, and blood and urine specimens were obtained on the half-sample.

The Center for Disease Control asked NCHS to include a survey component for venereal disease in NHANES II. The two diseases to be studied were gonorrhea and syphilis. Syphilis testing involved few problems because it had already been included in NHES I (1960-62)1 and the 1974-75 NHANES I Augmentation Survey.5 Inclusion of the serological tests for syphilis on the full sample of persons 12-74 years of age provided opportunity for analysis of the data by population subgroups as well as a comparison with the 1960-62 survey. The serology determinations for syphilis included qualitative and quantitative ART, an FTA-ABS, and an MHA-TP. The tests are classified respectively as flocculation, immunofluorescence, and hemagglutination.

It is more difficult to test for the presence of gonorrhea. At present there is no serological test for gonorrhea specific enough to be suitable for survey purposes. The standard clinical method for women involves taking an endocervical culture at the same time that a Pap specimen is taken. Experience at our initial pretesting operation indicated that many women were unwilling to undergo this procedure in a survey setting, and it was therefore decided to omit it from the examination. Instead, a somewhat less sensitive method was used that involved culturing urinary sediments obtained after centrifuging urine specimens. The age range of individuals studied was 12-40 years for males and females, and of those females who received the glucose tolerance test, only those 20-24 years of age had the gonorrhea test done.
Sample design for NHANES II

The general structure of the NHANES II sample design is similar to the designs of NHANES I4 and the first three health examination surveys conducted by the National Center for Health Statistics.1-3,18 The design is a stratified, multistage, probability cluster sample of households throughout the United States. The process of selecting a sample of persons to be examined is a cascading one that involves the selection of primary sampling units (PSU's; a PSU is a county or small group of contiguous counties), census enumeration districts (ED's), segments (a segment is a cluster of households), households, eligible persons, and finally sample persons. The major difference between the NHANES I and NHANES II designs is the use of a different set of definitions and stratification procedures for PSU's. The details of the NHANES II sampling plan, which resulted in a total of 27,803 sample persons and 20,325 examined persons in 64 PSU's throughout the United States, are described in the following sections.

Design specifications

The planning phase for NHANES II is described in a previous section, along with many of the survey objectives. The survey specifications that directly affected the sample design were as follows:

• NHANES II should be a probability sample whose target population is the civilian, noninstitutionalized population of the United States (including for the first time Alaska and Hawaii) for persons 6 months through 74 years of age.
• Subgroups of the population of special interest for nutritional assessment should include preschool children (6 months-5 years), the aged (60-74 years), and the poor (persons below the poverty level as defined by the U.S. Bureau of the Census using 1970 census results). These groups should be oversampled to improve the reliability of the statistics for the subgroups.
• The total sample size selected for NHANES II should result in approximately 21,000 examined persons.
• The number of sample persons selected in each PSU should be between 300 and 600.
• The data collection mechanism used in NHANES I should be used in NHANES II with appropriate modifications. Examinations should be conducted in three mobile examination centers. At any time during the survey period (except holidays) two of the centers should be operating in different locations while the third is being serviced or relocated.
• The total period of data collection should be 3 to 4 years.
• The average length of an individual examination should be between 2 and 3 hours, but it should vary depending on the age of the examinee. The time required to examine a preschooler should be less than 1 hour, while the time for an adult should not exceed 2½ to 3 hours.
• Approximately one person per sample household should be selected for an examination. The exact number of persons selected for an examination in each household should be determined by applying the sampling rates designated for the different age groups.
• The size of the PSU should be defined so that it is optimal with respect to cost and response and results in national statistics with an acceptable level of precision.
• The survey should be designed so that precise statistics can be produced for the four broad geographic regions of the United States and for the total population by age, sex, race, and income classifications.

These sample design specifications took a number of factors into account, including budgetary resources, logistical constraints, time limitations, equipment mobility, and unit operating costs. The specifications also reflected the experience gained from past examination surveys.

One of the major survey objectives of NHANES II was the examination of a high percentage of sample persons. The overall response rates in the examination surveys conducted by NCHS had continually declined since the 1960's.
The response rate for the two surveys of the total U.S. population had declined from 87 percent in the early 1960's to 74 percent in the early and mid-1970's. There were multiple reasons for this decline in response, some controllable and some not. Whatever the reasons, the results of the survey may have been biased because a large proportion of sample persons had not been examined.

A design change that was investigated for improving response was the use of smaller geographical areas as PSU's. The PSU's used in previous examination surveys had been defined either as a single county or as a group of contiguous counties (except in certain parts of New England). Many of the larger PSU's were defined as standard metropolitan statistical areas (SMSA's) and often contained several counties. The PSU's that contained several counties and covered a large area were not ideally suited for an examination survey.

Attempting to survey large geographic areas from a centrally located examination center created a number of logistical problems. Some examinees had been asked to travel more than 50 miles to be examined, while others had been asked to travel through very congested areas. Many respondents were reluctant to travel under such conditions. The cost of followup visits to the households was also a function of the distance or time from the examination center. An analysis of the response rates for several stands in NHANES I lent further support to these assumptions. The use of smaller areas as PSU's would reduce both the average distance traveled to the examination center by examinees and the cost of the field work. These considerations were the basis for redefining and restratifying the PSU's in NHANES II.

Definition and stratification of primary sampling units

The first-stage sampling units selected in the previous NHES and NHANES I surveys were subsets of the sample PSU's in the National Health Interview Survey (NHIS).
NHIS is one of the NCHS major data collection programs, the design of which is described in an NCHS report19 and in a technical paper20 by the U.S. Bureau of the Census. In NHIS the United States is subdivided into 1,924 PSU's, with 376 of the PSU's being selected for the sample. Sixty-five of these 376 sample PSU's were selected as the NHANES I sample.

In redefining PSU's for NHANES II, the formation of PSU's for NHIS was reviewed. The PSU's for NHIS had been defined by the Bureau of the Census and are the same as those used for the Current Population Survey.20 With some slight oversimplifications the following criteria had been used to define PSU's for NHIS:

• Each SMSA is a separate PSU.
• Each PSU is composed of a single county or contiguous counties (in some New England States minor civil divisions are used).
• Each PSU is defined within the four census regional boundaries.
• The area of a PSU is less than 2,000 square miles in the West and less than 1,500 square miles elsewhere.
• The 1970 population of a PSU is at least 7,500 in the West and at least 10,000 elsewhere.

The NHIS PSU's that contained more than one county were either SMSA's or had been defined using the last criterion above and represent rural areas. Since rural areas have traditionally had high response rates in the health examination surveys, the only PSU's considered for redefinition were the SMSA's. In the NHIS design, about 60 percent of the SMSA's contained a sufficiently large population to be selected for the sample with certainty (with a probability of one) and are referred to as self-representing PSU's. In NHIS, 156 of the 376 PSU's are self-representing SMSA's. It was these 156 self-representing SMSA's in the NHIS design that were redefined and restratified for the NHANES II design.

For NHANES II, the self-representing PSU's in NHIS were first split along county boundaries.
Within each region, each of the counties was classified as being either a self-representing or a nonself-representing PSU. The PSU's that were nonself-representing were further combined into homogeneous classes or strata equal in size to the NHIS strata containing nonself-representing PSU's. The formation of new strata was governed by the following rules:

• Each new PSU with a population of more than 250,000 in 1970 was classified as a self-representing PSU. In a few special cases, some PSU's with slightly smaller populations were classified as self-representing.
• The remaining newly defined PSU's were combined with other PSU's having similar sociodemographic characteristics to form a number of nonself-representing strata. The PSU's within a stratum were all located in the same geographic region.
• Each of the nonself-representing strata was made to have about the same population. The average stratum contained about 350,000 persons in 1970.

This method of stratification and the stratification variables used to form NHIS nonself-representing strata are the basis for the procedures used to form the larger strata for NHANES II described in the next section.

The regional boundaries used in stratifying PSU's differ from regional boundaries as defined by the Bureau of the Census. Figure 1 shows the different regional boundaries used in NHANES II and the census. In order to produce regional estimates with approximately equal precision, the NHANES II regions were defined so that they would each contain approximately the same number of sample PSU's. Because of the small sample size for NHANES II, a regionally balanced design was needed for producing regional statistics.

Table A shows the effect of subdividing the self-representing PSU's in NHIS and redefining the PSU's by using county boundaries.
A total of 397 PSU's were formed from the 156 self-representing PSU's: 198 were defined as self-representing, and 199 were defined as nonself-representing and subsequently used to form an additional 43 nonself-representing strata. The average population of a self-representing PSU was reduced from 838,000 to 584,000. In area, the average size of these PSU's was reduced more than 60 percent, from 2,185 square miles to 855 square miles.

Formation of superstrata in NHANES II

After the 461 first-stage units (NHIS strata) had been defined, they were further stratified into a total of 64 superstrata for the NHANES II design. One PSU was selected from each of the superstrata, and these PSU's represented the 64 geographic locations visited by the mobile examination centers during the survey period. The stratification and selection of first-stage units in NHANES II is as follows.

The number of primary sampling units had to be determined before the number of superstrata could be determined. Because of the design specifications, the maximum number of locations that could be visited during a 4-year period is approximately 80 stands. In order to decide the number of first-stage units to select, a series of design calculations was made. A general description of the process is presented elsewhere.18 The design model used incorporated such factors as total budget, unit costs, and precision of estimates obtained in previous surveys for a variety of health characteristics. These calculations showed that the optimum number of locations to select was 130, examining 160 persons per stand.

One important variable not built into the design model, however, was "down time." Moving from one location to another requires 1 full week, even when a third examination center can be relocated and hooked up in advance. Time is required for closing the office, packing the equipment, traveling to the new location, and setting up and calibrating the equipment.
Locating in 130 different areas over a 3- to 4-year period implies that 2 weeks or less would be spent at each location. This length of time was felt to be too short to achieve required response rates since, in many areas, repeated callbacks are required to achieve a 75-percent examination rate. Previous field experience had indicated that staying in an area for only 2 weeks could reduce response rates by as much as 10 percent.

Taking all of the logistical problems into consideration led to the selection of a design of 64 primary locations with an average expected number of about 440 sample persons per location. Thus, an examination center would be located in each area for a period of 4 to 6 weeks. With two examination teams being employed simultaneously, about 16 stands could be completed per year. A final comparison was made between the selected design and the design that was optimum with respect to sampling error. It was concluded that the final selected design would decrease the reliability of the survey estimates by about 10 percent from those of the optimum design but would substantially reduce the nonsampling component of error.

Table A. Number and population of National Health Interview Survey (NHIS) strata before and after subdivision of self-representing primary sampling units, by type of stratum and National Health and Nutrition Examination Survey region
[Population estimates are based on 1970 Decennial Census]

                             NHIS strata                             Redefined strata
Type of stratum       Number of  Population    Average        Number of  Population    Average
and region            strata     in thousands  population     strata     in thousands  population
                                               in thousands                            in thousands

Self-representing
  All strata            156      130,760         838            198      115,629         584
  Northeastern           50       41,897         838             64       36,795         575
  Midwestern             30       31,890       1,063             43       27,831         647
  Southern               38       22,706         598             49       19,674         402
  Western                38       34,266         902             42       31,329         746

Nonself-representing
  All strata            220       72,679         330            263       87,811         334
  Northeastern           20        7,144         357             34       12,246         360
  Midwestern             61       20,279         332             73       24,339         333
  Southern               84       26,752         318             93       29,785         320
  Western                55       18,504         336             63       21,441         340

Figure 1. Comparison of regional boundaries for the National Health and Nutrition Examination Survey, 1976-80, with those defined by the U.S. Bureau of the Census
[Two map panels: "Regional Boundaries for the National Health and Nutrition Examination Survey, 1976-80" and "U.S. Bureau of the Census Regional Boundaries"]

Because of the small number of primary sampling units, it was decided that the maximum amount of stratification should be used: that the NHIS strata be stratified in 64 superstrata and one PSU be selected per superstratum. The object of stratification is to group the strata with similar characteristics into homogeneous superstrata.

A stepwise regression analysis was used to determine which variables would be most effective for collapsing NHIS strata into superstrata. Since NHANES II is a health survey, it would be preferable to use health or health-related variables for stratification. The variables used for stratification must, however, be available at the county level to combine counties or groups of counties into strata. Since health variables were not available at the county level, the stepwise regression analysis was used to study the relationship between the sociodemographic variables that are available for all counties and a set of selected health variables from a previous health examination survey. For the analysis, measurements on all the variables listed below were made for each of the sample PSU's in the first health examination survey.

The dependent variables used in the regression analysis were

• Infant mortality rate and number of infant deaths.
• Percent and number of persons with kidney trouble.
• Percent and number of persons with heart trouble.
• Percent and number of persons with hypertension.
• Percent and number of persons with high levels of serum cholesterol.

The independent variables used in the analysis were

• Population.
• Rate of growth.
• Density (population per square mile).
• Percent urban.
• Percent manufacturing.
• Median income.
• Percent races other than white.
• Percent below poverty level.
• Percent Hispanic origin.
• Total Hispanic population.
• Population below poverty level.

These variables were defined by the U.S. Bureau of the Census and included the variables that had previously been used for stratification in NCHS examination surveys.

A stepwise regression was performed for each of the dependent variables. When the total number (rather than percent) of persons with a health condition was used for a PSU as the dependent variable, the only independent variable that entered the regression model was population. This demonstrates the importance of either stratifying the PSU's according to their population size or selecting the sample PSU's from strata with a probability proportional to their size. When the stepwise regressions were run for the percent of persons with a given health condition, a number of independent variables entered the regression model. Table B presents the results of the analysis by region. Table C shows the correlation matrix for the health variables and for selected sociodemographic variables.

The independent variables that entered the final regression model varied by health condition and among regions. Summarizing the results over all of the health conditions within each region led to some general conclusions: median income was the first or second most important independent variable within each region; the percent of the population below the poverty level was always among the three most important variables in each region; and either "percent races other than white" or "percent Hispanic origin" was among the three most important variables in all but one of the regions.
These results were further supported by the correlations shown in table C for the total U.S. population. Although the overall correlation between percent Hispanic and the health variables is low for the total United States, percent Hispanic entered the regression model for the Northeastern and Western Regions.

Because of these results, the following sample design decisions were made and implemented:

• The first and second most significant independent variables in each region were used as stratification variables.
• The third most important independent variable in the stepwise regression analysis in each region was used as a control selection variable (described in the next section).
• The formation of superstrata was performed separately for self-representing and nonself-representing strata within each region.
• Population size was used at the first level of stratification within each region.
• Sixteen superstrata were formed in each region. The superstrata were each about the same size, each containing approximately 3,200,000 persons according to the 1970 decennial census.

Table B.
Variables in final stepwise regression model, by region

Dependent variable; independent variables in final regression model for the Northeastern Region, Midwestern Region, Southern Region, and Western Region

Infant mortality rate Percent below poverty level Percent races other than white Median income Percent Hispanic origin Percent manufacturing Percent races other than white Percent Hispanic origin Percent races other than white Percent urban Percent below poverty level Median income Percent below poverty level Median income Percent manufacturing Rate of growth Percent Hispanic origin Percent with kidney trouble Percent Hispanic origin Percent below poverty level Median income Percent races other than white Median income Rate of growth Percent manufacturing Percent below poverty level Median income Percent Hispanic origin Percent races other than white Rate of growth Percent manufacturing Percent below poverty level Median income Percent with heart trouble Percent races other than white Percent manufacturing Percent Hispanic origin Median income Median income Rate of growth Percent below poverty level Median income Percent manufacturing Percent urban Percent Hispanic origin Percent with hypertension Rate of growth Percent below poverty level Rate of growth Percent races other than white Percent below poverty level Percent Hispanic origin Median income Percent below poverty level Median income Rate of growth Percent urban Percent races other than white Percent Hispanic origin Rate of growth Percent manufacturing Median income Percent with high serum cholesterol Percent Hispanic origin Median income Percent manufacturing Percent below poverty level Median income Percent below poverty level Percent Hispanic origin Percent races other than white Percent manufacturing Percent below poverty level Median income Infant mortality rate Median income Percent Hispanic origin Rate of growth

In accordance with the decision to use the first and second most significant independent variables
in addition to population size, the following variables were used as stratification variables for NHANES II:

Northeastern Region:
  Population in stratum
  Median income
  Percent below poverty level
Midwestern Region:
  Population in stratum
  Median income
  Rate of growth
Southern Region:
  Population in stratum
  Median income
  Races other than white plus Hispanics
Western Region:
  Population in stratum
  Median income
  Races other than white plus Hispanics

The actual formation of the superstrata in NHANES II was performed in two stages. During the first stage the NHIS strata were classified into 64 superstrata according to region, type of stratum (self-representing or nonself-representing), size of stratum (large or small), income (low, middle, or high), percent races other than white plus Hispanics (low or high), and percent below poverty level or rate of growth (low or high). The classification procedure used to form the preliminary superstrata is shown in table D. An important effect of the stratification process was the formation of superstrata containing only central cities, suburban counties, or rural counties. Although some precision was lost by splitting the larger SMSA's, it was hoped that a gain in precision would result from the division of central cities and noncentral cities into separate strata.

The final stage in the formation of superstrata was a cluster analysis of the superstrata formed in the first stage. The cluster analysis was performed separately in each region for the self-representing and nonself-representing strata. Within each of these subdomains the strata were ranked from lowest to highest by population size, area, percent manufacturing, rate of growth, percent urban, percent races other than white plus Hispanics, median income, and percent below poverty level. For each pairwise combination of strata, the Euclidean distance between the ranks was computed. For stratum A and stratum B, the Euclidean distance is defined as

$$d(A,B) = \sum_{i=1}^{p} \left( r_{iA} - r_{iB} \right)^{2}$$

where
  $p$ is the number of variables,
  $r_{iA}$ is the rank of the $i$th variable for NHIS stratum A, and
  $r_{iB}$ is the rank of the $i$th variable for NHIS stratum B.

The smaller the value of d(A,B) the more alike the strata are. The d(A,B) values were then evaluated for each pairwise combination of strata in the NHANES superstrata. Because of the overlap between the variables used for stratification and the variables used to compute the measure d(A,B), the d(A,B) values within a superstratum should be relatively small. This was generally true. A substantial number of individual strata were identified, however, whose sum of d(A,B) values with other members of the superstratum was large. In these cases, an attempt was made to realign the strata within the superstrata so that the sum of the d(A,B) values over all of the superstrata was minimized for each subdomain. Because of the number of constraints imposed on the stratification process, these adjustments were performed manually.

Table C. Correlation matrix for health and sociodemographic variables
[Health variables: percent with high serum cholesterol, percent with kidney trouble, percent with heart trouble, percent with hypertension, infant mortality rate. Sociodemographic variables: percent manufacturing, percent races other than white, percent below poverty level, percent Hispanic origin, rate of growth, median income, density, percent urban, population. Final row: average absolute correlation with health variables (values as extracted, column assignment not recoverable): .14, .42, .48, .47, .21, .32, .21]
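The rank-distance step described above can be illustrated with a short sketch. It assumes the measure is the sum of squared rank differences over the p variables (take the square root if a true Euclidean distance is intended), and the four strata, their values, and the function names are hypothetical, chosen only to show the mechanics; the report states that the actual realignment was done manually.

```python
from itertools import combinations


def ranks(values):
    """Rank values from lowest (1) to highest; tied values share their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    out = [0.0] * len(values)
    pos = 0
    while pos < len(order):
        end = pos
        while end + 1 < len(order) and values[order[end + 1]] == values[order[pos]]:
            end += 1
        shared = (pos + end) / 2 + 1  # average of the 1-based positions pos+1 .. end+1
        for k in range(pos, end + 1):
            out[order[k]] = shared
        pos = end + 1
    return out


def rank_distances(strata):
    """Return {(A, B): d(A, B)} where d is the sum of squared rank differences."""
    names = list(strata)
    p = len(strata[names[0]])
    per_variable = [ranks([strata[n][i] for n in names]) for i in range(p)]
    r = {n: [per_variable[i][k] for i in range(p)] for k, n in enumerate(names)}
    return {(a, b): sum((x - y) ** 2 for x, y in zip(r[a], r[b]))
            for a, b in combinations(names, 2)}


def within_sum(grouping, d):
    """Sum of d(A, B) over all pairs that share a superstratum in `grouping`."""
    total = 0.0
    for group in grouping:
        for a, b in combinations(group, 2):
            total += d[(a, b)] if (a, b) in d else d[(b, a)]
    return total


# Hypothetical strata: (population in thousands, median income in $1,000, percent below poverty)
strata = {
    "S1": [100, 9.0, 10],
    "S2": [120, 9.5, 12],
    "S3": [400, 6.0, 30],
    "S4": [380, 6.2, 28],
}
d = rank_distances(strata)
good = within_sum([("S1", "S2"), ("S3", "S4")], d)   # similar strata grouped together
poor = within_sum([("S1", "S3"), ("S2", "S4")], d)   # dissimilar strata grouped together
```

In this toy case the grouping that pairs similar strata has the smaller within-superstratum sum, which is the quantity the realignment sought to reduce.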
This procedure substantially reduced the sum of the d(A,B) values within the superstrata and produced a more efficient stratification. Cluster analysis was also similarly used for the formation of nonself-representing strata using the newly defined nonself-representing PSU's.

Selection of sample locations

The selection of one PSU per superstratum utilized a modified Goodman-Kish21,22 control selection technique. The control selection procedure was used to insure that the selected first-stage sampling units represented a "balanced" sample with respect to the control selection variables used. For example, within a region one might want to insure that the final sample PSU's were distributed evenly across States or across groups of States. This could be achieved by using the "State groups" within a region to control the number of PSU's selected within each State group. The first step in this selection process involves defining a set of admissible patterns (samples) so that each pattern has an acceptable distribution of PSU's across the control classes. A pattern or potential sample is admissible if, for each control class, the number of selected PSU's is within 1 of the number of PSU's expected to be drawn from that class based on its population. The total set of patterns is formed so that the probability of selecting any PSU within a superstratum is proportional to its population. Each pattern within the set is assigned a probability of selection based on the size of the sample PSU's within the pattern. The sum of the probabilities of selection over all patterns is equal to 1. After the probabilities of selection for the patterns were accumulated, a sample pattern was randomly selected for NHANES II.

Table D. Variables used for stratification in the National Health and Nutrition Examination Survey, by region

                                                      Stratification variables
Region and type of stratum           Number of   Income               Races other than       Rate of growth or percent
                                     strata                           white plus Hispanics   below poverty level

Northeastern                            16                                                   (Percent below poverty level)
  Self-representing strata              12
    Highly urban, New England [1]        1
    Other urban, New England             1
    Large counties (by population)       6       high, medium, low                           high, low
    Small counties (by population)       4       high, low                                   high, low
  Nonself-representing strata            4
    New England places                   1
    Other                                3       high, medium, low

Midwestern                              16                                                   (Rate of growth)
  Self-representing strata               8
    Certainty [2]                        1
    Large counties (by population)       4       high, low                                   high, low
    Small counties (by population)       3       high, medium, low
  Nonself-representing strata            8
    Large strata (by population)         4       high, low                                   high, low
    Small strata (by population)         4       high, low                                   high, low

Southern                                16
  Self-representing strata               6
    Large counties (by population)       3       high, medium, low
    Small counties (by population)       3       high, medium, low
  Nonself-representing strata           10
    Large strata (by population)         6       high, medium, low    high, low
    Small strata (by population)         4       high, low            high, low

Western                                 16
  Self-representing strata               9
    Certainty [2]                        2
    Large counties (by population)       4       high, low            high, low
    Small counties (by population)       3       high, medium, low
  Nonself-representing strata            7
    Large strata (by population)         4       high, low            high, low
    Small strata (by population)         3       high, medium, low

[1] New England is subdivided into townships rather than counties.
[2] Cook County in the Midwestern Region and Los Angeles County (2 stands) in the Western Region were selected into the sample with a probability of 1.
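The admissibility rule for patterns can be made concrete with a small sketch. It only enumerates patterns and checks each one against the "within 1 of the expected number" rule; it does not reproduce the modified Goodman-Kish assignment of pattern probabilities. The three superstrata, their PSU populations, and the control classes "X" and "Y" are hypothetical, and the function names are illustrative assumptions.

```python
from collections import Counter
from itertools import product

# Hypothetical superstrata: PSU name -> (population in thousands, control class)
superstrata = [
    {"a1": (300, "X"), "a2": (100, "Y")},
    {"b1": (200, "X"), "b2": (200, "Y")},
    {"c1": (100, "X"), "c2": (300, "Y")},
]


def expected_counts(superstrata):
    """Expected number of sample PSU's per control class when one PSU is drawn
    per superstratum with probability proportional to population."""
    expected = Counter()
    for stratum in superstrata:
        total = sum(pop for pop, _ in stratum.values())
        for pop, cls in stratum.values():
            expected[cls] += pop / total
    return dict(expected)


def is_admissible(pattern, superstrata, expected, tolerance=1.0):
    """A pattern (one PSU name per superstratum) is admissible if every control
    class count is within `tolerance` of its expected count."""
    counts = Counter(superstrata[i][psu][1] for i, psu in enumerate(pattern))
    return all(abs(counts.get(cls, 0) - e) <= tolerance for cls, e in expected.items())


expected = expected_counts(superstrata)
patterns = list(product(*[list(s) for s in superstrata]))
admissible = [p for p in patterns if is_admissible(p, superstrata, expected)]
```

With these made-up inputs each class is expected to receive 1.5 PSU's, so the two patterns that place all three PSU's in a single class are rejected and the remaining six are admissible.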
A detailed description of this controlled selection process is given in an NCHS report.18

Two control selection variables were chosen within each region for NHANES II. The variable "State group" was used in all four regions, and "percent below poverty level" was used in every region except the Northeastern, where "percent races other than white plus Hispanics" was used. Thus, the final sample of PSU's was drawn so that the sample did not appreciably overrepresent or underrepresent any State group or quartiles representing percent below poverty level or percent races other than white plus Hispanics. The control selection procedure was applied separately within the self-representing and nonself-representing superstrata in every region except the Northeastern, where the control selection was applied to the total region. The control variables used within each region are defined in table E, and the expected and actual number of PSU's selected from each control class are shown in table F. The "percent below poverty level" or "percent of races other than white plus Hispanics" classes were defined within each region by classifying approximately equal numbers of NHIS strata into quartiles.

Classifying the strata into control classes was straightforward for the self-representing strata (one PSU per stratum). The classification of the nonself-representing strata into control classes was more complicated. The PSU's within each of the NHIS strata are often not all in the same State group, "percent below poverty level," or "percent races other than white plus Hispanics." This complication was remedied by selecting a sample PSU within each of the nonself-representing strata. Within each of the original NHIS nonself-representing strata, the NHIS sample PSU was designated as the NHANES II sample PSU. In the newly defined nonself-representing strata a sample PSU was selected with a probability proportional to its size.
The sample PSU's within the strata were selected before the sample strata were selected within the superstrata. The sample PSU's within the nonself-representing strata were then used to classify the strata by State group, percent below poverty level, or percent races other than white plus Hispanics. The selected survey locations for NHANES II are shown in table G.

Selection of housing units within sample locations

The Bureau of the Census had the responsibility for selecting housing units and sample persons within each of the 64 primary locations. The Bureau of the Census was also responsible for specifying and implementing the sample design within PSU's and for oversampling the subgroups of the population of special interest.

Two sampling frames were used to select the sample of housing units within each of the PSU's. The larger frame was based on the 1970 census of the population. This frame was supplemented by a frame that contained new housing units constructed since the 1970 census.

The first stage of design within a PSU involved the selection of clusters of housing units (segments) within enumeration districts (ED's). An ED is a geographical area containing approximately 300 housing units. In order to oversample persons with low incomes, the ED's were sorted into poverty or nonpoverty strata as follows: the poverty strata contained ED's with 13 percent or more of persons below the poverty level, and the nonpoverty strata contained ED's with less than 13 percent of persons below the poverty level as determined by the 1970 census. The poverty index for households was based on 1969 income, size of family, sex of head of family, age (under 65 years or 65 years and over) of head of family, and farm or nonfarm status. A measure of size was determined for each ED by dividing the number of listed housing units in an ED by 4. Within each stratum the ED's were then selected with a probability proportional to their measure of size.
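The stratification rule and measure of size just described can be sketched in a few lines. This is a minimal illustration, not the Bureau of the Census procedure: the `EnumerationDistrict` class, the ED identifiers, and the housing-unit and poverty figures are hypothetical, and the probabilities shown are those of a single draw proportional to size within each stratum.

```python
from dataclasses import dataclass

POVERTY_THRESHOLD = 13.0  # percent of persons below poverty level (from the text)


@dataclass
class EnumerationDistrict:
    ed_id: str
    listed_housing_units: int
    pct_below_poverty: float


def stratum(ed):
    """Poverty stratum if 13 percent or more are below the poverty level."""
    return "poverty" if ed.pct_below_poverty >= POVERTY_THRESHOLD else "nonpoverty"


def measure_of_size(ed):
    """Measure of size: listed housing units divided by 4."""
    return ed.listed_housing_units / 4


def selection_probabilities(eds):
    """Single-draw probabilities proportional to measure of size, by stratum."""
    totals = {}
    for ed in eds:
        totals[stratum(ed)] = totals.get(stratum(ed), 0.0) + measure_of_size(ed)
    return {ed.ed_id: measure_of_size(ed) / totals[stratum(ed)] for ed in eds}


# Hypothetical enumeration districts (illustrative values only).
eds = [
    EnumerationDistrict("ED-1", 320, 18.5),
    EnumerationDistrict("ED-2", 280, 13.0),
    EnumerationDistrict("ED-3", 300, 6.2),
    EnumerationDistrict("ED-4", 100, 12.9),
]
probs = selection_probabilities(eds)
for ed in eds:
    print(ed.ed_id, stratum(ed), measure_of_size(ed), round(probs[ed.ed_id], 3))
```

An ED at exactly 13 percent falls in the poverty stratum, matching the "13 percent or more" wording.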
The number of ED's selected in each stratum was based on a number of factors that are described below.

According to previous experience, it was assumed that a response rate of approximately 75 percent would be obtainable in NHANES II. To examine 21,000 persons, approximately 28,000 persons needed to be selected from the sample households. A mathematical model23 was used to determine the sample size for each PSU and the optimum number to select in the poverty and nonpoverty strata within PSU's. The sample was allocated in such a way as to minimize the variance of the estimated proportion of persons below the poverty level for a fixed total sample size.

Table E. Definition of control classes used for the selection of primary sampling units, by region: National Health and Nutrition Examination Survey, 1976-80

1st variable: State group. 2nd variable: quartile (1 Lowest, 2 Low-middle, 3 Middle-high, 4 Highest).

Northeastern (2nd variable: percent races other than white plus Hispanics)
  A  Connecticut, Maine, Massachusetts, New Hampshire, Rhode Island, Vermont
  B  New York
  C  New Jersey, Pennsylvania

Midwestern (2nd variable: rate of growth and percent below poverty level)
  A  Ohio
  B  Indiana, Michigan, Wisconsin
  C  Illinois
  D  Minnesota
  E  Iowa, Missouri

Southern (2nd variable: percent below poverty level)
  A  Delaware, District of Columbia, Maryland, Virginia
  B  Kentucky, Tennessee, West Virginia
  C  Alabama, Arkansas, Louisiana, Mississippi
  D  Georgia, North Carolina, South Carolina
  E  Florida

Western (2nd variable: percent below poverty level)
  A  California
  B  Oregon, Washington
  C  Texas
  D  Arizona, Colorado, Idaho, Montana, Nevada, New Mexico, Oklahoma, Utah, Wyoming, Alaska, Hawaii
  E  Kansas, Nebraska, North Dakota, South Dakota

Table F. Expected and actual number of sample primary sampling units (PSU's) within control classes, by region and type of stratum

[The control classes are defined in table E. The expected number of PSU's in a control class is based on its population]

Region and type of stratum; State group (A C D E); quartiles representing percent below poverty level or percent races other than white plus Hispanics (1 2 3 4)

Northeastern1
  Expected number of PSU's ......... 3.86 5.56 6.58 ... ... 4.42 3.66 3.97 3.94
  Actual number of PSU's ........... 4 7 ... 4 4 4 4
Midwestern
  Self-representing strata2:
    Expected number of PSU's ....... 1.93 2.71 0.80 0.57 0.99 1.05 2.73 2.38 0.84
    Actual number of PSU's ......... 2 1 1 1 1 2 3 1
  Nonself-representing strata:
    Expected number of PSU's ....... 1.17 3.57 0.65 0.84 1.76 2.05 1.86 2.02 2.07
    Actual number of PSU's ......... 1 1 1 1 2 2 2 2
Southern
  Self-representing strata:
    Expected number of PSU's ....... 1.94 0.72 0.95 1.02 1.37 1.61 1.57 1.54 1.28
    Actual number of PSU's ......... 2 1 1 2 2 2 1 1
  Nonself-representing strata:
    Expected number of PSU's ....... 1.18 2.45 2.83 2.82 0.72 2.44 2.57 2.46 2.53
    Actual number of PSU's ......... 1 3 3 2 3 2 3
Western
  Self-representing strata2:
    Expected number of PSU's ....... 3.16 0.84 1.55 1.26 0.19 2.01 1.76 2.09 1.15
    Actual number of PSU's ......... 3 1 1 1 2 2 2 1
  Nonself-representing strata:
    Expected number of PSU's ....... 0.82 0.98 1.92 2.16 1.12 1.80 1.81 1.73 1.65
    Actual number of PSU's ......... 1 2 2 1 2 2 1 2

1 Self-representing and nonself-representing strata combined for control selection.
2 Excludes self-representing superstrata from the National Health and Nutrition Examination Survey, 1976-80.

The allocation procedure employed produced a sample that varied in expected sample size from 281 to 781, with an average of 437 persons per PSU. All but 11 of the sample sizes were within the operationally acceptable range of 300 to 600 sample persons.
To conform to the design specifications, the expected sample size for each of these PSU's was adjusted to fall between 315 and 585 persons. The average ratio of the sampling rate within the poverty stratum to the sampling rate within the nonpoverty stratum was 2.3. This ratio ranged from 1.48 to 5.01 across the sample PSU's, with 90 percent of the ratios being between 1.5 and 3.0.

The households within each ED were clustered into segments in order to reduce the expense of interviewing within ED's. Results from previous surveys had indicated that a cluster of eight listed addresses would provide an adequate design. To further insure the sampling reliability, clusters of 16 listed addresses were drawn from the sampling frames and then systematically subsampled at a rate of 1 out of 2 to produce a final segment of eight address listings.

Using the survey specification that approximately one person should be examined per household (see the next section for the household sampling procedure), the expected number of segments needed within each PSU was determined by dividing the PSU sample size by 8. The segments were drawn separately from within the poverty and nonpoverty strata. A systematic sample of segments was then selected across all ED's, with no more than one segment being selected per ED. The new construction frame was sampled at the same rate as the nonpoverty stratum.

Several factors were used to decide the sample size within each PSU. The sample size needed in each PSU was a function of the age distribution within the PSU, the proportion of the population below the poverty level, the expected number of vacant and other types of ineligible units, the expected number of refusals, and the expected number of persons in group quarters. Since the census information did not include the number of persons per segment and was out of date, an additional 15 reserve segments were drawn for each PSU as a precautionary measure.
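The arithmetic in this part of the design can be checked with a short sketch. It assumes, for illustration only, that the adjustment to the 315-585 range was a simple clamp and that the segment count was rounded up; the text specifies neither detail, and the function names are hypothetical.

```python
import math


def persons_to_select(target_examined, response_rate):
    """Persons who must be selected to reach the target at a given response rate."""
    return target_examined / response_rate


def adjust_expected_size(size, low=315, high=585):
    """Assumed clamp of a PSU's expected sample size into the 315-585 range."""
    return max(low, min(high, size))


def segments_needed(psu_sample_size, persons_per_segment=8):
    """PSU sample size divided by 8; rounding up is an assumption."""
    return math.ceil(psu_sample_size / persons_per_segment)


def subsample_cluster(addresses, start):
    """Systematic 1-in-2 subsample of a 16-address cluster (start is 0 or 1)."""
    return addresses[start::2]


print(persons_to_select(21000, 0.75))           # 28000.0
print(adjust_expected_size(281), adjust_expected_size(781))
print(segments_needed(437))                      # segments for an average PSU
print(subsample_cluster(list(range(1, 17)), 0))  # eight of sixteen addresses
```

With the average expected size of 437 persons, this gives 55 regular segments per PSU, to which the 15 reserve segments would be added.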
These segments were drawn from both poverty and nonpoverty strata.

Table G. Primary sampling units, stand sites, and percent of persons examined, by region: National Health and Nutrition Examination Survey, 1976-80

Primary sampling units within regions; stand site; percent of persons examined

United States .......................... 64    73.1

Northeastern ........................... 16    67.4
  Bronx, NY ............................ New York City1    61.8
  Westchester, NY ...................... New York City1    51.4
  Manhattan, N.Y. ...................... New York City1    56.7
  Bergen, N.J. ......................... Patterson-Clifton-Passaic1    63.6
  Allegheny, Pa. ....................... Pittsburgh1    60.4
  Mercer, N.J. ......................... Trenton    70.5
  Montgomery, Pa. ...................... Philadelphia1    57.8
  Union, N.J. .......................... Newark1    61.9
  Erie, Pa. ............................ Erie1    77.4
  Orange, N.Y. ......................... Middletown1    70.8
  Norfolk (part), Mass. ................ Boston1    58.0
  Hartford (part), New Haven (part), Conn. ... New Britain,1 Meriden1    69.2
  Cumberland (part), Maine ............. Portland1    70.8
  Lycoming, Pa. ........................ Williamsport    79.0
  Delaware, N.Y. ....................... Oneonta    79.5
  Bristol (part), Norfolk (part), Mass. ... Pawtucket    74.8

Midwestern ............................. 16    73.7
  Cook, Ill. ........................... Chicago1    54.8
  Wayne, Mich. ......................... Detroit1    71.4
  Hamilton, Ohio ....................... Cincinnati1    73.2
  Marion, Ind. ......................... Indianapolis1    70.7
  Hennepin, Minn. ...................... Minneapolis-St. Paul1    79.3
  Montgomery, Ohio ..................... Dayton1    74.2
  Lake, Ill. ........................... Chicago1    65.8
  Polk, Iowa ........................... Des Moines1    73.0
  Dakota, Minn. ........................ Minneapolis-St. Paul1    83.7
  Racine, Wis. ......................... Racine    78.1
  Greene, Monroe, Ind. ................. Bloomington    78.5
  Coles, Cumberland, Ill. .............. Mattoon    74.3
  Ionia, Montcalm, Mich. ............... Greenville    80.6
  Richland, Ohio ....................... Mansfield1    74.8
  Cheboygan, Emmet, Mich. .............. Cheboygan    78.5
  New Madrid, Stoddard, Mo. ............ Baxter    73.6

Southern ............................... 16    73.8
  (PSU name not legible) ............... Atlanta    70.6
  Hampton (city), Va. .................. Newport News-Hampton1    79.3
  Dade, Fla. ........................... Miami1    72.8
  District of Columbia ................. Washington, D.C.1    68.7
  Caddo, La. ........................... Shreveport1    71.4
  Brevard, Fla. ........................ Cocoa    74.2
  Poinsett, Ark. ....................... Marked Tree    84.7
  Bledsoe, McMinn, Meigs, Rhea, Tenn. .. Athens, Pikeville    71.4
  Blount, St. Claire, Ala. ............. Oneonta, Pell City    73.3
  Hardin, Larue, Nelson, Ky. ........... Elizabethtown, Bordstown    76.0
  Greene, Harrisonburg (city), Rockingham, Va. ... Harrisonburg    70.4
  Lafayette, La. ....................... Lafayette1    69.2
  Floyd, Johnson, Magoffin, Ky. ........ Saylersville, Prestonburg    69.1
  Craven, Pitt, N.C. ................... Greenville, New Bern    76.0
  Banks, Hall, Towns, White, Ga. ....... Gainesville, Cleveland    74.5
  Cherokee, York, S.C. ................. Rock Hill    78.6

Western ................................ 16    77.4
  Harris, Tex. ......................... Houston1    65.2
  Santa Clara, Calif. .................. San Jose1    74.2
  Honolulu, Hawaii ..................... Honolulu1    71.8
  San Diego, Calif. .................... San Diego1    73.4
  Pierce, Wash. ........................ Tacoma    80.4
  Sedgwick, Kans. ...................... Wichita1    76.7
  Fresno, Calif. ....................... Fresno    82.8
  Linn, Oreg. .......................... Albany    84.1
  Potter, Randall, Tex. ................ Amarillo1    79.7
  Yolo, Calif. ......................... Woodland    82.6
  Laramie, Wyo. ........................ Cheyenne    83.4
  Bingham, Idaho ....................... Blackfoot    88.4
  Hickory, St. Clair, Mo. .............. Osceola    75.8
  Parmer, Tex. ......................... Bovena    85.4
  Los Angeles (part), Calif. ........... Los Angeles1    62.4
  Los Angeles (part), Calif. ........... Los Angeles1    69.5

1 1970 standard metropolitan statistical area containing the survey location. Some of the SMSA's have been redefined since 1970.

Because of the complexity of the examination survey and the logistical arrangements that had to be planned in advance, the number of persons selected for examination had to be carefully controlled.
A sequential sampling procedure known as "Perkins' Stop Rule" was used to insure that the number of persons selected in each PSU was within 15 of the expected number of sample persons. Perkins' Stop Rule, as described in a Bureau of the Census publication,24 is an unbiased procedure for determining both the number of reserve segments to use in each PSU and when to stop interviewing sample persons within selected households. Since the expected number of persons in each PSU is between 315 and 585, the stop rule also insures that the actual number of sample persons in each PSU is between 300 and 600. For NHANES II, the number of sample persons ranged from 306 to 598 with an average of 334 per PSU.

Selection of sample persons

After the sample segments had been identified and assigned to interviewers, a sample of persons to be examined from individual households was selected. The sample was selected so that young and old age groups were oversampled and so that approximately one person was selected per household. The Bureau of the Census evaluated a number of alternative subsampling schemes within the household with respect to these objectives. The subsampling procedure that best satisfied both of these survey objectives was one that selected 3 out of every 4 persons who were 6 months through 5 years of age or 60 years through 74 years of age and 1 out of every 4 persons who were 6 through 59 years. The sample person selection sheet is shown in figure 2.

Once in the household, the interviewer listed everyone who lived in the household in a specified order. The number of persons within each age group was indicated, and letter codes were used to select persons from each of the three age groups for the sample. The letters used to sample persons from each age group are shown in figure 2. After a random start, 64 three-letter combinations were systematically assigned to the household questionnaires for each PSU in the Bureau of the Census regional office.
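The within-household rates described above (3 in 4 for the youngest and oldest age groups, 1 in 4 for the middle group) can be sketched as position rules applied to the ordered household listing. This is a hedged illustration: the function names are hypothetical, and the numeric offsets stand in for the letter codes on the selection sheet, whose exact letter-to-position mapping is not reproduced here.

```python
def select_three_of_four(n_listed, skipped_position):
    """Select every listed position except one in each cycle of four.

    skipped_position (1-4) is the position skipped within each cycle.
    """
    return [p for p in range(1, n_listed + 1) if (p - skipped_position) % 4 != 0]


def select_one_of_four(n_listed, start_position):
    """Select one listed position in each cycle of four, beginning at start_position."""
    return [p for p in range(1, n_listed + 1) if (p - start_position) % 4 == 0]


# Hypothetical household: 1 child under 6, 2 persons aged 6-59, 1 person aged 60-74.
young = select_three_of_four(1, skipped_position=4)   # code that takes the 1st listed
middle = select_one_of_four(2, start_position=2)      # code that takes the 2nd listed
older = select_three_of_four(1, skipped_position=1)   # code that skips the 1st listed

print(young, middle, older)
print("sample persons in household:", len(young) + len(middle) + len(older))
```

With these assumed codes the household yields two sample persons: the only child and the second-listed person aged 6-59; the one person aged 60-74 is not selected because the assumed code skips the first listed position.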
Three letters were circled on each questionnaire before it was assigned to an interviewer. For example, suppose that the letters "A," "K," and "W" were circled on the household questionnaire for a family of four: one baby 9 months old, two adults of ages 30 and 31, and one adult aged 66. The number of persons in each of the three age groups (see figure 2) is 1, 2, and 1, respectively. The letters "A," "K," and "W" indicate that the interviewer should select the first person in the age group 6 months to 5 years, the second person listed in the 6-59 years age group, and the second person in the 60-74 years age group, as sample persons. In the example, since there was no second person listed in the 60-74 years age group, the 9-month-old son and the 31-year-old wife were selected as sample persons for the examination.

[Figure 2. Example of a sample person selection sheet used in the National Health and Nutrition Examination Survey, 1976-80. The sheet lists household members in order and, for each letter code, the listed positions to select. For persons 6 months-5 years and persons 60-74 years, each code selects six of every eight positions (1st, 2nd, 3rd, 5th, 6th, 7th; 2nd, 3rd, 4th, 6th, 7th, 8th; 1st, 3rd, 4th, 5th, 7th, 8th; or 1st, 2nd, 4th, 5th, 6th, 8th). For persons 6-59 years, each code selects one of every four positions (1st, 5th, 9th; 2nd, 6th, 10th; 3rd, 7th, 11th; or 4th, 8th, 12th).]

Operational plan

Stand sequencing and scheduling

As in previous cycles of NHES and NHANES, the scheduling of stands (examination locations) for NHANES II was arranged so that the North was avoided in winter. This was done because of operational problems that would otherwise have resulted. To the extent that any of the items of data collected by the survey were subject to seasonal variation, this procedure may have resulted in some bias, but since the survey was designed to measure the prevalence of chronic conditions rather than acute manifestations of conditions, seasonal variation was not considered to be a major factor.

Another important consideration in the sequencing of stands was economy in operation. Efforts were made to insure the minimum amount of travel by sequencing examination locations with regard to geographic proximity.

At each location, the regular procedure involved the following sequence of advance arrangements: U.S. Bureau of the Census interviewing in the household, mobile exam center setup, dry-run examinations, and, finally, follow-back with sample persons by Health Examination Representatives when indicated, and regular examinations of the sample persons.
The number of weeks allotted for examinations was dependent upon the expected sample size at a particular stand but varied between 4 and 6 weeks.

Advance contacts and logistics

Before household interviewing could begin in a sample area, contacts with professionals and the public and logistical arrangements were necessary. It was the policy of the survey to contact the Public Health Service representatives in the Department of Health and Human Services (formerly the Department of Health, Education, and Welfare) regional offices, the State and local health authorities, and the medical, dental, and osteopathic professional organizations in the States and communities. This was done to acquaint them with the NHANES objectives and methods of operation, including the local schedule of operations. School officials were also notified because of the necessity of requesting release from school for the examination of school children. This notification usually consisted of a letter announcing the survey, the local areas to be sampled, and the dates of survey operations, along with a brochure describing the survey, mailed 2 months before examinations were scheduled to begin. The letters to local health authorities included a request to provide NHANES with a listing of local and State health agencies and clinics to which NHANES examinees who did not have current medical resources and who required medical care could be referred, or to which a report of the examination findings could be sent. Personal visits by NHANES medical staff were made to any health agencies or societies requesting them.

A general news release explaining the program was prepared for each sample area and distributed to local news media. The release was timed to coincide with the start of the Bureau of the Census interviewing. As a result, local newspapers at most of the locations published items concerning the program.
Special efforts were also made to obtain television and radio publicity for the survey. Any pictures taken for these efforts used NHANES staff as subjects, because pictures of examinees would have involved a loss of confidentiality.

Sample households with known addresses were sent an "advance" letter by the Bureau of the Census several days before interviewing began. This letter informed the household members that a Bureau of the Census interviewer would call at their home within the next few days in connection with a survey being conducted in the area by the Public Health Service.

Six to eight weeks before the start of examinations at a particular location, a member of the NHANES field staff, the Field Operations Manager, visited the sample area to make physical arrangements for the mobile examination center and the administrative offices, to meet personally with local health and school officials, and to initiate the many logistical actions required for the survey. Selection of a site for the mobile examination center and arrangements for electricity, water, sewerage, telephone, and transportation services were also made on this initial visit to the area.

Household interviewing and appointment process

Trained Bureau of the Census personnel conducted the household interviews to obtain household composition, demographic, and other data. At this initial visit the census interviewer determined which members of the household were to be selected for inclusion in the sample. The census interviewer explained the survey, asked a series of medical history questions of the prospective examinees, and made appointments for the selected sample persons willing to come in for the examination. As an incentive to participate in the examination, the sample persons were told that they would receive $20 for any inconvenience caused them because of their participation.
The census interviewer also obtained written consent for the examination of minors and written authorization to obtain additional information from the records of physicians, hospitals, schools, and State registry offices. The census interviewer informed sample persons that reports of significant findings would be sent to their physicians or clinics if they so desired.

An individual who did not make an appointment at the time of the visit by the census interviewer was subsequently visited by a Health Examination Representative, who explained the program more fully, using photographs and a film strip. The Health Examination Representative answered any questions about how the sample was selected or the examination conducted and about what was included in the examination. Points that were stressed included personal benefit to be derived from the examination, contributions to medical research, and civic pride. In addition, it was stressed to sample persons that they were statistically chosen for the survey and no one else could be substituted for them. By carefully explaining details of the examination, the representative attempted to allay any fears or anxieties about it. This additional effort resulted in scheduling for examination many of the persons from whom the census interviewer had been unable to obtain appointments.

The typical weekly examination schedule called for five morning sessions (including Saturday), three afternoon sessions (including Saturday), and two evening sessions. Individuals receiving the glucose tolerance test were scheduled for the morning sessions only.

Sample persons could elect to drive themselves to the examination center, but use of a taxi for which arrangements had been made was encouraged. Transportation costs were paid by NHANES under either arrangement. Appointments for persons who for one reason or another had canceled or broken their appointments or who had not been available for taxi pickup at the scheduled time were rescheduled if possible. Any necessary rescheduling was accomplished by the health representative as soon as possible, preferably the same day, a policy that helped reinforce in the sample persons' minds the importance placed on their participation.

Examination center and staff

As in the previous examination programs, examinations were carried out in specially designed mobile examination centers (figure 3), which were moved from location to location in a predetermined fashion so that a sample of the civilian noninstitutionalized population was administered a standardized set of questions, examinations, and laboratory tests in comparable settings by a fully trained staff.

Each mobile examination center consisted of three trailers, each 45 feet long and 8 feet wide. The sets of trailers constructed for NHANES I had been refitted with some interior modifications and used for NHANES II. They were set up side by side on a level hard surface area and connected by enclosed passageways. The trailers themselves were then further leveled to enable connection of the plumbing and proper alinement of the passageways. Heating and air-conditioning units helped provide a standardized environment in which to conduct the examinations and perform laboratory procedures.

[Figure 3. Mobile examination center]

For NHANES II the trailer setup was as follows: The first trailer contained the waiting room where the sample persons were checked in by a coordinator.
The coordinator's main function was to assign the examinees to the staff members conducting different parts of the examination in such a way as to minimize the examinees' total waiting time. To the side of the waiting room were two small rooms used for dietary interviews. Another slightly larger room in this trailer was used for administering the allergy test and conducting health interviews. A laboratory was equipped with a Coulter Counter, a hemoglobinometer, an incubator, a microhematocrit centrifuge and reader, a centrifuge, a refrigerator and freezer, a microscope, and a laminar flow table. The room where respiratory testing was done was located next to the laboratory and contained a spirometer, a two-channel paper recorder, and an oscilloscope. The spirometer was connected to a Marquette electrocardiogram recorder located in the third trailer.

The second trailer had an X-ray room containing an X-ray machine, reciprocating buckey, and table. This room was used for chest, back, and neck X-rays. Adjoining the X-ray room was a dark room. An X-omat for developing X-ray film automatically was in an open space adjacent to the dark room. The walls of the open space contained X-ray viewing boxes. The second trailer also contained one of the two washrooms used for dressing and obtaining urine specimens. In the second trailer there were two other rooms. One of these rooms contained an examining table and a mercury sphygmomanometer, and the other a table and equipment for drawing blood.

The third trailer contained a soundproof room used for hearing tests. At test frequencies, the background noise level was below 35 decibels relative to American Standards Association audiometric zero (National Bureau of Standards). This room contained an audiometer with masking capability and earphones for pure-tone audiometry. It also contained a Revox tape deck, a condenser microphone, and a playback machine for the Stephens Oral Language Screening Test.
Adjoining the audiometry room was a washroom. Another room contained the Marquette electrocardiogram recorder and a table. Electrocardiograms as well as spirometries were recorded on tape there. The final examination room was the body-measurement room. It contained a large and very accurate weight scale, a set of calibration weights, a device for measuring heights, an examining table for measuring sitting heights, and a variety of anthropometric instruments. The third trailer also included a staff room. There was storage space both within and under the trailers.

The field staff necessary to carry out the operation of the survey consisted of three groups. The first one was the team of census interviewers and their supervisor. The second group consisted of administrative staff and Health Examination Representatives. The usual complement was a field operations manager, field management assistant, one or two local part-time employees, and five Health Examination Representatives. The third group was the examining staff, operating within the mobile examination center, consisting of a physician, a nurse, two dietary interviewers, three health technicians, two laboratory technicians, and a coordinator. Everyone on the examining staff had been thoroughly trained to conduct the standardized procedures. All the field staff except the physician were civil service employees; the physicians were employed on long-term personal services contracts. The administrative staff was responsible for all procedures involved in processing examinees prior to their entry in the exam center. The health technicians conducted most of the testing, including X-rays, electrocardiograms, body measurements, spirometries, audiometry, the allergy exam, and the administration of questionnaires. The laboratory technicians performed all the laboratory work that had to be done on site, including preparation of blood and urine specimens for shipment.
The nurse was mainly occupied with drawing blood.

Examination process and medical reports

Each examinee was assigned to whatever examiner happened to be free at the time. However, certain restrictions were built into the examination. For example, since oral glucose intake induces changes in electrocardiogram patterns, the electrocardiogram had to be done before the glucose tolerance test. Similarly, because of a possibility that an occasional allergy test might affect pulmonary function, spirometry was done before the allergy test. The requirement of a concentrated urine for microscopic examination necessitated urine collection before the glucose tolerance test. It was also desirable to expedite blood samples in order not to stretch out the laboratory work day unduly.

A report of medical findings, including laboratory results, was sent to the examinee's personal physician or other source of medical care designated by the examinee. Any condition that in the opinion of the examining physician required immediate medical attention was immediately reported by phone to the personal physician or medical care facility designated by the examinee. A chest X-ray and a copy of the electrocardiogram were sent with the report. Some findings were not included on the regular report because they were not available at the time the report was mailed. For example, the back and neck X-rays were read by three rheumatologists at a later time, so the results of their assessment were not immediately available. If some degree of pathology was found, these results were reported to the examinee's source of medical care when they became available.

Quality control

Measurement error, an important concern in any survey, was even more so in one as complex as NHANES. Minimizing measurement error required a considerable amount of careful effort.
Before the collection of data, it was necessary to define precisely what was to be measured and to describe clearly how the measurements were to be taken. Before the survey began, the NHANES staff, assisted by advisers, delineated the necessary definitions and instructions, which were incorporated into a staff instruction manual covering all procedures. Intensive specialized training was given to all staff members in the specific procedures performed by them in the survey. Periodic retraining was provided in order to achieve consistency over the entire survey period.

An important requirement for quality control is the proper calibration of instruments. Among the instruments calibrated were the spirometers, audiometers, earphones, electrocardiogram recorders, speech recording equipment, laboratory equipment, scales, and body measurement equipment. The instruments were calibrated at different intervals, that is, with each examination, daily, weekly, or before the beginning of each stand location. Calibration of a particular instrument might be done in more than one fashion: for example, the spirometer was calibrated both electronically and pneumatically. Calibration of the audiometers was done both in the field and also more thoroughly at a central laboratory to which they were sent on a rotating basis.

Preventive maintenance was also quite important in keeping the equipment running properly. Prompt repair of the instruments was essential in order to avoid excessive loss of data. The staff biomedical engineer was invaluable in providing for the proper functioning of the equipment. The engineer also played a major role in designing the equipment setup, arranging for its installation, and working out any difficulties that developed in the system.

Several methods were used to obtain adequate quality control.
For certain procedures such as those involved with height, weight, X-rays, spirometry, electrocardiographs, and speech, “hard documents” were produced, the quality of which could be evaluated and the significance assessed at a central location. For example, X-ray films were evaluated for readability, interpreted by expert readers, and subjected to replicate readings.

Replicates involved having the same part of the examination, for example, body measurements, performed independently at different times by two observers. Another more experienced observer, such as a supervisory technician, could be used as the standard. Replicates were a powerful tool in demonstrating interobserver differences. For biochemistry tests, replicates took the form of a duplicate pair of specimens being sent, one of them under a “dummy” number, to the same laboratory.

Another method of quality control in the evaluation of the different procedures was to compare mean values and frequency distributions by stand location and by individual observers. If there was an unusual set of results in one location, this could be investigated. Similarly, if one of the technicians consistently obtained higher or lower values than the others, this could also be investigated.

All recording forms were reviewed by the examining staff before the examinees left in order to detect errors such as omission of data. Samples of the forms were checked again, more thoroughly, at headquarters. If the staff was making a systematic error, it could be detected, and proper remedial action taken.

The performance of some of the field staff could also be checked by tape recordings. At every location, each dietary interviewer recorded two complete interviews on randomly selected subjects. The recorded interviews were evaluated later at headquarters for adherence to established procedures.
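The comparison of mean values by individual observer described in this section can be illustrated with a minimal sketch. It is not the procedure NCHS used; the function names, the tolerance, and the readings are all hypothetical and serve only to show how a technician who runs consistently high or low relative to the others might be singled out for investigation.

```python
from collections import defaultdict
from statistics import mean


def observer_means(records):
    """Return {observer: mean value} from (observer, value) pairs."""
    by_observer = defaultdict(list)
    for observer, value in records:
        by_observer[observer].append(value)
    return {obs: mean(vals) for obs, vals in by_observer.items()}


def flag_observers(records, tolerance):
    """List observers whose mean differs from the mean of all other
    observers' readings by more than `tolerance` (a hypothetical limit)."""
    means = observer_means(records)
    flagged = []
    for obs, obs_mean in means.items():
        others = [value for o, value in records if o != obs]
        if others and abs(obs_mean - mean(others)) > tolerance:
            flagged.append(obs)
    return sorted(flagged)


# Made-up readings for illustration only; not survey data.
records = [
    ("tech_A", 120), ("tech_A", 122), ("tech_A", 118),
    ("tech_B", 121), ("tech_B", 119), ("tech_B", 120),
    ("tech_C", 131), ("tech_C", 129), ("tech_C", 130),
]

flagged = flag_observers(records, tolerance=6.0)
print(flagged)  # ['tech_C']
```

The same grouping could be keyed on stand location instead of observer. A flagged group is only a prompt for investigation, since a real difference between the people examined could produce the same pattern.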
Retention of a reserve container of serum provided an opportunity for repeating and possibly correcting biochemical assessments. If an error was detected in the processing of a batch of serum, or an unusual value was observed, a reserve supply of serum was available for many sample persons to provide analytical results, either to replace the unsatisfactory data or to verify the unusual value.

In all laboratories to which specimens had been sent for analysis, standard quality control procedures were used. These included blind quality control specimens from known control pools. For quality control samples, several statistics were produced, including trend lines, plots, means, and standard deviations. Known test materials were used; and all reagents, calibrations, and the like were logged. Determinations were repeated for specimens showing extreme values.

A useful procedure for quality control of laboratory data was implemented in 1978. This procedure was as follows: from a frequency distribution of values, the value closest to the 75th percentile was selected. For example, suppose fasting blood glucose data showed .246 of the population with values of 98 or over. In a run of 13 specimens, if one were to find 9 specimens with values of 98 or over, the chance of this happening according to the cumulative binomial distribution is .0009. This is quite unlikely, and the matter would be carefully looked into. A similar procedure was followed with a low cutoff value at or near the 25th percentile. In fact, the glucose determinations showed only four runs with a probability of less than .01 out of a total of 240 (including both high and low cutoffs). Since on a chance basis five runs might have been expected, this suggested that the procedure was in control during this period.

A major effort was made, as in the NHES surveys, to control and reduce the magnitude of the nonresponse.
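The cumulative binomial figure quoted for the laboratory run check under quality control can be reproduced with a short sketch. It assumes that "cumulative" means the probability of 9 or more specimens out of 13 at or above the cutoff, which is the reading that yields .0009. The function names are hypothetical, and the .01 threshold is taken from the text's count of runs with a probability below .01.

```python
from math import comb


def upper_tail(n, k, p):
    """P(X >= k) for X ~ Binomial(n, p)."""
    return sum(comb(n, j) * p**j * (1 - p) ** (n - j) for j in range(k, n + 1))


def run_is_suspect(n_beyond, n, p_beyond, alpha=0.01):
    """True when at least `n_beyond` of `n` specimens beyond the cutoff
    would occur by chance with probability below `alpha`."""
    return upper_tail(n, n_beyond, p_beyond) < alpha


# Figures from the text: .246 of the population at 98 or over,
# and 9 such specimens in a run of 13.
prob = upper_tail(13, 9, 0.246)
print(f"{prob:.4f}")  # 0.0009
print(run_is_suspect(9, 13, 0.246))  # True
```

For the low cutoff, the same functions apply with the count of specimens at or below the cutoff and the corresponding population proportion.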
If the nonrespondents in a survey differ from respondents with respect to the measurements being made, the survey results will be biased. The potential for a nonresponse bias is much greater when response rates are low. A number of steps taken to reduce nonresponse in NHANES II have already been discussed. The size of the primary sampling units was reduced primarily to decrease the logistical problems of sample persons coming to the mobile examination centers. Much of the advance publicity was directed to improving the overall response rate in a community. The extra efforts of the Health Examination Representatives to schedule appointments and to arrange transportation to the Mobile Examination Centers were very important in the achievement of acceptable response rates.

Several reports have been written that discuss cooperation in National Health Examination Surveys and the factors related to response.25-28 The response rates for both NHANES I and NHANES II were between 70 and 75 percent, lower than the response rates obtained in previous NCHS examination surveys. Concern over the lower response rate in the NHANES programs resulted in two studies’ being conducted to determine the effect of paying respondents to participate in NHANES. The first study was conducted in San Antonio, Tex., in 1972. The findings from that study showed that the offer of a payment of $10 to sample persons to participate in NHANES significantly improved the response rate.29 As a result of that study, a payment of $10 was routinely offered to all sample persons for participating in the examination.

A second study on the effects of remuneration to sample persons was conducted in two locations in 1978. A slightly more elaborate design was used to study the relationship between the amount of the payment offered sample persons to participate in the examination and the number of sample persons in the household.
The results showed that the total amount of remuneration in a household had a significant positive effect on response.30

Pilot testing

Pilot testing was much shorter in NHANES II than in NHANES I. The first pilot test was in Atlanta, Ga., from November 17 through December 19, 1975. Center for Disease Control personnel and their families were the examinees. The location was next to the Center for Disease Control in order to have ready access to assistance in carrying out the complicated laboratory procedures. The second pretest was held in another part of the Atlanta metropolitan area from January 21 through February 12, 1976, using a population sample of the area selected by the U.S. Bureau of the Census. The NHANES II survey began examinations at its first regular location in Miami, Fla., on February 19, 1976.

Plans for analysis and publication of data

Producing reports of findings involves the following steps:

• Sometimes, as with X-rays, there must be further processing to produce the data unit that is to be tabulated. This type of processing is done under contract concurrently with data collection if resources permit.
• Data must be reduced to machine-readable form.
• Data must be edited and validated.
• Data must be analyzed.
• Reports must be written, edited, and printed.

In addition, before any analysis can take place, the sampling weights, that is, the designated number of people a sample person represents in the population, must be determined. For selected measures, imputation procedures for item nonresponse must be developed and reviewed by consultants.

The procedure used before 1977 was to allot a certain number of years after completion of a survey in which NHANES analytical staff could publish series reports based on the survey. After that, a set of computer tapes containing the edited data was prepared for the use of outside investigators in universities, other government agencies, and so forth.
The procedure used since 1977 has been to release for outside use all completely edited, validated, and documented tapes, whether or not NCHS has published reports based on the data. It was planned to have a series of edited tapes containing the NHANES II data available for purchase from 1 to 2 years after completion of the NHANES II survey.

In general, descriptive, analytical, and methodological reports are published by the National Center for Health Statistics in Vital and Health Statistics, series 1, 2, and 11. To a lesser extent, information is made available in journal articles and in papers presented at professional meetings. The reports are written by NCHS staff, staff of Federal agencies collaborating on data collection, and experts who are not Federal employees. In addition, to expedite publication of more detailed analyses of selected topics covered in the data collection, NCHS plans to support to a limited extent competitively awarded contractual analyses and report-writing efforts. A limited number of special tabulations and analyses are furnished on request to various individuals and groups both inside and outside the Government.

Procedures and methods manuals are made available upon request about a year after the surveys are completed or concurrently with the release of microdata tapes. In this way the data can be evaluated, and the methodology employed by NCHS in NHANES can be utilized by others.