Main

During the course of the past two centuries in many industrialized countries, a striking increase of mean stature and an earlier sexual maturation, usually called positive secular growth change, has been observed. It is generally assumed that this secular change is elicited by a change of environmental conditions, in particular by removing factors that had blocked full expression of the biologic potential, such as infectious diseases, inadequate nutrition, poverty, and suffering (1, 2). Growth of a population can therefore be used as a “mirror of conditions in society” (3). A positive secular change is assumed to reflect improvements in the nutritional, hygienic, and health status of a population (4).

The question arises how long favorable environmental conditions have to be around, before the maximum biologic stature is reached in a population. In the Netherlands, a positive secular growth change has been documented since 1858 (preceded by a period of economic growth), interrupted by relatively short periods of a diminished height gain or even height loss during agricultural crises, the world economic crisis in 1930, and World War II (5). Before 1955, secular growth changes were mainly estimated from data on conscripts and other nonrandom samples from the population. In 1955, 1965, and 1980, however, large cross-sectional nationwide growth studies were performed, showing that young Dutch adults belonged to the tallest on earth (68). In this study, we present growth reference data for length/height, weight, head circumference, and secondary sex characteristics from the fourth nationwide growth study in 1997, using a methodology similar to the previous studies, and compare the results of all four nationwide growth studies, to assess whether any further secular growth change has occurred.

METHODS

Subjects.

Data collection for the first nationwide growth study in the Netherlands took place in 1952–1956 and included 16 910 children (6). The second study was performed in the period 1964–1966 and included 54 776 children (7). The third study, conducted in 1978–1980, consisted of 42 000 children (8). In the fourth study, a total of 14 500 children, 7 482 boys and 7 018 girls, were included. They were measured in 1996–1997. In a subgroup of 2 524 boys and 3 028 girls, pubertal stages were also determined. The study was approved by the institutional ethical review board.

Details of the first three studies have been published previously (68). Exclusion criteria for the fourth study were similar to those for the previous studies: children with diagnosed growth disorders and those on medication known to interfere with growth were excluded from the sample (n = 108). Children of non-Dutch parents were also excluded, except if one parent was Dutch and the other West European. In contrast to previous growth studies, infants with a birth weight <2500 g were included for the development of the reference charts, but excluded for assessing the difference in height with the previous growth studies.

The sample was stratified by province, municipal size, sex, and age according to the geographical distribution based on nationwide demographic data (9). The provinces were clustered into four geographical regions. A fifth region was formed by the cities of Amsterdam, Rotterdam, Utrecht, and the Hague.

The planned sample size (16 188 children, represented by 226 boys and 226 girls for each age group) was almost achieved (n = 14 500). In the first year of life, infants were divided into six age groups (n = 2 438), in the second year, divided into four age groups (n = 1 604), and in the third year, 918 children were divided into two age groups. At ages 3–8 y, measurements were taken at average ages of 3.9, 5.5, and 7.5 y (n = 1 677). For 9–20 y of age, the intervals were 0.5 y (n = 7 863). Until 4 y of age, measurements were mainly performed during the regular periodical health examinations by instructed health professionals in 24 Well Baby Clinics. From the age of 4 y onward, children were measured at 25 offices of Municipal Health Services during regular preventive health assessments (at mean ages 5.5 and 7.5 y) or after receiving a personal invitation on the basis of a stratified sample from the Municipal Register Office (≥9 y of age).

The calculation of the planned sample size was based on the aim to detect a 1.8-cm height difference (p = 0.05) between the 1997 and 1980 studies (with a power of 99%). As in earlier growth studies, difficulties were encountered in collecting a sufficiently large sample of adolescents and young adults. The nonresponders (those who refused to come to the health clinic or refused a measurement) varied from 20% in 11 y olds to >60% in 17 y olds. To detect any selection bias , we approached a randomly sampled subgroup of 230 adolescents and young adults from the nonresponder group. In total, 170 answered a questionnaire, of whom six did not report their height. The self-reported height (n = 164) was not significantly different from the study population (mean height SDS, −0.10; 95% confidence interval, −0.26 to +0.05;p = 0.19). No significant differences were found for the distribution of demographic variables either.

To obtain a sufficiently large sample, additional measurements took place at high schools, universities, and a youth festival and during medical examinations for joining the army. In the age range 12–17 y, 25% of the measurements of the study population were derived from this additional sample, and between 17 and 21 y, 40%. No statistically significant differences were found between the original sample and the additional sample for height SDS.

The distribution of the total sample for age, sex, geographical region, municipal size, family size, and child education was found to be similar to national distributions except for geographical region for girls aged ≥18 y (10). For them a weighted correction for geographical region was performed to obtain height references.

Measurements.

Length of infants, until 2 y of age, was measured to the nearest 0.1 cm in the supine position fully extended with their heels in contact with a baseboard. From 2.0 y of age, standing height was measured to the nearest 0.1 cm by using a so-called microtoise, which is an accurate measuring tape, incorporating an indicator and calibrated in millimeters. Infants up to 15 mo of age were weighed naked, on calibrated baby scales. Older children were weighed on calibrated mechanical or electronic step-scales, wearing underwear only. Weight was recorded in decigrams for infants, and rounded to the nearest 0.1 kg for older children. The pubertal stages were determined by visual inspection, using the criteria and definitions described by Tanner (11), extended by a sixth stage for PH (7). Testicular volume was estimated by means of a Prader orchidometer. The age at M was determined by the status quo method, i.e. each girl was asked whether or not she had had her first period.

A questionnaire, filled in by a health professional, was used to assess demographic variables. Within each region, five categories of municipal size were used, namely <20 000, 20 000–<50 000, 50 000–<100 000, 100 000–<200 000 and ≥200 000 inhabitants. Family size was defined by the number of children in a household (1, 2, 3, and ≥4). The same categories were used for birth rank. As an indicator of socioeconomic status, the highest completed educational level of the parents was used. The educational level of the child was determined at the time of measurement. If an adolescent of >15 y of age had left any educational system, the highest completed education was recorded.

Statistical analysis.

Reference SD curves were estimated by the LMS method (12). This method describes a variable y as a semiparametric regression function of a time-dependent variable t, so that the distribution of y changes gradually when plotted against t. The distribution of y is summarized by three natural spline curves, which vary in time: the Box-Cox transformation power that converts y to normality and minimizes the skewness of the data set (L), the median (M), and the coefficient of variation (S). In the LMS model, the smoothness of L, M, and S curves is characterized by the number of effective degrees of freedom. The choice of these smoothing factors for the L, M, and S curves was made by creating local detrended QQ plots (special plots to compare distributions) (13) of the SDS of the reference sample across 16 age groups. Optimal values of the smoothing factors will produce detrended QQ plots that are essentially flat. Reference curves for menarche and the stages of secondary sex characteristics were estimated by generalized additive models that use binomial logit link functions to describe the probability of each stage as a smoothing spline function of age (14). Optimal smoothing factors were determined by means of analysis of deviance.

Differences in mean heights between the samples of 1997, 1980, 1965, and 1955 were computed per age group. Sampling variation in the 1997 study, expressed as a 95% confidence band, was computed by modeling a smoothing spline around mean height. Because the previous studies had larger sample sizes, it was assumed that these had at least equal precision. The association of demographic variables (geographical region, municipal size, family size, birth rank, and educational level for both child and parents) with height SDS was assessed by ANOVA. The variables were taken as a set of categorical independent variables.

RESULTS

For all ages, the mean, median, and SD were calculated, resulting in reference charts for boys and girls, indicating SD lines (0, ±1, ±2 and ±2.5 SD). Length, weight, and head circumference reference charts were made for the age range 0–15.0 mo. Separate reference charts were made for height and head circumference for age, and weight for height for the age ranges 0–4.0 y and 1–21.0 y (15).

Height and weight for age.

Table 1 summarizes the mean and SD for length/height, weight, and head circumference for age in 1997. The data for height had a normal distribution for all ages, so that the mean is equal to the median (P50) and to 0 SD. The mean difference between supine length and standing height measurements was +0.4 cm at 24 mo of age. The curves for length and height were smoothly joined by maintaining the supine values to 2.0 y and by shifting the values for 2.25 y 0.2 cm upward (7). Weight had a skewed distribution, implying that 0 SD is equal to the median (P50), but not to the mean. Because of the skewed distribution, both −2 SD and +2 SD are given for weight for age.

Table 1 Reference data (0 SD, SD, −2 SD, and +2 SD, when appropriate) for length, height, weight, and head circumference for boys and girls in the age range of 2.0–60.0 wk and 1.0–21.0 y in the 1997 Dutch Growth Study

Comparison of mean heights between the sample according to the exclusion criteria of previous studies (which excluded children with a birth weight <2500 g) and the total sample showed for boys mean height differences of +0.2 cm <1 y of age, +0.1 cm at the age range 1–14 y, and no differences from 15 y of age. For girls, mean height differences were +0.2 cm <1 y of age and approximately +0.15 cm over the whole age range ≥1 y. In this article, these differences were added to the 1997 reference values for comparison with the previous studies. Thus, height references would slightly increase if the previous criteria had been used.

Figure 1 shows the mean final height in 1997 in relation to the three previous national growth studies of 1955, 1965, and 1980. The 1955 study was used as baseline (taking as mean final height at age 21 y 176.0 cm for boys and 163.0 cm for girls). The 95% confidence interval of the median 1997 curve is approximately ±0.5 cm across all ages. At 1 y of age, length was similar in all studies. In the age range 1–4 y, the positive secular change was limited to 1–2 cm in comparison to the 1955 study, with little change during the past 32 y. The major part of the secular change occurred in the age range 5–10 y of age in both sexes, which in boys was followed by a further increase up to final height. In 1997, the mean final height at age 21 y for boys and girls amounted to 184.0 cm and 170.6 cm, respectively. During the last 42 y, the mean final height has increased by 8.0 cm for boys and by 7.75 cm for girls.

Figure 1
figure 1

Secular differences in mean height between the 1997, 1980, and 1965 growth studies in comparison to data from the 1955 growth study.

Figure 2 shows the mean final height in 1965, 1980, and 1997 compared with the 1955 data for both sexes. The mean increase in final height was, respectively, 2.7, 5.7, and 7.9 cm. Thus, the rate of the positive secular change has slowly decreased from 2.7 cm/decade (1955–1965) to 2.0 (1965–1980) and 1.3 cm/decade (1980–1997). A small increase in SD was observed in final height in comparison to 1965 and 1980, but the coefficient of variation remained stable (approximately 3.8%). Thus, mean height increase represented a global shift of the entire height distribution. The difference in mean final height between boys and girls was 13.55 cm (corrected for exclusion criteria) compared with 13.7, 11.7, and 13.0 in the 1980, 1965, and 1955 studies (average difference, 13 cm).

Figure 2
figure 2

Final height (cm) in the 1965, 1980, and 1997 growth studies relative to the 1955 growth study for boys and girls and their average.

Weight for height.

Table 2 contains the actual 0 SD and ±2 SD values of weight for height in boys and girls. As during adolescence, weight for height is age-dependent; references are presented for two age ranges: <16 y, and ≥16 y. In particular, adolescents and young adults had higher weight for height values than in 1980. Detailed comparisons, combined with body mass index references, are reported elsewhere (16).

Table 2 Reference data (−2 SD, 0 SD, and +2 SD) for weight for height in boys and girls in two age ranges in the 1997 Dutch Growth Study

Associations of height with demographic variables.

The 1965 study demonstrated that mean height for age was related to geographical regions (northern children were taller than southern children), socioeconomic status (children with higher socioeconomic status were taller), and educational levels (children attending special education and lower secondary education were shorter) (5). In 1980, these differences had diminished, but still existed (8). In the 1997 study, mean height was related to geographical region (p < 0.001), family size (p = 0.002), and educational level of the parents (p < 0.001) and the child (p = 0.002). No significant association was found with birth rank, and only a slight trend of smaller mean height SDS for increasing birth rank was observed. When taken separately, municipal size was not significantly related to height SDS. However, if municipal size was combined with the other factors, it was significantly associated with height SDS. This effect appeared mainly an effect of the large cities (≥200 000 inhabitants). As this category was already corrected for in geographical region, municipal size was left out of the final set of factors. The magnitude of the effects is shown in Figure 3.

Figure 3
figure 3

Mean height SDS (with 95% confidence intervals) for geographical region, family size, and level of education. Children, <5 y of age, not attending primary education, are assigned to the category none. Shown are the means adjusted for the effects of the other three factors.

Sexual maturation.

The P10, P50, and P90 ages at attaining pubertal stages (G, PH, and testicular volume for boys, and B, PH, and M for girls) in the 1997 study are shown in Table 3. In 1997, girls developed stages B2 and B3 2 mo later than in 1980, but 2–3 mo earlier than in 1965. However, the median age at B4 and B5 was virtually identical to 1980, both being 6–12 mo lower than in 1965. The 3-mo decrease of the mean interval between B2 and B4 since 1980 was not statistically significant (p = 0.18, two-sided). The median age at M decreased rapidly between 1955 (13.66 y) and 1965 (13.40 y), followed by a slow decrease to 13.28 and 13.15 y in 1980 and 1997, respectively (1 mo/decade). The median age at PH stages 2–5 in 1997 was slightly higher than in 1980, but somewhat lower than in 1965. In boys, the age at G2 has slightly increased during the last 32 y, but the median age at G3-G4 has remained similar. The 2-mo decrease of the mean interval between G2 and G4 and between PH2 and PH4 since 1980 was not statistically significant (p = 0.20, two-sided).

Table 3 P10, P50, and P90 ages of reaching the stages of secondary sex characteristics for boys and girls in the 1997 Dutch Growth Study

DISCUSSION

Comparison of the results of the fourth nationwide growth study with those of the three previous studies shows that during a period of 42 y, the mean height of Dutch children, adolescents, and young adults, in 1980 already among the tallest on earth, has further increased during the past 17 y. At present, the average final height is 184.0 cm for young men and 170.6 cm for young women. Although the rate of secular growth change has gradually diminished, the size of the secular change during the last 17 y (1.3 cm/decade) renders it unlikely that the maximum stature has been reached. The positive shift in the Netherlands is in contrast with the arrest of secular change reported in Scandinavian countries, where the mean height of conscripts has remained 179.4 cm for the past 17 y (17).

The secular change during the last 42 y varies at different ages, with virtually no change in the first year, very little change until 4–5 y, and a major change from 5 to 10 y of age, followed by a stabilization in females and a gradual further rise in males. With respect to the first year of life, the length of female infants in the 1965, 1980, and 1997 studies was even somewhat shorter than in 1955, with in 1997 still a difference of −0.5 cm. The length of male infants at 1 y of age remained very similar during the years, in 1997 being only a few millimeters greater than in 1955. It is unclear why there is no secular change in infancy.

Usually, a positive secular growth change is accompanied by an advance of sexual maturation. In contrast, in Dutch boys the median age at stage G2 tended to increase during the past 30 y, although the mean interval between successive stages tended to decrease. In girls, a progressive shift in median age at M slowly decreased during the past four decades, but, the shift during the past 32 y has become small (<1.2 mo/decade). However, stages B2 and B3 tended to occur later than in 1980. Consistent with this maturation shift in both sexes, the final height difference between males and females has remained rather stable, close to 13 cm, a difference similar to that observed in other European and American data (18). If the four growth studies are taken together, the secular change has been almost identical for males and females.

It is generally assumed that improvement of the quantity and quality of food is a most important cause of secular growth change, but it is far from easy to pinpoint the association between the evolution of food consumption and the secular trend (19). In fact, during the past 42 y, the general wealth of the population has increased considerably, and at present virtually all children have easy access to food. The increase in available calories coincided with a decrease in energy lost or expended, mainly because of improved transportation (20). With regard to the quality of food, a clear rise of the consumption of animal proteins and saturated fat was observed between 1936 and 1975 (21), and the present consumption of dairy products in the Netherlands is one of the highest in the world, particularly the use of fermented products (22). Nowadays, the quality of food and a hygienic preparation may be more important to growth than food quantity.

Another major determinant of growth in a population may be the general level of child health. Like most European countries, child health has improved during the past four decades, because of a freely accessible preventive child health system, better hygiene, and a generalized vaccination program (covering 95% of Dutch infants) (23).

The level of education of the parents is significantly correlated with height, in agreement with many other studies in Europe (24). This may be related to differences in lifestyle (smoking, alcohol consumption, consumption of fruit and vegetables) (25). A cross-national comparison in 10 European countries showed a persistence of the size of the education-related height differences in the period 1920–1970, suggesting that inequalities in childhood living conditions will continue to contribute to inequalities in height during the decades to come (24). The lower height of children attending special education may be related to a higher frequency of subclinical congenital disorders. In an earlier study on children with idiopathic short stature without clinical syndromes, we found a clearly elevated number of children undergoing special education (26).

Regional height differences in a small country as the Netherlands, already noted since 1880, are remarkable. For a long time the northern part was predominantly Protestant and the southern part, Roman Catholic. It has been shown that in the 19th century, large-scale education programs were started much earlier in Protestant areas than in Catholic ones (27). Possibly related to this, marital fertility rates and infant mortality (28) decreased in the north from 1870 onward, whereas in the southern provinces this occurred not earlier than 1900. In the southern provinces, the marital fertility remained relatively high until the 1970s (29). Until very recently, differences existed in socioeconomic status, morbidity, lifestyle, and risk behavior between the northern and southern regions (25). Differences in nutrition pattern certainly existed in the past (21), but in the last decade no regional differences in nutrient intake have been found in children and adolescents (30). An additional factor may be that the genetic make-up of both regions has remained relatively unchanged, because little migration occurred between northern and southern provinces (31).

A major methodological issue in population-based growth studies is whether the procedure sufficiently ensures a representative sample, adequately stratified for the variables known to affect height. In the four consecutive growth studies in the Netherlands, all possible precautions were taken to arrive at a representative study sample, and the methodology has been essentially equal. Still, one problem has been shown to be difficult to resolve, i.e. the high number of nonresponders in adolescents and young adults. In all four growth studies, this problem was faced, and additional samples with less strict stratification were necessary to arrive at a sufficient number of cases. In our most recent study, we tried to assess the differences between the responders and nonresponders through a questionnaire, which indicated that there were no significant differences. However, we could only take a small sample from the total number of nonresponders, and not all individuals responded to the questionnaire, so that there cannot be absolute certainty about a possible selection bias. As to the additional sample, there was an overrepresentation of individuals from the northern provinces in the female adolescents, so that a correction step was needed. We feel confident that the final sample in these age ranges is representative for the population, but it is not unlikely that the fluctuations in the secular change in males and females during the past 42 y are at least partially caused by effects of sample selection.

Another major issue is what is meant by the term “population” of the country, in particular with respect to (second- and third-generation) immigrants. In the four studies described in this article, the population was defined as people who had at least one parent of Dutch origin, so that immigrant children were excluded. One should note, however, that 40–60% of the children in the four largest cities have non-Dutch parents, and that 17% of the whole population are immigrants (32). Although the use of this criterion is needed to assess the secular change in the defined population, it will become less and less useful if the growth references are aimed at representing the whole population. Because the height of Turkish and Moroccan children is approximately 1.5 SD lower than that of Dutch children, inclusion of individuals of non-Dutch origin will have a major impact on the growth references and on the course of the secular change in coming decades.

In conclusion, in children, adolescents, and young adults of Dutch origin, the positive secular growth change has still continued, although at a slower rate. In infancy, no secular shift of length was found. Weight for height increased. The onset of sexual maturation in both sexes occurred slightly later than in 1980. Median age at M continued to decrease during the past 42 y but at a slow rate. Although the socioeconomic conditions have been favorable for a number of decades, the positive secular change still continues.