In a previous post (), I discussed the frequently reported lowest-mortality body mass index (BMI), which is about 26. The empirical results reviewed in that post suggest that fat-free mass plays an important role in that context. Keep in mind that this "BMI=26 phenomenon" is often reported in studies of populations from developed countries, which are likely to be relatively sedentary. This is important for the point made in this post.
A lowest-mortality BMI of 26 is somehow at odds with the fact that many healthy and/or long-living populations have much lower BMIs. You can clearly see this in the distribution of BMIs among males in Kitava and Sweden shown in the graph below, from a study by Lindeberg and colleagues (). This distribution is shifted in such a way that would suggest a much lower BMI of lowest-mortality among the Kitavans, assuming a U-curve shape similar to that observed in studies of populations from developed countries ().
Another relevant example comes from the China Study II (see, e.g., ), which is based on data from 8000 adults. The average BMI in the China Study II dataset, with data from the 1980s, is approximately 21; for an average weight that is about 116 lbs. That BMI is relatively uniform across Chinese counties, including those with the lowest mortality rates. No county has an average BMI that is 26; not even close. This also supports the idea that Chinese people were, at least during that period, relatively thin.
Now take a look at the graph below, also based on the China Study II dataset, from a previous post (), relating total daily calorie intake with longevity. I should note that the relationship between total daily calorie intake and longevity depicted in this graph is not really statistically significant. Still, the highest longevity seems to be in the second tercile of total daily calorie intake.
Again, the average weight in the dataset is about 116 lbs. A conservative estimate of the number of calories needed to maintain this weight without any physical activity would be about 1740. Add about 700 calories to that, for a reasonable and healthy level of physical activity, and you get 2440 calories needed daily for weight maintenance. That is right in the middle of the second tercile, the one with the highest longevity.
What does this have to do with the lowest-mortality BMI of 26 from studies of samples from developed countries? Populations in these countries are likely to be relatively sedentary, at least on average, in which case a low BMI will be associated with a low total calorie intake. And a low total calorie intake will lead to a low intake of nutrients needed by the body to fight disease.
And don’t think you can fix this problem by consuming lots of vitamin and mineral pills. When I refer here to a higher or lower nutrient intake, I am not talking only about micronutrients, but also about macronutrients (fatty and amino acids) in amounts that are needed by your body. Moreover, important micronutrients, such as fat-soluble vitamins, cannot be properly absorbed without certain macronutrients, such as fat.
Industrial nutrient isolation for supplementation use has not been a very successful long-term strategy for health optimization (). On the other hand, this type of supplementation has indeed been found to have had modest-to-significant success in short-term interventions aimed at correcting acute health problems caused by severe nutritional deficiencies ().
So the "BMI=26 phenomenon" may be a reflection not of a direct effect of high muscularity on health, but of an indirect effect mediated by a high intake of needed nutrients among sedentary folks. This may be so even though the lowest mortality is for the combination of that BMI with a relatively small waist (), which suggests some level of muscularity, but not necessarily serious bodybuilder-level muscularity. High muscularity, of the serious bodybuilder type, is not very common; at least not enough to significantly sway results based on the analysis of large samples.
The combination of a BMI=26 with a relatively small waist is indicative of more muscle and less body fat. Having more muscle and less body fat has an advantage that is rarely discussed. It allows for a higher total calorie intake, and thus a higher nutrient intake, without an unhealthy increase in body fat. Muscle mass increases one's caloric requirement for weight maintenance, more so than body fat. Body fat also increases that caloric requirement, but it also acts like an organ, secreting a number of hormones into the bloodstream, and becoming pro-inflammatory in an unhealthy way above a certain level.
Clearly having a low body fat percentage is associated with lower incidence of degenerative diseases, but it will likely lead to a lower intake of nutrients relative to one’s needs unless other factors are present, e.g., being fairly muscular or physically active. Chronic low nutrient intake tends to get people closer to the afterlife like nothing else ().
In this sense, having a BMI=26 and being relatively sedentary (without being skinny-fat) has an effect that is similar to that of having a BMI=21 and being fairly physically active. Both would lead to consumption of more calories for weight maintenance, and thus more nutrients, as long as nutritious foods are eaten.
Showing posts with label mortality. Show all posts
Showing posts with label mortality. Show all posts
Monday, July 2, 2012
Monday, June 18, 2012
The lowest-mortality BMI: What is its relationship with fat-free mass?
Do overweight folks live longer? It is not uncommon to see graphs like the one below, from the Med Journal Watch blog (), suggesting that, at least as far as body mass index (BMI) is concerned (), overweight folks (25 < BMI < 30) seem to live longer. The graph shows BMI measured at a certain age, and risk of death within a certain time period (e.g., 20 years) following the measurement. The lowest-mortality BMI is about 26, which is in the overweight area of the BMI chart.
Note that relative age-adjusted mortality risk (i.e., relative to the mortality risk of people in the same age group), increases less steeply in response to weight variations as one becomes older. An older person increases the risk of dying to a lesser extent by weighing more or less than does a younger person. This seems to be particularly true for weight gain (as opposed to weight loss).
The table below is from a widely cited 2002 article by Allison and colleagues (), where they describe a study of 10,169 males aged 25-75. Almost all of the participants, ninety-eight percent, were followed up for many years after measurement; a total of 3,722 deaths were recorded.
Take a look at the two numbers circled in red. The one on the left is the lowest-mortality BMI not adjusting for fat mass or fat-free mass: a reasonably high 27.4. The one on the right is the lowest-mortality BMI adjusting for fat mass and fat-free mass: a much lower 21.6.
I know this may sound confusing, but due to possible statistical distortions this does not mean that you should try to bring your BMI to 21.6 if you want to reduce your risk of dying. What this means is that fat mass and fat-free mass matter. Moreover, all of the participants in this study were men. The authors concluded that: “…marked leanness (as opposed to thinness) has beneficial effects.”
Then we have an interesting 2003 article by Bigaard and colleagues () reporting on a study of 27,178 men and 29,875 women born in Denmark, 50 to 64 years of age. The table below summarizes deaths in this study, grouping them by BMI and waist circumference.
These are raw numbers; no complex statistics here. Circled in green is the area with samples that appear to be large enough to avoid “funny” results. Circled in red are the lowest-mortality percentages; I left out the 0.8 percentage because it is based on a very small sample.
As you can see, they refer to men and women with BMIs in the 25-29.9 range (overweight), but with waist circumferences in the lower-middle range: 90-96 cm for men and 74-82 cm for women; or approximately 35-38 inches for men and 29-32 inches for women.
Women with BMIs in the 18.5-24.9 range (normal) and the same or lower waists also died in small numbers. Underweight men and women had the highest mortality percentages.
A relatively small waist (not a wasp waist), together with a normal or high BMI, is an indication of more fat-free mass, which is retained together with some body fat. It is also an indication of less visceral body fat accumulation.

Note that relative age-adjusted mortality risk (i.e., relative to the mortality risk of people in the same age group), increases less steeply in response to weight variations as one becomes older. An older person increases the risk of dying to a lesser extent by weighing more or less than does a younger person. This seems to be particularly true for weight gain (as opposed to weight loss).
The table below is from a widely cited 2002 article by Allison and colleagues (), where they describe a study of 10,169 males aged 25-75. Almost all of the participants, ninety-eight percent, were followed up for many years after measurement; a total of 3,722 deaths were recorded.

Take a look at the two numbers circled in red. The one on the left is the lowest-mortality BMI not adjusting for fat mass or fat-free mass: a reasonably high 27.4. The one on the right is the lowest-mortality BMI adjusting for fat mass and fat-free mass: a much lower 21.6.
I know this may sound confusing, but due to possible statistical distortions this does not mean that you should try to bring your BMI to 21.6 if you want to reduce your risk of dying. What this means is that fat mass and fat-free mass matter. Moreover, all of the participants in this study were men. The authors concluded that: “…marked leanness (as opposed to thinness) has beneficial effects.”
Then we have an interesting 2003 article by Bigaard and colleagues () reporting on a study of 27,178 men and 29,875 women born in Denmark, 50 to 64 years of age. The table below summarizes deaths in this study, grouping them by BMI and waist circumference.

These are raw numbers; no complex statistics here. Circled in green is the area with samples that appear to be large enough to avoid “funny” results. Circled in red are the lowest-mortality percentages; I left out the 0.8 percentage because it is based on a very small sample.
As you can see, they refer to men and women with BMIs in the 25-29.9 range (overweight), but with waist circumferences in the lower-middle range: 90-96 cm for men and 74-82 cm for women; or approximately 35-38 inches for men and 29-32 inches for women.
Women with BMIs in the 18.5-24.9 range (normal) and the same or lower waists also died in small numbers. Underweight men and women had the highest mortality percentages.
A relatively small waist (not a wasp waist), together with a normal or high BMI, is an indication of more fat-free mass, which is retained together with some body fat. It is also an indication of less visceral body fat accumulation.
Labels:
BMI,
body fat,
body fat loss,
longevity,
mortality,
multivariate analysis,
muscle gain,
muscle loss
Monday, May 7, 2012
The 2012 Arch Intern Med red meat-mortality study: The “protective” effect of smoking
In a previous post () I used WarpPLS () to analyze the model below, using data reported in a recent study looking at the relationship between red meat consumption and mortality. The model below shows the different paths through which smoking influences mortality, highlighted in red. The study was not about smoking, but data was collected on that variable; hence this post.
When one builds a model like the one above, and tests it with empirical data, the person does something similar to what a physicist would do. The model is a graphical representation of a complex equation, which embodies the beliefs of the modeler. WarpPLS builds the complex equation automatically for the user, who would otherwise have to write it down using mathematical symbols.
The results yielded by the complex equation, partly in the form of coefficients of association for direct relationships (the betas next to the arrows), have a meaning. Some may look odd, and require novel interpretations, much in the same way that odd results from an equation describing planetary motions may have led to the development of the theory of black holes.
Nothing is actually "proven" by the results. They are part of the long and painstaking process we call "research". To advance new knowledge, one needs a lot more than a single study. Darwin's theory of evolution is still being tested. Based on various tests and partial refutations, it has itself evolved a great deal since its original formulation.
One set of results that are generated based on the model above by WarpPLS, in addition to coefficients for direct relationships, are coefficients of association called "total effects". They aggregate all of the effects, via multiple paths, between each pair of variables. Below is a table of total effects, with the total effects of smoking on diabetes incidence and overall mortality highlighted in red.
As you can see, the total effects of smoking on diabetes incidence and overall mortality are negative, but small enough to be considered insignificant. This is interesting, because smoking is definitely not health-promoting. Among hunter-gatherers, who often smoke tobacco, it increases the incidence of various types of cancer (). And it may be at the source of many of the health problems suggested by analyses on the China Study II data ().
So what are these results telling us? They tell us that smoking has an intermediate protective effect, very likely associated with its anorexic effect. Smoking is an appetite suppressor. Its total effect on food intake is negative, and strong. As we can see from the table of total effects, just below the two numbers highlighted in red, the total effect of smoking on food intake is -0.356.
Still, it looks like smoking is nearly as bad as overeating to the point of becoming obese (), in terms of its overall effect on health. Otherwise we would see a positive total effect on overall mortality of comparable strength to the negative total effect on food intake.
Smoking may make one eat less, but it ends up hastening one’s demise through different paths.
When one builds a model like the one above, and tests it with empirical data, the person does something similar to what a physicist would do. The model is a graphical representation of a complex equation, which embodies the beliefs of the modeler. WarpPLS builds the complex equation automatically for the user, who would otherwise have to write it down using mathematical symbols.
The results yielded by the complex equation, partly in the form of coefficients of association for direct relationships (the betas next to the arrows), have a meaning. Some may look odd, and require novel interpretations, much in the same way that odd results from an equation describing planetary motions may have led to the development of the theory of black holes.
Nothing is actually "proven" by the results. They are part of the long and painstaking process we call "research". To advance new knowledge, one needs a lot more than a single study. Darwin's theory of evolution is still being tested. Based on various tests and partial refutations, it has itself evolved a great deal since its original formulation.
One set of results that are generated based on the model above by WarpPLS, in addition to coefficients for direct relationships, are coefficients of association called "total effects". They aggregate all of the effects, via multiple paths, between each pair of variables. Below is a table of total effects, with the total effects of smoking on diabetes incidence and overall mortality highlighted in red.
As you can see, the total effects of smoking on diabetes incidence and overall mortality are negative, but small enough to be considered insignificant. This is interesting, because smoking is definitely not health-promoting. Among hunter-gatherers, who often smoke tobacco, it increases the incidence of various types of cancer (). And it may be at the source of many of the health problems suggested by analyses on the China Study II data ().
So what are these results telling us? They tell us that smoking has an intermediate protective effect, very likely associated with its anorexic effect. Smoking is an appetite suppressor. Its total effect on food intake is negative, and strong. As we can see from the table of total effects, just below the two numbers highlighted in red, the total effect of smoking on food intake is -0.356.
Still, it looks like smoking is nearly as bad as overeating to the point of becoming obese (), in terms of its overall effect on health. Otherwise we would see a positive total effect on overall mortality of comparable strength to the negative total effect on food intake.
Smoking may make one eat less, but it ends up hastening one’s demise through different paths.
Monday, April 2, 2012
The 2012 Arch Intern Med red meat-mortality study: Eating 234 g/d of red meat could reduce mortality by 23 percent
As we have seen in an earlier post on the China Study data (), which explored relationships hinted at by Denise Minger’s previous and highly perceptive analysis (), one can use a multivariate analysis tool like WarpPLS () to explore relationships based on data reported by others. This is true even when the dataset available is fairly small.
So I entered the data reported in the most recent (published online in March 2012) study looking at the relationship between red meat consumption and mortality into WarpPLS to do some exploratory analyses. I discussed the study in my previous post; it was conducted by Pan et al. (Frank B. Hu is the senior author) and published in the prestigious Archives of Internal Medicine (). The data I used is from Table 1 of the article; it reports figures on several variables along 5 quintiles, based on separate analyses of two samples, called “Health Professionals” and “Nurses Health” samples. The Health Professionals sample comprised males; the Nurses Health sample, females.
Below is an interesting exploratory model, with results. It includes a number of hypotheses, represented by arrows, which seem to make sense. This is helpful, because a model incorporating hypotheses that make sense allows for easy identification of nonsense results, and thus rejection of the model or the data. (Refutability is one of the most important characteristics of good theoretical models.) Keep in mind that the sample size here is very small (N=10), as the authors of the study reported data along 5 quintiles for the Health Professionals sample, together with 5 quintiles for the Nurses Health sample. In a sense, this is somewhat helpful, because a small sample tends to be “unstable”, leading nonsense results and other signs of problems to show up easily – one example would be multivariate coefficients of association (the beta coefficients reported near the arrows) greater than 1 due to collinearity ().
So what does the model above tell us? It tells us that smoking (Smokng) is associated with reduced physical activity (PhysAct); beta = -0.92. It tells us that smoking (Smokng) is associated with reduced food intake (FoodInt); beta = -0.36. It tells us that physical activity (PhysAct) is associated with reduced incidence of diabetes (Diabetes); beta = -0.25. It tells us that increased food intake (FoodInt) is associated with increased incidence of diabetes (Diabetes); beta = 0.93. It tells us that increased food intake (FoodInt) is associated with increased red meat intake (RedMeat); beta = 0.60. It tells us that increased incidence of diabetes (Diabetes) is associated with increased mortality (Mort); beta = 0.61. It tells us that being female (SexM1F2) is associated with reduced mortality (Mort); beta = -0.67.
Some of these betas are a bit too high (e.g., 0.93), due to the level of collinearity caused by such a small sample. Due to being quite high, they are statistically significant even in a small sample. Betas greater than 0.20 tend to become statistically significant when the sample size is 100 or greater; so all of the coefficients above would be statistically significant with a larger sample size. What is the common denominator of all of the associations above? The common denominator is that all of them make sense, qualitatively speaking; there is not a single case where the sign is the opposite of what we would expect. There is one association that is shown on the graph and that is missing from my summary of associations above; and it also makes sense, at least to me. The model also tells us that increased red meat intake (RedMeat) is associated with reduced mortality (Mort); beta = -0.25. More technically, it tells us that, when we control for biological sex (SexM1F2) and incidence of diabetes (Diabetes), increased red meat intake (RedMeat) is associated with reduced mortality (Mort).
How do we roughly estimate this effect in terms of amounts of red meat consumed? The -0.25 means that, for each standard deviation in the amount of red meat consumed, there is a corresponding 0.25 standard deviation reduction of mortality. (This interpretation is possible because I used WarpPLS’ linear analysis algorithm; a nonlinear algorithm would lead to a more complex interpretation.) The standard deviation for red meat consumption is 0.897 servings. Each serving has about 84 g. And the highest number of servings in the dataset is 3.1 servings, or 260 g/d (calculated as: 3.1*84). To stay a bit shy of this extreme, let us consider a slightly lower intake amount, which is 3.1 standard deviations, or 234 g/d (calculated as: 3.1*0.897*84). Since the standard deviation for mortality is 0.3 percentage points, we can conclude that an extra 234 g of red meat per day is associated with a reduction in mortality of approximately 23 percent (calculated as: 3.1*0.25*0.3).
Let me repeat for emphasis: the data reported by the authors suggests that, when we control for biological sex and incidence of diabetes, an extra 234 g of red meat per day is associated with a reduction in mortality of approximately 23 percent. This is exactly the opposite, qualitatively speaking, of what was reported by the authors in the article. I should note that this is also a minute effect, like the effect reported by the authors. (The mortality rates in the article are expressed as percentages, with the lowest being around 1 percent. So this 23 percent is a percentage of a percentage.) If you were to compare a group of 100 people who ate little red meat with another group of the same size that ate 234 g more of red meat every day, over a period of more than 20 years, you would not find a single additional death in either group. If you were to compare matched groups of 1,000 individuals, you would find only 2 additional deaths among the folks who ate little red meat.
At the same time, we can also see that excessive food intake is associated with increased mortality via its effect on diabetes. The product beta coefficient for the mediated effect FoodInt --> Diabetes --> Mort is 0.57. This means that, for each standard deviation of food intake in grams, there is a corresponding 0.57 standard deviation increase in mortality, via an increase in the incidence of diabetes. This is very likely at levels of food consumption where significantly more calories are consumed than spent, ultimately leading to many people becoming obese. The standard deviation for food intake is 355 calories. The highest daily food intake quintile reported in the article is 2,396 calories, which happens to be associated with the highest mortality (and is probably an underestimation); the lowest is 1,202 (also probably underestimated).
So, in summary, the data suggests that, for the particular sample studied (made up of two subsamples): (a) red meat intake is protective in terms of overall mortality, through a direct effect; and (b) the deleterious effect of overeating on mortality is stronger than the protective effect of red meat intake. These conclusions are consistent with those of my previous post on the same study (). The difference is that the previous post suggested a possible moderating protective effect; this post suggests a possible direct protective effect. Both effects are small, as was the negative effect reported by the authors of the study. Neither is statistically significant, due to sample size limitations (secondary data from an article; N=10). And all of this is based on a study that categorized various types of processed meat as red meat, and that did not distinguish grass-fed from non-grass-fed meat.
By the way, in discussions of red meat intake’s effect on health, often iron overload is mentioned. What many people don’t seem to realize is that iron overload is caused primarily by hereditary haemochromatosis. Another cause is “blood doping” to improve athletic performance (). Hereditary haemochromatosis is a very rare genetic disorder; rare enough to be statistically “invisible” in any study that does not specifically target people with this disorder.
So I entered the data reported in the most recent (published online in March 2012) study looking at the relationship between red meat consumption and mortality into WarpPLS to do some exploratory analyses. I discussed the study in my previous post; it was conducted by Pan et al. (Frank B. Hu is the senior author) and published in the prestigious Archives of Internal Medicine (). The data I used is from Table 1 of the article; it reports figures on several variables along 5 quintiles, based on separate analyses of two samples, called “Health Professionals” and “Nurses Health” samples. The Health Professionals sample comprised males; the Nurses Health sample, females.
Below is an interesting exploratory model, with results. It includes a number of hypotheses, represented by arrows, which seem to make sense. This is helpful, because a model incorporating hypotheses that make sense allows for easy identification of nonsense results, and thus rejection of the model or the data. (Refutability is one of the most important characteristics of good theoretical models.) Keep in mind that the sample size here is very small (N=10), as the authors of the study reported data along 5 quintiles for the Health Professionals sample, together with 5 quintiles for the Nurses Health sample. In a sense, this is somewhat helpful, because a small sample tends to be “unstable”, leading nonsense results and other signs of problems to show up easily – one example would be multivariate coefficients of association (the beta coefficients reported near the arrows) greater than 1 due to collinearity ().
So what does the model above tell us? It tells us that smoking (Smokng) is associated with reduced physical activity (PhysAct); beta = -0.92. It tells us that smoking (Smokng) is associated with reduced food intake (FoodInt); beta = -0.36. It tells us that physical activity (PhysAct) is associated with reduced incidence of diabetes (Diabetes); beta = -0.25. It tells us that increased food intake (FoodInt) is associated with increased incidence of diabetes (Diabetes); beta = 0.93. It tells us that increased food intake (FoodInt) is associated with increased red meat intake (RedMeat); beta = 0.60. It tells us that increased incidence of diabetes (Diabetes) is associated with increased mortality (Mort); beta = 0.61. It tells us that being female (SexM1F2) is associated with reduced mortality (Mort); beta = -0.67.
Some of these betas are a bit too high (e.g., 0.93), due to the level of collinearity caused by such a small sample. Due to being quite high, they are statistically significant even in a small sample. Betas greater than 0.20 tend to become statistically significant when the sample size is 100 or greater; so all of the coefficients above would be statistically significant with a larger sample size. What is the common denominator of all of the associations above? The common denominator is that all of them make sense, qualitatively speaking; there is not a single case where the sign is the opposite of what we would expect. There is one association that is shown on the graph and that is missing from my summary of associations above; and it also makes sense, at least to me. The model also tells us that increased red meat intake (RedMeat) is associated with reduced mortality (Mort); beta = -0.25. More technically, it tells us that, when we control for biological sex (SexM1F2) and incidence of diabetes (Diabetes), increased red meat intake (RedMeat) is associated with reduced mortality (Mort).
How do we roughly estimate this effect in terms of amounts of red meat consumed? The -0.25 means that, for each standard deviation in the amount of red meat consumed, there is a corresponding 0.25 standard deviation reduction of mortality. (This interpretation is possible because I used WarpPLS’ linear analysis algorithm; a nonlinear algorithm would lead to a more complex interpretation.) The standard deviation for red meat consumption is 0.897 servings. Each serving has about 84 g. And the highest number of servings in the dataset is 3.1 servings, or 260 g/d (calculated as: 3.1*84). To stay a bit shy of this extreme, let us consider a slightly lower intake amount, which is 3.1 standard deviations, or 234 g/d (calculated as: 3.1*0.897*84). Since the standard deviation for mortality is 0.3 percentage points, we can conclude that an extra 234 g of red meat per day is associated with a reduction in mortality of approximately 23 percent (calculated as: 3.1*0.25*0.3).
Let me repeat for emphasis: the data reported by the authors suggests that, when we control for biological sex and incidence of diabetes, an extra 234 g of red meat per day is associated with a reduction in mortality of approximately 23 percent. This is exactly the opposite, qualitatively speaking, of what was reported by the authors in the article. I should note that this is also a minute effect, like the effect reported by the authors. (The mortality rates in the article are expressed as percentages, with the lowest being around 1 percent. So this 23 percent is a percentage of a percentage.) If you were to compare a group of 100 people who ate little red meat with another group of the same size that ate 234 g more of red meat every day, over a period of more than 20 years, you would not find a single additional death in either group. If you were to compare matched groups of 1,000 individuals, you would find only 2 additional deaths among the folks who ate little red meat.
At the same time, we can also see that excessive food intake is associated with increased mortality via its effect on diabetes. The product beta coefficient for the mediated effect FoodInt --> Diabetes --> Mort is 0.57. This means that, for each standard deviation of food intake in grams, there is a corresponding 0.57 standard deviation increase in mortality, via an increase in the incidence of diabetes. This is very likely at levels of food consumption where significantly more calories are consumed than spent, ultimately leading to many people becoming obese. The standard deviation for food intake is 355 calories. The highest daily food intake quintile reported in the article is 2,396 calories, which happens to be associated with the highest mortality (and is probably an underestimation); the lowest is 1,202 (also probably underestimated).
So, in summary, the data suggests that, for the particular sample studied (made up of two subsamples): (a) red meat intake is protective in terms of overall mortality, through a direct effect; and (b) the deleterious effect of overeating on mortality is stronger than the protective effect of red meat intake. These conclusions are consistent with those of my previous post on the same study (). The difference is that the previous post suggested a possible moderating protective effect; this post suggests a possible direct protective effect. Both effects are small, as was the negative effect reported by the authors of the study. Neither is statistically significant, due to sample size limitations (secondary data from an article; N=10). And all of this is based on a study that categorized various types of processed meat as red meat, and that did not distinguish grass-fed from non-grass-fed meat.
By the way, in discussions of red meat intake’s effect on health, often iron overload is mentioned. What many people don’t seem to realize is that iron overload is caused primarily by hereditary haemochromatosis. Another cause is “blood doping” to improve athletic performance (). Hereditary haemochromatosis is a very rare genetic disorder; rare enough to be statistically “invisible” in any study that does not specifically target people with this disorder.
Labels:
diabetes,
haemochromatosis,
iron overload,
longevity,
mortality,
red meat,
warppls
Monday, March 19, 2012
The 2012 red meat-mortality study (Arch Intern Med): The data suggests that red meat is protective
I am not a big fan of using arguments such as “food questionnaires are unreliable” and “observational studies are worthless” to completely dismiss a study. There are many reasons for this. One of them is that, when people misreport certain diet and lifestyle patterns, but do that consistently (i.e., everybody underreports food intake), the biasing effect on coefficients of association is minor. Measurement errors may remain for this or other reasons, but regression methods (linear and nonlinear) assume the existence of such errors, and are designed to yield robust coefficients in their presence. Besides, for me to use these types of arguments would be hypocritical, since I myself have done several analyses on the China Study data (), and built what I think are valid arguments based on those analyses.
My approach is: Let us look at the data, any data, carefully, using appropriate analysis tools, and see what it tells us; maybe we will find evidence of measurement errors distorting the results and leading to mistaken conclusions, or maybe not. With this in mind, let us take a look at the top part of Table 3 of the most recent (published online in March 2012) study looking at the relationship between red meat consumption and mortality, authored by Pan et al. (Frank B. Hu is the senior author) and published in the prestigious Archives of Internal Medicine (). This is a prominent journal, with an average of over 270 citations per article according to Google Scholar. The study has received much media attention recently.
Take a look at the area highlighted in red, focusing on data from the Health Professionals sample. That is the multivariate-adjusted cardiovascular mortality rate, listed as a normalized percentage, in the highest quintile (Q5) of red meat consumption from the Health Professionals sample. The non-adjusted percentages are 1.4 percent mortality in Q5 and 1.13 in Q1 (from Table 1 of the same article); so the multivariate adjustment-normalization changed the values of the percentages somewhat, but not much. The highlighted 1.35 number suggests that for each group of 100 people who consumed a lot of red meat (Q5), when compared with a group of 100 people who consumed little red meat (Q1), there were on average 0.35 more deaths over the same period of time (more than 20 years).
The heavy red meat eaters in Q5 consumed 972.37 percent more red meat than those in Q1. This is calculated with data from Table 1 of the same article, as: (2.36-0.22)/0.22. In Q5, the 2.36 number refers to the number of servings of red meat per day, with each serving being approximately 84 g. So the heavy red meat eaters ate approximately 198 g per day (a bit less than 0.5 lb), while the light red meat eaters ate about 18 g per day. In other words, the heavy red meat eaters ate 9.7237 times more, or 972.37 percent more, red meat.
So, just to be clear, even though the folks in Q5 consumed 972.37 percent more red meat than the folks in Q1, in each matched group of 100 you would not find a single additional death over the same time period. If you looked at matched groups of 1,000 individuals, you would find 3 more deaths among the heavy red meat eaters. The same general pattern, of a minute difference, repeats itself throughout Table 3. As you can see, all of the reported mortality ratios are 1-point-something. In fact, this same pattern repeats itself in all mortality tables (all-cause, cardiovascular, cancer). This is all based on a multivariate analysis that according to the authors controlled for a large number of variables, including baseline history of diabetes.
Interestingly, looking at data from the same sample (Health Professionals), the incidence of diabetes is 75 percent higher in Q5 than in Q1. The same is true for the second sample (Nurses Health), where the Q5-Q1 difference in incidence of diabetes is even greater - 81 percent. This caught my eye, being diabetes such a prototypical “disease of affluence”. So I entered the whole data reported in the article into HCE () and WarpPLS (), and conducted some analyses. The graphs below are from HCE. The data includes both samples – Health Professionals and Nurses Health.
HCE calculates bivariate correlations, and so does WarpPLS. But WarpPLS stores numbers with a higher level of precision, so I used WarpPLS for calculating coefficients of association, including correlations. I also double-checked the numbers with other software, just in case (e.g., SPSS and MATLAB). Here are the correlations calculated by WarpPLS, which refer to the graphs above: 0.030 for red meat intake and mortality; 0.607 for diabetes and mortality; and 0.910 for food intake and diabetes. Yes, you read it right, the correlation between red meat intake and mortality is a very low and non-significant 0.030 in this dataset. Not a big surprise when you look at the related HCE graph, with the line going up and down almost at random. Note that I included the quintiles data from both the Health Professionals and Nurses Health samples in one dataset.
Those folks in Q5 had a much higher incidence of diabetes, and yet the increase in mortality for them was significantly lower, in percentage terms. A key difference between Q5 and Q1 being what? The Q5 folks ate a lot more red meat. This looks suspiciously suggestive of a finding that I came across before, based on an analysis of the China Study II data (). The finding was that animal food consumption (and red meat is an animal food) was protective, actually reducing the negative effect of wheat flour consumption on mortality. That analysis actually suggested that wheat flour consumption may not be so bad if you eat 221 g or more of animal food daily.
So, I built the model below in WarpPLS, where red meat intake (RedMeat) is hypothesized to moderate the relationship between diabetes incidence (Diabetes) and mortality (Mort). Below I am also including the graphs for the direct and moderating effects; the data is standardized, which reduces estimation error, particularly in moderating effects estimation. I used a standard linear algorithm for the calculation of the path coefficients (betas next to the arrows) and jackknifing for the calculation of the P values (confidence = 1 – P value). Jackknifing is a resampling technique that does not require multivariate normality and that tends to work well with small samples; as is the case with nonparametric techniques in general.
The direct effect of diabetes on mortality is positive (0.68) and almost statistically significant at the P < 0.05 level (confidence of 94 percent), which is noteworthy because the sample size here is so small – only 10 data points, 5 quintiles from the Health Professionals sample and 5 from the Nurses Health sample. The moderating effect is negative (-0.11), but not statistically significant (confidence of 61 percent). In the moderating effect graphs (shown side-by-side), this negative moderation is indicated by a slightly less steep inclination of the regression line for the graph on the right, which refers to high red meat intake. A less steep inclination means a less strong relationship between diabetes and mortality – among the folks who ate the most red meat.
Not too surprisingly, at least to me, the results above suggest that red meat per se may well be protective. Although we should consider a least two other possibilities. One is that red meat intake is a marker for consumption of some other things, possibly present in animal foods, that are protective - e.g., choline and vitamin K2. The other possibility is that red meat is protective in part by displacing other less healthy foods. Perhaps what we are seeing here is a combination of these.
Whatever the reason may be, red meat consumption seems to actually lessen the effect of diabetes on mortality in this sample. That is, according to this data, the more red meat is consumed, the fewer people die from diabetes. The protective effect might have been stronger if the participants had eaten more red meat, or more animal foods containing the protective factors; recall that the threshold for protection in the China Study II data was consumption of 221 g or more of animal food daily (). Having said that, it is also important to note that, if you eat excess calories to the point of becoming obese, from red meat or any other sources, your risk of developing diabetes will go up – as the earlier HCE graph relating food intake and diabetes implies.
Please keep in mind that this post is the result of a quick analysis of secondary data reported in a journal article, and its conclusions may be wrong, even though I did my best not to make any mistake (e.g., mistyping data from the article). The authors likely spent months, if not more, in their study; and have the support of one of the premier research universities in the world. Still, this post raises serious questions. I say this respectfully, as the authors did seem to try their best to control for all possible confounders.
I should also say that the moderating effect I uncovered is admittedly a fairly weak effect on this small sample and not statistically significant. But its magnitude is apparently greater than the reported effects of red meat on mortality, which are not only minute but may well be statistical artifacts. The Cox proportional hazards analysis employed in the study, which is commonly used in epidemiology, is nothing more than a sophisticated ANCOVA; it is a semi-parametric version of a special case of the broader analysis method automated by WarpPLS.
Finally, I could not control for confounders because, given the small sample, inclusion of confounders (e.g., smoking) leads to massive collinearity. WarpPLS calculates collinearity estimates automatically, and is particularly thorough at doing that (calculating them at multiple levels), so there is no way to ignore them. Collinearity can severely distort results, as pointed out in a YouTube video on WarpPLS (). Collinearity can even lead to changes in the signs of coefficients of association, in the context of multivariate analyses - e.g., a positive association appears to be negative. The authors have the original data – a much, much larger sample - which makes it much easier to deal with collinearity.
Moderating effects analyses () – we need more of that in epidemiological research eh?
My approach is: Let us look at the data, any data, carefully, using appropriate analysis tools, and see what it tells us; maybe we will find evidence of measurement errors distorting the results and leading to mistaken conclusions, or maybe not. With this in mind, let us take a look at the top part of Table 3 of the most recent (published online in March 2012) study looking at the relationship between red meat consumption and mortality, authored by Pan et al. (Frank B. Hu is the senior author) and published in the prestigious Archives of Internal Medicine (). This is a prominent journal, with an average of over 270 citations per article according to Google Scholar. The study has received much media attention recently.
Take a look at the area highlighted in red, focusing on data from the Health Professionals sample. That is the multivariate-adjusted cardiovascular mortality rate, listed as a normalized percentage, in the highest quintile (Q5) of red meat consumption from the Health Professionals sample. The non-adjusted percentages are 1.4 percent mortality in Q5 and 1.13 in Q1 (from Table 1 of the same article); so the multivariate adjustment-normalization changed the values of the percentages somewhat, but not much. The highlighted 1.35 number suggests that for each group of 100 people who consumed a lot of red meat (Q5), when compared with a group of 100 people who consumed little red meat (Q1), there were on average 0.35 more deaths over the same period of time (more than 20 years).
The heavy red meat eaters in Q5 consumed 972.37 percent more red meat than those in Q1. This is calculated with data from Table 1 of the same article, as: (2.36-0.22)/0.22. In Q5, the 2.36 number refers to the number of servings of red meat per day, with each serving being approximately 84 g. So the heavy red meat eaters ate approximately 198 g per day (a bit less than 0.5 lb), while the light red meat eaters ate about 18 g per day. In other words, the heavy red meat eaters ate 9.7237 times more, or 972.37 percent more, red meat.
So, just to be clear, even though the folks in Q5 consumed 972.37 percent more red meat than the folks in Q1, in each matched group of 100 you would not find a single additional death over the same time period. If you looked at matched groups of 1,000 individuals, you would find 3 more deaths among the heavy red meat eaters. The same general pattern, of a minute difference, repeats itself throughout Table 3. As you can see, all of the reported mortality ratios are 1-point-something. In fact, this same pattern repeats itself in all mortality tables (all-cause, cardiovascular, cancer). This is all based on a multivariate analysis that according to the authors controlled for a large number of variables, including baseline history of diabetes.
Interestingly, looking at data from the same sample (Health Professionals), the incidence of diabetes is 75 percent higher in Q5 than in Q1. The same is true for the second sample (Nurses Health), where the Q5-Q1 difference in incidence of diabetes is even greater - 81 percent. This caught my eye, being diabetes such a prototypical “disease of affluence”. So I entered the whole data reported in the article into HCE () and WarpPLS (), and conducted some analyses. The graphs below are from HCE. The data includes both samples – Health Professionals and Nurses Health.
HCE calculates bivariate correlations, and so does WarpPLS. But WarpPLS stores numbers with a higher level of precision, so I used WarpPLS for calculating coefficients of association, including correlations. I also double-checked the numbers with other software, just in case (e.g., SPSS and MATLAB). Here are the correlations calculated by WarpPLS, which refer to the graphs above: 0.030 for red meat intake and mortality; 0.607 for diabetes and mortality; and 0.910 for food intake and diabetes. Yes, you read it right, the correlation between red meat intake and mortality is a very low and non-significant 0.030 in this dataset. Not a big surprise when you look at the related HCE graph, with the line going up and down almost at random. Note that I included the quintiles data from both the Health Professionals and Nurses Health samples in one dataset.
Those folks in Q5 had a much higher incidence of diabetes, and yet the increase in mortality for them was significantly lower, in percentage terms. A key difference between Q5 and Q1 being what? The Q5 folks ate a lot more red meat. This looks suspiciously suggestive of a finding that I came across before, based on an analysis of the China Study II data (). The finding was that animal food consumption (and red meat is an animal food) was protective, actually reducing the negative effect of wheat flour consumption on mortality. That analysis actually suggested that wheat flour consumption may not be so bad if you eat 221 g or more of animal food daily.
So, I built the model below in WarpPLS, where red meat intake (RedMeat) is hypothesized to moderate the relationship between diabetes incidence (Diabetes) and mortality (Mort). Below I am also including the graphs for the direct and moderating effects; the data is standardized, which reduces estimation error, particularly in moderating effects estimation. I used a standard linear algorithm for the calculation of the path coefficients (betas next to the arrows) and jackknifing for the calculation of the P values (confidence = 1 – P value). Jackknifing is a resampling technique that does not require multivariate normality and that tends to work well with small samples; as is the case with nonparametric techniques in general.
The direct effect of diabetes on mortality is positive (0.68) and almost statistically significant at the P < 0.05 level (confidence of 94 percent), which is noteworthy because the sample size here is so small – only 10 data points, 5 quintiles from the Health Professionals sample and 5 from the Nurses Health sample. The moderating effect is negative (-0.11), but not statistically significant (confidence of 61 percent). In the moderating effect graphs (shown side-by-side), this negative moderation is indicated by a slightly less steep inclination of the regression line for the graph on the right, which refers to high red meat intake. A less steep inclination means a less strong relationship between diabetes and mortality – among the folks who ate the most red meat.
Not too surprisingly, at least to me, the results above suggest that red meat per se may well be protective. Although we should consider a least two other possibilities. One is that red meat intake is a marker for consumption of some other things, possibly present in animal foods, that are protective - e.g., choline and vitamin K2. The other possibility is that red meat is protective in part by displacing other less healthy foods. Perhaps what we are seeing here is a combination of these.
Whatever the reason may be, red meat consumption seems to actually lessen the effect of diabetes on mortality in this sample. That is, according to this data, the more red meat is consumed, the fewer people die from diabetes. The protective effect might have been stronger if the participants had eaten more red meat, or more animal foods containing the protective factors; recall that the threshold for protection in the China Study II data was consumption of 221 g or more of animal food daily (). Having said that, it is also important to note that, if you eat excess calories to the point of becoming obese, from red meat or any other sources, your risk of developing diabetes will go up – as the earlier HCE graph relating food intake and diabetes implies.
Please keep in mind that this post is the result of a quick analysis of secondary data reported in a journal article, and its conclusions may be wrong, even though I did my best not to make any mistake (e.g., mistyping data from the article). The authors likely spent months, if not more, in their study; and have the support of one of the premier research universities in the world. Still, this post raises serious questions. I say this respectfully, as the authors did seem to try their best to control for all possible confounders.
I should also say that the moderating effect I uncovered is admittedly a fairly weak effect on this small sample and not statistically significant. But its magnitude is apparently greater than the reported effects of red meat on mortality, which are not only minute but may well be statistical artifacts. The Cox proportional hazards analysis employed in the study, which is commonly used in epidemiology, is nothing more than a sophisticated ANCOVA; it is a semi-parametric version of a special case of the broader analysis method automated by WarpPLS.
Finally, I could not control for confounders because, given the small sample, inclusion of confounders (e.g., smoking) leads to massive collinearity. WarpPLS calculates collinearity estimates automatically, and is particularly thorough at doing that (calculating them at multiple levels), so there is no way to ignore them. Collinearity can severely distort results, as pointed out in a YouTube video on WarpPLS (). Collinearity can even lead to changes in the signs of coefficients of association, in the context of multivariate analyses - e.g., a positive association appears to be negative. The authors have the original data – a much, much larger sample - which makes it much easier to deal with collinearity.
Moderating effects analyses () – we need more of that in epidemiological research eh?
Subscribe to:
Posts (Atom)