Skip Navigation


Health Policy and Planning Advance Access originally published online on October 9, 2006
Health Policy and Planning 2006 21(6):459-468; doi:10.1093/heapol/czl029
This Article
Right arrow Full Text Freely available
Right arrow FREE Full Text (PDF) Freely available
Right arrow All Versions of this Article:
21/6/459    most recent
czl029v1
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (8)
Right arrowRequest Permissions
Right arrow Disclaimer
Google Scholar
Right arrow Articles by Vyas, S.
Right arrow Articles by Kumaranayake, L.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Vyas, S.
Right arrow Articles by Kumaranayake, L.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

© The Author 2006. Published by Oxford University Press in association with The London School of Hygiene and Tropical Medicine. All rights reserved.

Constructing socio-economic status indices: how to use principal components analysis

Seema Vyas and Lilani Kumaranayake

HIVTools Research Group, Health Policy Unit, Department of Public Health and Policy, London School of Hygiene and Tropical Medicine, London, UK

Correspondence: Seema Vyas, HIVTools Research Group, Health Policy Unit, Department of Public Health and Policy, London School of Hygiene and Tropical Medicine, Keppel Street, London, WC1E 7HT, UK. Tel: +44 (0) 20 7612 7828; Fax: +44 (0) 20 7637 5391; E-mail: seema.vyas{at}lshtm.ac.uk

Theoretically, measures of household wealth can be reflected by income, consumption or expenditure information. However, the collection of accurate income and consumption data requires extensive resources for household surveys. Given the increasingly routine application of principal components analysis (PCA) using asset data in creating socio-economic status (SES) indices, we review how PCA-based indices are constructed, how they can be used, and their validity and limitations. Specifically, issues related to choice of variables, data preparation and problems such as data clustering are addressed. Interpretation of results and methods of classifying households into SES groups are also discussed. PCA has been validated as a method to describe SES differentiation within a population. Issues related to the underlying data will affect PCA and this should be considered when generating and interpreting results.

Key Words: socio-economic status, principal components analysis, cluster analysis, methodology

1A vector that results in a scalar multiple of itself when multiplied by a matrix is known as an eigenvector, and the scalar is its associated eigenvalue. Eigenvectors can only be found for square matrices (though not all), and for an n x n matrix, there are n eigenvectors. For a more detailed description of matrix algebra, and in particular eigenvectors and eigenvalues, see Manly (1994).

2Brazil is a lower-middle-income country with a GNI per capita of US$3090. With a GNI per capita of US$110 Ethiopia is one of the world's poorest countries ([http://www.worldbank.org]). The urban population was 83% in Brazil and 16% in Ethiopia in 2003 (UNDP 2005). We used the 1996 Brazil DHS and 2000 Ethiopia DHS.

3The construction of a number of binary variables from categorical variables is another way to organize the data, although nominally new variables are created. For example, the categorical variable RELIGION, with the values Christian, Muslim, Jewish, Buddhist, converted to binary form would mean the creation of four new variables CHRISTIAN, MUSLIM, JEWISH, BUDDHIST, all of which took on the value of 0 or 1. As the nature of categorical variables is that there is no hierarchical relationship between the variables (which is why they cannot be converted into a meaningful quantitative scale), their conversion into binary variables and inclusion as additional variables does not change the relationship between the variables nor add any additional variation or correlation in the dataset. Rather, having individual variables, PCA can determine which of the particular religion variables can differentiate between households.

4PCA is not invariant to differences in the units of measurement among variables, therefore it is usual to standardize the variables in this instance (Bolch and Huang 1974). Standardization is the process of transforming variables so that the new set of scores has a mean equal to zero and standard deviation equal to one. The correlation matrix is a standardized version of the co-variance matrix.

5Factor score for ownership of a bicycle not stated in Houweling et al. (2003).


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
J. Nutr.Home page
J. H. Rah, P. Christian, A. A. Shamim, U. T. Arju, A. B. Labrique, and M. Rashid
Pregnancy and Lactation Hinder Growth and Nutritional Status of Adolescent Girls in Rural Bangladesh
J. Nutr., August 1, 2008; 138(8): 1505 - 1511.
[Abstract] [Full Text] [PDF]


Home page
Am. J. Clin. Nutr.Home page
L. L Iannotti, N. Zavaleta, Z. Leon, A. H Shankar, and L. E Caulfield
Maternal zinc supplementation and growth in Peruvian infants
Am. J. Clinical Nutrition, July 1, 2008; 88(1): 154 - 160.
[Abstract] [Full Text] [PDF]


Home page
Health Policy PlanHome page
A. H Baqui, A. M Rosecrans, E. K Williams, P. K Agrawal, S. Ahmed, G. L Darmstadt, V. Kumar, U. Kiran, D. Panwar, R. C Ahuja, et al.
NGO facilitation of a government community-based maternal and neonatal health programme in rural India: improvements in equity
Health Policy Plan., July 1, 2008; 23(4): 234 - 243.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.