Spatio-temporal co-occurrence of hotspots of tuberculosis, poverty and air pollution in Lima, Peru
Infectious Diseases of Poverty volume 9, Article number: 32 (2020)
Growing evidence suggests pollution and other environmental factors have a role in the development of tuberculosis (TB), however, such studies have never been conducted in Peru. Considering the association between air pollution and specific geographic areas, our objective was to determine the spatial distribution and clustering of TB incident cases in Lima and their co-occurrence with clusters of fine particulate matter (PM2.5) and poverty. We found co-occurrences of clusters of elevated concentrations of air pollutants such as PM2.5, high poverty indexes, and high TB incidence in Lima. These findings suggest an interplay of socio-economic and environmental in driving TB incidence.
In 2017, 10 million new cases of tuberculosis (TB) occurred worldwide  which constitute a major health burden that strains middle- and low-income countries. Many socio-economic factors within these countries are frequently associated with higher TB incidence such as poverty, unemployment, low income, overcrowding, and population density . It is well known that TB is prone to spatial aggregation often in poor areas of cities and can even be associated with a higher risk of infection, as observed in Southern Ethiopia where the risk is 4.16 times higher inside a cluster . The use of geographical surveillance in public health allows for the detection of areas with a high prevalence or incidence of a particular disease in order to identify socio-economic factors associated with the phenomenon . These methods have been applied to TB transmission . Spatial information contributes to appropriate decision-making with a more efficient budget and human resources allocation and has been used previously in infectious diseases to detect hotspots and epidemics .
Environmental factors such as pollution and suspended particles are considered to play important roles in the development of TB . This is explained by secretion clearance impairment by epithelial cells of the respiratory tract which is the primary defense mechanism against Mycobacterium tuberculosis .
Studies on spatio-temporal distribution of TB cases have principally addressed their association with demographic and geographical predictors for multidrug-resistant TB in Peru [9,10,11], nonetheless, their possible association with environmental factors such as air pollution have yet to be of interest. This descriptive study sought to determine the spatial distribution and clustering of TB cases in Lima, Peru and determine co-occurrence with clusters of PM2.5 and economic index.
Ecological analysis using Peruvian Ministry of Health (MoH) data of TB incidence cases from 2015 to 2017 and high-resolution fine particulate matter with aerodynamic diameter of 2.5 μm or smaller (PM2.5) from the National Aeronautics and Space Administration (NASA) .
Study area and population
This study was conducted in Lima, Peru. Lima has a population of almost 9 million in an area of 2672 km2. It is divided in 43 districts (Fig. 1a), some of which are the most densely populated districts in Peru. Overall, Lima is the most developed province in the country with the largest internal migration rate.
All new TB cases are reported in the 342 health centers of the MoH in Lima; new cases diagnosed in hospitals are reported from their corresponding health center. We obtained incidence data from 2015 to 2017. TB diagnosis in the health centers is based on clinical suspicion (cough for more than 2 weeks, fever, night sweats) and/or microbiological confirmation. All reported new pulmonary TB cases (smear positive or negative) were used for this study. The populations of each district were provided by the National Institute of Statistics and Informatics (INEI) via the REtrieval of DATa for small Areas by Microcomputer (REDATAM) platform; incidence rates were computed at district-level per 100 000 habitants. Data handling and formatting were performed using R software.
Particulate matter 2.5 μm (PM2.5)
Air quality data from the Socioeconomic Data and Applications Center (SEDAC) by NASA was used . SEDAC provides annual global surface concentrations (micrograms per cubic meter, μg/m3) of mineral dust and sea-salt filtered atmospheric PM2.5 [12, 14]. From 2015 to 2016, PM2.5 gridded datasets were provided at a spatial resolution of 0.01 degrees (~ 1.11 km at the equator) . This product is the computation of aerosol optical depth (AOD) from multiple satellite instruments including the NASA Moderate Resolution Imaging Spectroradiometer (MODIS), Multi-angle Imaging Spectro Radiometer (MISR), and the Sea-Viewing Wide Field-of-View Sensor (SeaWiFS). SEDAC used a GEOS-Chem chemical transport model to relate the total column measure of aerosol to near-surface PM2.5 concentration and a geographically weighted regression (GWR) with global ground-based measurements to predict and adjust for the residual PM2.5 bias per grid cell in the initial satellite-derived values, previously validated . We processed the raster data in Google Earth Engine (GEE) and summarized as the median PM2.5 value per year and district, scaled by 100.
Poverty level at the individual, household and neighborhood level was provided by INEI . The downscaling of poverty indicators  were conducted using data from the National Census and the National Survey of Households (ENAHO), a specialized survey that includes detailed information about incomes and expenditures. Information on level of poverty was only provided for 2016, however, no significant relative changes were observed in previous estimates from 2013 . Dalenius-Hodges method  was used to compute five poverty-stratum at individual, household and neighborhood levels. A dimension reduction was conducted using a principal component analysis (PCA) in order to assign a single poverty level value per district. Final poverty index was computed as the additive inverse of all principal components with an eigenvalue greater than 1, higher values reflecting an impoverished population and lower values a wealthy population.
Spatial autocorrelation of TB cases, poverty, and PM2.5 were assessed using global Moran’s I statistics to describe the overall spatial dependence in the entire study area. In addition, local Getis-Ord Gi* statistic (a type of Local Indicator of Spatial Association - LISA) was used to identify local patterns and high-risk areas. A first-order queen contiguity-based weighted neighborhoods (districts with contiguous boundaries) were used for Moran’s I and Getis-Ord Gi*. Gi* statistic was categorized based on the sign (cold or hotspot) and percentile (90, 95, 99%) to prevent bias due to multiple and dependent tests . Gi* statistic might be sensible to the fact that units near the edge would have fewer neighbors than those in the middle of the study area, known as edge-effect . However, there is no consensus on the edge-effects on hotspot detection using areal data and their correction methods. In this study area, the median number of neighbors for districts located at the edges is four and five for those located in the middle. Under this small difference, the spatial analysis did not account for edge-effect correction, however, precaution would be taken if clusters were near the border of the study area.
Kendall’s W test was performed to evaluate the co-occurrence between cluster categories of TB cases, poverty, and PM2.5 as previously described . Kendall’s W measure the concordance between two features, ranging from + 1 (complete agreement) to − 1 (no agreement) and was calculated for all combinations of two variables including TB cases (TB cases-poverty, TB cases-PM2.5) and the three at the same time. Legendre method  was used to compute a P-value based on Monte Carlo randomizations.
The spatial data processing, analysis, and visualization were performed using the ‘spdep’ and ‘sf’ packages; and Kendall’s W test were performed using the ‘synchrony’ package in R software.
The Gini Index was calculated to assess the disproportioned distribution of cases in Lima districts. The Gini coefficient is a common measure of the inequality among values of a frequency distribution (TB cases). It is defined as a ratio with values between 0 (perfect equality) and 1 (perfect inequality). The Gini coefficient was computed using the ‘ineq’ package in R software.
A negative binomial generalized linear mixed model (GLMM) was created to assess the importance of poverty level and PM2.5 as drivers of spatial variation in TB incidence rate across Lima. A baseline model was formulated as follows:
Where the TB rate for each district and year (log(ρst)) is modeled by 1) an intercept (α), 2) an exchangeable random effects for each year (γt(a)) to account for interannual variation in TB over time (yearly random effect); and 3) spatially unstructured (φs), and structured (υs) random effects using a convolution prior that combines area-specific overdispersion and a neighborhood dependency structure . Covariates (poverty level and PM2.5) were added to this model and model parameters were estimated within a Bayesian framework using Integrated Nested Laplace Approximation (INLA) , an alternative to Markov Chain Monte Carlo (MCMC) methods. Models were fitted using the ‘INLA’ package in R software.
Baseline characteristics and TB incidence
A total of 28 381 new pulmonary TB cases were reported during the study period (2015–2017) with stable yearly rates. The incidence rate at district-level ranged between 18.8–214, 20.2–216, and 0–199 cases per 100 000 habitants in 2015, 2016, and 2017, respectively (Fig. 1b). The average PM2.5 concentration at district-level was slightly higher in 2016 (29.5 μg/m3) than in 2015 (26.2 μg/m3). The average PM2.5 was highly heterogeneous across districts in Lima, ranging between 13.8–40.4 μg/m3 in 2015, and 16.5–44.8 μg/m3 in 2016.
Spatial clustering and co-occurrence
An overall strong spatial autocorrelation was observed during the 2015–2017 period for TB cases (Moran’s I range: 0.24–0.36, P < 0.01), PM2.5 (Moran’s I range: 0.55–0.56, P < 0.001), and poverty level (Moran’s I = 0.353, P = 0.003) (Supplementary Fig. 1). Stable high-risk clusters of TB cases were observed in the central-east part of Lima and low-risk clusters in the south-west (Fig. 2a). High-risk clusters of PM2.5 were detected in the south and central-east of Lima, with low-risk clusters located in the south. Furthermore, cold spots of poverty level (cluster of wealthy districts) were located in the central-west. Statistically significant co-occurrence of clusters (seven categories - cold spot 99% confidence, cold spot 95% confidence, cold spot 90% confidence, not significant, hot spot 99% confidence, hot spot 95% confidence, hot spot 90% confidence) of TB cases and PM2.5 (Kendall’s W = 0.596; P = 0.046), TB cases and poverty level (Kendall’s W = 0.4714; P = 0.003), and the three variables combined (Kendall’s W = 0.4606; P = 0.001) were observed.
Mixed-effects models and inequity indexes
A moderate concentration of TB cases among Lima districts were observed in the study period (Gini Index range = 0.27–0.29) (Fig. 2b). The general trends of TB cases with PM2.5 and poverty level are present in Fig. 2c. The slight decrease in TB incidence, in spite of the increasing poverty index, can be explained by the efforts of the MoH and National TB Program towards TB control among low-resource districts. The spatio-temporal Bayesian mixed-effects negative binomial multivariate regression shows that PM2.5 (adjusted relative risk [ARR] = 1.31; 95% credible interval [CI]: 1.17–1.50) and poverty level (ARR = 1.14; 95% CI: 1.11–1.17) were associated with TB incidence rate in Lima, accounting for the spatio-temporal structure of the districts (Table 1); spatial and temporal random effects are shown in Fig. 3.
This study describes the skewed spatial distribution of TB cases in Lima. A strong spatio-temporal clustering of poverty, environmental fine particulate matter PM2.5 and incident TB cases was observed during the study period. Importantly, clusters of elevated PM2.5 concentrations, poverty, and high TB incidence significantly co-occur during the 2015–2016 period, suggesting that socio-economic determinants and environmental conditions interplay as important components of transmission and clustering of TB in this setting.
Lima is the capital with one of highest ambient air pollution levels in Latin America ; however, its burden on health remains largely unknown. Previous studies in Peru have explored the effect of air pollution as a determinant of asthma . The most important effects of air pollution manifest in the upper respiratory tract where it alters immune response , contributing as a susceptibility factor for various respiratory diseases . One study showed impaired the expression of CD69, IFN-γ and TNF-α when human peripheral blood mononuclear cells are exposed to Mycobacterium tuberculosis . Likewise, exposure to air pollution has been linked to many substantial adverse effects on human health; in particular people with chronic respiratory diseases such as asthma and chronic obstructive pulmonary disease.
These findings are consistent with previous studies supporting the hypothesis of air pollution as an environmental determinant of TB . One study in Jiangsu (China) showed association between TB and long-term pollution exposure (using PM2.5, PM10, SO2 and NO2 measurements); similar findings were found in North Carolina for long-term pollution exposure (PM10 and PM2.5) . Short-term exposure to outdoor pollution also seems to be a factor for active TB  and can also increase the risk for TB infection . Poverty condition also seems to be a key variable for unhealthy concentrations of PM2.5 and high-risk of TB alike. Although poverty has been proven to be a driver of the TB epidemic, it is also an important determinant of outdoor and indoor air pollution. The concentrations of the pollutants stated above are often highest largely in the urban areas of low- and middle-income countries . Rapid rural-urban migration, as experienced in Lima, has created overcrowded districts characterized by poverty and increased air pollution concentrations . The inequalities of TB are seldom looked at from an environmental perspective; however, these findings suggest the complex interaction between socio-economic factors and environment pollution in the transmission of TB in Lima.
Some limitations are acknowledged in this study. First, the high spatial resolution of the NASA-SEDAC PM2.5 estimates could not be fully harnessed due to the coarse spatial resolution of the TB data at the district level. Some districts present a high heterogeneity in PM2.5 between sub areas that were not included in this study. Additionally, NASA-SEDAC estimates lack a high temporal resolution; only yearly estimates were provided and seasonal variations in PM2.5 have been reported . Also, the effect of long-term exposure to PM2.5 and other markers such as PM10, SO2 and NO2 were not evaluated due missing information. Finally, TB reporting system (SIGTB) was established in 2015 and NASA-SEDAC only provided data until 2016, resulting in a short study period to observe the relation between PM2.5 and TB incidence.
This study describes the co-occurrence of clusters of elevated concentrations of air pollution (measured by PM2.5), poverty and high-risk areas of TB in Lima. These findings support previous studies and address the interplay of socio-economic and environmental drivers of TB incidence that will help to tailor TB control interventions. Further studies should be done to confirm these results at the individual level.
Availability of data and materials
All datasets are presented in the main paper and the supplementary materials. Data sharing is not applicable to this article as no datasets were generated or analyzed during the current study.
- PM2.5 :
Fine particulate matter
Ministry of Health
National Aeronautics and Space Administration
National Institute of Statistics and Informatics
Data Recuperation for Small Areas by Microcomputers
Socioeconomic Data and Applications Center
Aerosol Optical Depth
Moderate Resolution Imaging Spectroradiometer
Multi-angle Imaging SpectroRadiometer
Sea-Viewing Wide Field-of-View Sensor
Geographically Weighted Regression
Google Earth Engine
National Survey of Households
Principal Component Analysis
Local Indicator of Spatial Association
Generalized linear mixed model
Integrated Nested Laplace Approximation
Markov Chain Monte Carlo
- SO2 :
- NO2 :
World Health Organization. World Tuberculosis Report 2018. 2018.
Trauer JM, Dodd PJ, Gomes MGM, Gomez GB, Houben RMGJ, McBryde ES, et al. The importance of heterogeneity to the epidemiology of tuberculosis. Clin Infect Dis Off Publ Infect Dis Soc Am. 2019;69:159–66.
Tadesse S, Enqueselassie F, Hagos S. Spatial and space-time clustering of tuberculosis in Gurage zone, Southern Ethiopia. PLoS One. 2018;13:e0198353.
Dominkovics P, Granell C, Pérez-Navarro A, Casals M, Orcau A, Caylà JA. Development of spatial density maps based on geoprocessing web services: application to tuberculosis incidence in Barcelona, Spain. Int J Health Geogr. 2011;10:62.
Shaweno D, Karmakar M, Alene KA, Ragonnet R, Clements AC, Trauer JM, et al. Methods used in the spatial analysis of tuberculosis epidemiology: a systematic review. BMC Med. 2018;16:193.
Wang X, Yin S, Li Y, Wang W, Du M, Guo W, et al. Spatiotemporal epidemiology of, and factors associated with, the tuberculosis prevalence in northern China, 2010-2014. BMC Infect Dis. 2019;19:365.
Smith G, Schoenbach VJ, Richardson DB, Gammon MD. Particulate air pollution and susceptibility to the development of pulmonary tuberculosis disease in North Carolina: an ecological study. Int J Environ Health Res. 2014;24:103–12.
Huff RD, Carlsten C, Hirota JA. An update on immunologic mechanisms in the respiratory mucosa in response to air pollutants. J Allergy Clin Immunol. 2019;143:1989–2001.
Shah L, Choi HW, Berrang-Ford L, Henostroza G, Krapp F, Zamudio C, et al. Geographic predictors of primary multidrug-resistant tuberculosis cases in an endemic area of Lima. Peru. 2014;19:1307–14.
Lin H-H, Shin SS, Contreras C, Asencios L, Paciorek CJ, Cohen T. Use of spatial information to predict multidrug resistance in tuberculosis patients, Peru. Emerg Infect Dis. 2012;18:811–3.
Zelner JL, Murray MB, Becerra MC, Galea J, Lecca L, Calderon R, et al. Identifying hotspots of multidrug-resistant tuberculosis transmission using spatial and molecular genetic data. J Infect Dis. 2016;213:287–94.
van Donkelaar A, Martin RV, Brauer M, Hsu NC, Kahn RA, Levy RC, et al. Global estimates of fine particulate matter using a combined geophysical-statistical method with information from satellites. Environ Sci Technol. 2016;50:3762.
SEDAC. Global Annual PM2.5 Grids from MODIS, MISR and SeaWiFS Aerosol Optical Depth (AOD) with GWR, v1: Satellite-Derived Environmental Indicators. Available from: https://sedac.ciesin.columbia.edu/data/set/sdei-global-annual-gwr-pm2-5-modis-misr-seawifs-aod. Cited 2019 Dec 19.
van Donkelaar A, Martin RV, Brauer M, Hsu NC, Kahn RA, Levy RC, et al. Global Annual PM2. 5 Grids from MODIS, MISR and SeaWiFS Aerosol Optical Depth (AOD) with GWR, 1998-2016. Palisades NY: NASA Socioeconomic Data and Applications Center (SEDAC). 2018.
van Donkelaar A, Martin RV, Brauer M, Hsu NC, Kahn RA, Levy RC, et al. Global estimates of fine particulate matter using a combined geophysical-statistical method with information from satellites, models, and monitors. Environ Sci Technol. 2016;50:3762–72.
INEI. Perú: Planos Estratificados de Lima Metropolitana a Nivel de Manzanas según Ingreso Per Cápita del Hogar 2016. Available from: https://www.inei.gob.pe/media/MenuRecursivo/publicaciones_digitales/Est/Lib1403/index.html. Cited 2019 Dec 19.
Elbers C, Lanjouw JO, Lanjouw P. Micro–level estimation of poverty and inequality. Econometrica. 2003;71:355–64.
INEI Instituto Nacional de Estadística e Informática. Mapa de pobreza provincial y distrital 2013. 2015; Available from: http://repositorio.minedu.gob.pe/handle/123456789/3638. Cited 2019 Dec 19.
Dalenius T, Hodges JL Jr. Minimum variance stratification. J Am Stat Assoc. 1959;54:88–101.
Caldas de Castro M, Singer BH. Controlling the false discovery rate: a new application to account for multiple and dependent tests in local statistics of spatial association. Geogr Anal. 2006;38:180–208.
Griffith DA. The boundary value problem in spatial statistical analysis. J Reg Sci. 1983;23:377–87.
Bisanzio D, Dzul-Manzanilla F, Gomez-Dantés H, Pavia-Ruz N, Hladish TJ, Lenhart A, et al. Spatio-temporal coherence of dengue, chikungunya and Zika outbreaks in Merida, Mexico. PLoS Negl Trop Dis. 2018;12:e0006298.
Legendre P. Species associations: the Kendall coefficient of concordance revisited. J Agric Biol Environ Stat. 2005;10:226.
Besag J, Green P, Higdon D, Mengersen K. Bayesian computation and stochastic systems. Stat Sci. 1995;10:3–41.
Rue H, Martino S, Chopin N. Approximate Bayesian inference for latent Gaussian models by using integrated nested Laplace approximations. J R Stat Soc Ser B Stat Methodol. 2009;71:319–92.
Bell ML, Cifuentes LA, Davis DL, Cushing E, Telles AG, Gouveia N. Environmental health indicators and a case study of air pollution in Latin American cities. Environ Res. 2011;111:57–66.
Orellano P, Quaranta N, Reynoso J, Balbi B, Vasquez J. Association of outdoor air pollution with the prevalence of asthma in children of Latin America and the Caribbean: a systematic review and meta-analysis. J Asthma Off J Assoc Care Asthma. 2018;55:1174–86.
Schraufnagel DE, Balmes JR, Cowl CT, De Matteis S, Jung S-H, Mortimer K, et al. Air pollution and noncommunicable diseases: a review by the forum of international respiratory societies’ environmental committee, part 1: the damaging effects of air pollution. Chest. 2019;155:409–16.
Ibironke O, Carranza C, Sarkar S, Torres M, Choi HT, Nwoko J, et al. Urban air pollution particulates suppress human T-cell responses to mycobacterium tuberculosis. Int J Environ Res Public Health. 2019;16:21.
Popovic I, Soares Magalhaes RJ, Ge E, Marks GB, Dong G-H, Wei X, et al. A systematic literature review and critical appraisal of epidemiological studies on outdoor air pollution and tuberculosis outcomes. Environ Res. 2019;170:33–45.
Zhu S, Xia L, Wu J, Chen S, Chen F, Zeng F, et al. Ambient air pollutants are associated with newly diagnosed tuberculosis: a time-series study in Chengdu, China. Sci Total Environ. 2018;631–632:47–55.
Lai TC, Chiang CY, Wu CF, Yang SL, Liu DP, Chan CC, et al. Ambient air pollution and risk of tuberculosis: a cohort study. Occup Environ Med. 2016;73:56–61.
World Health Organization. Ambient (outdoor) air pollution in cities database 2014. 2014. Retrieved World Health Organ. http://www.WhoIntphehealthtopicsoutdoorairdatabasescities-2014en.
Kristiansson M, Sörman K, Tekwe C, Calderón-Garcidueñas L. Urban air pollution, poverty, violence and health--neurological and immunological aspects as mediating factors. Environ Res. 2015;140:511–3.
Cesari D, De Benedetto GE, Bonasoni P, Busetto M, Dinoi A, Merico E, et al. Seasonal variability of PM2.5 and PM10 composition and sources in an urban background site in southern Italy. Sci Total Environ. 2018;612:202–13.
We thank the Ministry of Health and the Directorate of Prevention and Control of TB (DPCTB) for providing such useful data to researchers.
Gabriel Carrasco-Escobar was supported by NIH/Fogarty International Center Global Infectious Diseases Training Program (D43 TW007120). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Ethics approval and consent to participate
Consent for publication
The authors declare that they have no competing interests.
About this article
Cite this article
Carrasco-Escobar, G., Schwalb, A., Tello-Lizarraga, K. et al. Spatio-temporal co-occurrence of hotspots of tuberculosis, poverty and air pollution in Lima, Peru. Infect Dis Poverty 9, 32 (2020). https://doi.org/10.1186/s40249-020-00647-w
- Hotspots, air pollution