Skip to content


  • Research Article
  • Open Access

Spatial and temporal clustering analysis of tuberculosis in the mainland of China at the prefecture level, 2005–2015

Infectious Diseases of Poverty20187:106

  • Received: 2 April 2018
  • Accepted: 4 October 2018
  • Published:



Tuberculosis (TB) is still one of the most serious infectious diseases in the mainland of China. So it was urgent for the formulation of more effective measures to prevent and control it.


The data of reported TB cases in 340 prefectures from the mainland of China were extracted from the China Information System for Disease Control and Prevention (CISDCP) during January 2005 to December 2015. The Kulldorff’s retrospective space-time scan statistics was used to identify the temporal, spatial and spatio-temporal clusters of reported TB in the mainland of China by using the discrete Poisson probability model. Spatio-temporal clusters of sputum smear-positive (SS+) reported TB and sputum smear-negative (SS-) reported TB were also detected at the prefecture level.


A total of 10 200 528 reported TB cases were collected from 2005 to 2015 in 340 prefectures, including 5 283 983 SS- TB cases and 4 631 734 SS + TB cases with specific sputum smear results, 284 811 cases without sputum smear test. Significantly TB clustering patterns in spatial, temporal and spatio-temporal were observed in this research. Results of the Kulldorff’s scan found twelve significant space-time clusters of reported TB. The most likely spatio-temporal cluster (RR = 3.27, P <  0.001) was mainly located in Xinjiang Uygur Autonomous Region of western China, covering five prefectures and clustering in the time frame from September 2012 to November 2015. The spatio-temporal clustering results of SS+ TB and SS- TB also showed the most likely clusters distributed in the western China. However, the clustering time of SS+ TB was concentrated before 2010 while SS- TB was mainly concentrated after 2010.


This study identified the time and region of TB, SS+ TB and SS- TB clustered easily in 340 prefectures in the mainland of China, which is helpful in prioritizing resource assignment in high-risk periods and high-risk areas, and to formulate powerful strategy to prevention and control TB.


  • Tuberculosis
  • Space-time cluster
  • SaTscan
  • Prefecture
  • China

Multilingual abstracts

Please see Additional file 1 for translations of the abstract into the six official working languages of the United Nations.


Tuberculosis (TB) remains one of the most severe infectious disease worldwide with about 10.4 million new cases in 2016 [1]. Especially in China, there were about 0.8 million incident cases reported which alone contributed to 12% of the global reported TB incident with the number of 6.3 million. Additionally, the number of new TB cases in China was just less than India and Indonesia, ranking the third place worldwide in 2016 [1]. Though great achievements have been made for TB control work in recent two decades [2], it was still difficult to achieve the common aim of World Health Organization (WHO)'s End TB - to end the global TB epidemic for the period of 2016–2035. So the precise clustering results in spatial, temporal and spatio-temporal of TB would be helpful to renew national TB strategy of prevention and control.

TB is an airborne infectious disease with spatial autocorrelation in distribution at the international, national, provincial and even smaller levels during certain periods of time. A region with a high risk of TB would affect its neighboring areas. Global and local Moran’s I spatial autocorrelation analysis are the commonly used methods to detect whether there exist spatial autocorrelations and where the particular areas are located, separately. However, these methods could only evaluate the distribution characteristic of the disease in specific time point but unable to analyse continuous time [35]. As we all know, time is a non-negligible factors that might bias the conclusion directly. The Kulldorff’s space-time scan statistics is a more comprehensive method that could take both spatial and temporal distribution into consideration and set parameters more flexibly. More and more epidemiology studies of infectious disease have used this method widely [68]. In China, several studies about the spatio-temporal distribution of TB just in a specific province/municipality such as Beijing [9], Yunnan [10], Qinghai [11] or in the nationwide but at the provincial level [12] were analysed. Therefore, cluster analysis of nationwide TB at a more precise level is urgently needed. And study on every prefecture in the mainland of China has not been reported, so we carry out our research at the prefecture level.

Moreover, previous studies only took all TB cases as a whole and didn’t distinguish sputum smear-positive (SS+) reported TB and sputum smear-negative (SS-) reported TB. SS+ TB not only can seriously damage the patient’s own health, but also can infect others. Studies found that about 70% of individuals with SS+ TB died without regular treatment within 10 years of being diagnosed [1]. Besides, the strong infectivity of SS+ TB can easily cause the outbreak in the crowd gathering area. Though the infectivity of SS- TB is weaker than that of SS+ TB, SS- TB is difficult to diagnose and can revert back to SS+ TB without standard treatment [13], which virtually adds the difficulty to control and eliminate it. Therefore, we need to pay much attention to both SS+ TB and SS- TB.

Analyzing and evaluating the spatio-temporal patterns of TB in the mainland of China at the prefecture level is necessary for TB control and elimination. The aim of our study was to use the Kulldorff’s space-time scan statistical method to explore the spatial, temporal and spatio-temporal distribution characteristics of reported TB at the prefecture level in the mainland of China from 2005 to 2015. What’s more, spatio-temporal analysis of SS+ reported TB and SS- reported TB were also detected to summarize their variation laws.


TB data

Data of TB cases in 340 prefectures in the mainland of China (excluding Taiwan, Hong Kong and Macao) from 2005 to 2015 were collected from the China Information System for Disease Control and Prevention (CISDCP). Three hundred forty prefectures included 287 prefecture-level cities, 30 autonomous prefectures, 17 regions, 3 leagues and 3 provincial-controlled divisions, which covered the entire mainland of China (Additional file 2). Demographic data of every city in every year were extracted from China statistical yearbooks (2006–2016) ( TB is one of the most serious infectious diseases in China. It is mandated that each case of TB must be reported online within 24 h after diagnosis. Cases of TB were diagnosed using radiography, pathogen detection, and pathologic diagnosis, based on the diagnostic criteria recommended by the National Health and Family Planning Commission of China.

A total of 10 271 169 incident cases of TB were reported across 2921 hospitals and medical institutions in the mainland of China from January 2005 to December 2015. In this study, 10 200 528 cases aggregated at the prefecture-level city monthly were analyzed to detect the spatio-temporal high-risk areas of TB. And 70 641 cases without detailed information on the residential address were excluded from the analysis. The annual average reported cases were 927 321 during 2005 to 2015. And the percentage of the reported cases included in this study were 99.31%. In addition, the 10 200 528 TB cases included 5 283 983 SS- TB cases and 4 631 734 SS+ TB cases with specific sputum smear results, 284 811 cases without sputum smear test. The missing value percentage of the sputum smear test result was 2.79%. Of course, there were often outbreaks of tuberculosis in some regions with high TB incidence in almost every year especially in schools [1416].

Statistical methods

Firstly, Global Moran’s I index was used to determine whether there is a global spatial autocorrelation between 340 prefectures. The value of Global Moran’s I varies between − 1 and 1. A higher positive Moran’s I indicates that values in neighboring positions tend to cluster, while a lower negative Moran’s I implies that higher and lower values are interspersed. When Moran’s I is near 0, there is no spatial clustering, meaning that the data are randomly distributed [17]. The global spatial autocorrelation analysis was conducted by the ArcGIS 10.2 software (ESRI Inc. Redlands, CA, USA) using the packages of Spatial Autocorrelation. Z-score and P-value were calculated to evaluate the significance of Global Moran’s I.

Then, Kulldorff’s space-time scan statistical analysis was used to explore the temporal, spatial, and spatial-temporal clusters of TB as well as to verify whether the geographic clustering of TB was caused by random variation or not [18]. The discrete poisson probability model was used for scanning since the TB incidence was not very high [19]. We used the radius of the population coverage instead of the geographical radius in this study because the population in several areas was very small [11]. The space-time scan statistics were defined by a cylindrical window with a circular geographic base and with height corresponding to time. Then the cylindrical window was moved in space and time so that for each potential geographical location and size it also visited each possible time period. For each location and size of the scanning window, the alternative hypothesis was that there was an elevated risk within the window as compared to the outside [8]. The most likely cluster is the window with the maximum likelihood while the rest clusters with statistically significant log-likelihood ratios (LLR) were defined as the secondary likely clusters. The P value of LLR can be estimated using Monte Carlo method and the number of replications was limited to 999 [20]. The relative risk (RR) is the estimated risk within the cluster divided by the estimated risk outside the cluster [21]. In mathematical notation, it is:

$$ RR=\frac{c/E\left[c\right]}{\left(C-c\right)/\Big(E\left[C\right]-E\left[c\right]}=\frac{c/E\left[c\right]}{\left(C-c\right)/\left(C-E\left[c\right]\right)} $$

Where c is the number of observed cases within the cluster and C is the total number of cases in the data set. Note that since the analysis is conditioned on the total number of cases observed, E[C] = C.

The selection of the maximum radius of the spatial scanning window and the maximum length of the temporal scanning window were very important since the results of spatio-temporal scan are sensitive to them [22]. The default setting of the window sizes and temporal sizes were usually set as 50% but some studies questioned whether it is suitable [23]. For example, a high false positive rate might emerge if the window size is too large because some low-risk areas were included. Similarly, a high false negative rate would emerge if the window size is too small [11]. Many researches have explored how to choose an appropriate scanning window, and the mainly rules of those studies were reducing the overlapping areas or a single cluster should no more than 15% of the whole study area when using the irregular scan statistic [24, 25]. And some studies also used this standards in regular scan statistics [11]. So we used this experience for reference in our study. In order to select the optimal parameter of spatial cluster sizes, we analyzed the data of 2005 setting the maximum sizes from 4 to 50% of total population at risk by increments of 1%. If there were fewer overlaps between the areas defined by the radius, or the biggest area covered less than 15% of all the prefectures [11, 24, 25], the radius was considered as an optimal radius for analysis. When the maximum sizes set 4% to 11%, there are fewer overlaps and the biggest area covered no more than 15% of all the prefectures. The number of overlaps ranged from 20 to 26 and the number of prefectures covered by the biggest areas is 23. When the maximum sizes set from 12 to 19%, the number of overlaps reached the biggest of 30. And when the maximum sizes reached and exceeded 20%, the number of prefectures covered by the biggest areas increased more than 15% (51 prefectures) of all the prefectures. So the maximum spatial cluster size was set as 11% eventually. As for the temporal windows, if we set it as the default of 50%, both the pure temporal analysis and spatio-temporal analysis showed the clustering time from the early of 2005 to 2010. The results were not precise and specific, and a high false positive rate might emerge. So we must choose a more optimal temporal window in this study. Previous studies in China reported [10, 11, 26] that the temporal cluster of TB incidents were mainly concentrated in the spring and early summer, so some studies set 30% as the temporal window to do clustering analysis, which means that the maximum scan time length was 3 months for each year. Time series analysis of our study also found the epidemic characteristics of TB with the high incidence were from March to May. Based on the comprehensive summary of previous studies and the temporal characteristics of our study, the maximum temporal cluster size was set as 30% in this study in order to get closer to the real situation in China. Finally, we selected the spatial window covering 11% of the population at risk and the temporal window covering 30% of the whole study period to do our research. The statistical analysis was conducted by the SaTScan 9.3 software ( and P-value less than 0.05 were considered statistically significant.

In addition, time series seasonal decomposition analysis was conducted to identify the seasonality of TB incidence in the mainland of China [2729]. The time series of reported TB cases were decomposed into seasonal variation, long-term trend and random effect to explore the temporal patterns.


Global spatial autocorrelation analysis by global Moran’s I index

Significant global spatial autocorrelation existed in the reported TB cases in every year and the average time from 2005 to 2015 (all P <  0.001) with the Moran’s I index ranging from 0.144 to 0.289. (Table 1). So further spatio-temporal clustering analysis of TB were needed.
Table 1

Global spatial autocorrelation analysis of reported tuberculosis in the mainland of China from 2005 to 2015


Moran’s I index






<  0.001




<  0.001




<  0.001




<  0.001




<  0.001




<  0.001




<  0.001




<  0.001




<  0.001




<  0.001




<  0.001




<  0.001

Distribution of reported TB spatial clustering

Spatial clustering of the entire 11 years (2005–2015) identified a total of 166 prefectures were statistically significant high-risk areas with the spatial window covering 11% of the population at risk (Fig. 1). Spatial clustering analysis of every year from 2005 to 2015 found that the most likely clusters mainly distributed in two regions. One region was located in the southwest of Xinjiang Uygur Autonomous Region including Kashi Prefecture, Aksu Prefecture, Shihezi Prefecture, Hotan Prefecture and Kirgiz Autonomous Prefecture. The other region contained seven prefectures located in the southeast of Guangdong Province known as Guangzhou, Shenzhen, Zhuhai, Foshan, Huizhou, Dongguan and Zhongshan prefecture in the year of 2009, 2010 and 2011. The second likely clusters mainly scattered in some prefectures of northeastern China, western China and some central regions of China. Spatial clustering characteristics in every year from 2005 to 2015 were similar to the results in the total period with the most likely clusters distributed in the southwestern Xinjiang in most years. What’s more, the clustering areas presented a decreased tendency in recent years especially after 2010 (Fig. 1). The reported TB incidence in every year and the total research time emerged the same characteristics as the spatial clustering analysis (Additional file 3). That is to say, the high TB incidence prefectures were also mainly located in Xinjiang, Guangdong and so on. And the TB incidence presented a declining tendency. Spatial analysis at the prefecture level fixed the clustering location more clearly and more precisely crossing the provincial boundaries.
Fig. 1
Fig. 1

Spatial clustering of reported tuberculosis in the mainland of China from 2005 to 2015

Distribution of reported TB temporal clustering

The time series seasonal decomposition analysis of TB incidence showed a significant seasonal periodicity (Fig. 2). The temporal cluster analysis in every year showed the similar results to previous studies that TB incidence were concentrated in spring and early summer annually, ranging from February to June. The high aggregated time for TB in the whole study period was observed from March 2006 to May 2009. During this period, a total of 3 340 869 TB cases were reported, and the risk of TB related incidents was 19% (RR = 1.19, P <  0.001) higher than that in other time periods (Table 2). In addition, there was a declining trend for TB incidence generally in the study period but a slowly increase trend emerged from 2005 to 2007. (Fig. 2c).
Fig. 2
Fig. 2

The seasonal distribution of monthly tuberculosis in the mainland of China from 2005 to 2015

Table 2

Temporal clustering of reported tuberculosis in the mainland of China from 2005 to 2015


Cluster time frame

Observed cases

Expected cases





1 April–30 June 2005

286 531

231 097.37



<  0.001


1 March–31 May 2006

294 559

251 484.05



<  0.001


1 March–31 May 2007

298 483

258 417.27



<  0.001


1 March–31 May 2008

308 670

259 215.05



<  0.001


1 February–30 April 2009

275 579

239 579.59



<  0.001


1 March–31 May 2010

271 292

235 979.06



<  0.001


1 March–31 May 2011

264 335

229 929.34



<  0.001


1 March–31 May 2012

261 592

226 299.24



<  0.001


1 March–31 May 2013

242 821

215 559.30



<  0.001


1 March–31 May 2014

231 538

208 219.87



<  0.001


1 March–30 April 2015

152 886

134 380.27



<  0.001


1 March 2006–31 May 2009

3 340 869

295 7242.48


34 214.95

<  0.001

RR Relative risk, LLR Log-likelihood ratios

Distribution of reported TB spatio-temporal clustering

The results of spatio-temporal cluster analysis for reported TB in 340 prefectures of mainland China from 2005 to 2015 were shown in Table 3 and Fig. 3. A most likely cluster area and eleven secondary cluster areas were detected in the study. The most likely spatio-temporal cluster area was located at the southwestern of Xinjiang Uygur Autonomous Region and the high-risk time was from September 2012 to October 2015 (RR = 3.27, P <  0.001). Besides, the area centered at Kirgiz Autonomous Prefecture (39.608093 N, 76.162029 E) with a radius of 523.07 km covered five prefectures just like the most likely cluster in the purely spatial analysis. Other eleven secondary clusters were mainly distributed in the western China, northeastern China and several relatively small areas in the center China. And the cluster time frames mainly ranged from March 2005 to June 2009 except the most likely cluster and secondary cluster1.
Table 3

Spatio-temporal clustering of reported tuberculosis in the mainland of China from 2005 to 2015

Cluster type

Cluster time frame



Observed cases

Expected cases




Most likely cluster

1 September 2012–30 November 2015

(39.608093 N, 76.162029 E) / 523.07 km


73 552

22 572.11


36 033.32

<  0.001

Secondary cluster1

1 April 2009–30 June 2012

(22.898947 N, 113.887573 E) / 105.39 km


110 645

50 279.77


27 083.00

<  0.001

Secondary cluster2

1 April 2006–30 June 2009

(26.425756 N, 108.429245 E) / 382.47 km


351 481

238 013.99


24 198.12

<  0.001

Secondary cluster3

1 March 2005–31 May 2008

(28.754131 N, 117.356980 E) / 370.23 km


411 685

311 241.80


15 212.14

<  0.001

Secondary cluster4

1 March 2006–31 May 2009

(51.862772 N, 124.098025 E) / 1064.00 km


250 998

176 479.67


14 170.70

<  0.001

Secondary cluster5

1 June 2005–30 June 2005

(34.621634 N, 113.458645 E) / 201.59 km


13 933




<  0.001

Secondary cluster6

1 March 2006–31 May 2009

(33.910834 N, 93.584973 E) / 923.22 km


64 318

39 937.86



<  0.001

Secondary cluster7

1 March 2005–31 May 2008

(31.993508 N, 107.064020 E) / 211.26 km


198 045

154 600.80



<  0.001

Secondary cluster8

1 March 2005–31 May 2008

(41.492975 N, 110.349385 E) / 377.86 km


65 861

49 956.60



<  0.001

Secondary cluster9

1 March 2005–31 May 2005

(34.554155 N, 119.103035 E) / 195.95 km


11 829

10 452.97



<  0.001

Secondary cluster10

1 March 2008–31 May 2008

(39.466843 N, 122.299835 E) / 223.74 km






<  0.001

Secondary cluster11

1 December 2005–31 December 2005

(37.483730 N, 117.822365 E) / 0 km






<  0.001

Most likely cluster: P-value < 0.001; Secondary cluster: P-value < 0.001

N number of prefectures in the cluster, RR Relative risk, LLR Log-likelihood ratios

Fig. 3
Fig. 3

Spatio-temporal clustering of reported tuberculosis in the mainland of China from 2005 to 2015

Distribution of SS+ reported TB and SS- reported TB spatio-temporal clustering

Spatio-temporal clustering analysis showed that the most likely clusters of both SS+ reported TB and SS- reported TB were distributed in the southwestern of Xinjiang such as Kirgiz Autonomous Prefecture, Kashi Prefecture, Aksu Prefecture, Shihezi Prefecture and Hotan Prefecture, just like the clustering regions of total TB. But there were some differences in the distribution of secondary clusters between SS+ reported TB and SS- reported TB as shown in Fig. 4. In addition, the clustering time of SS+ reported TB was concentrated before 2010 while SS- reported TB was mainly concentrated after 2010. Additional files shows this in more detail (see Additional files 4 and 5).
Fig. 4
Fig. 4

Spatio-temporal clustering of SS+ reported tuberculosis (a) and SS- reported tuberculosis (b) in the mainland of China from 2005 to 2015


Kulldorff’s retrospective scan statistics is one of the most powerful method to evaluate the spatial and temporal distribution of routinely collected data [11, 20] especially infectious diseases. Many countries have applied this method to tuberculosis research. A study from Canada used this method to reveal a complex coexistence of spatial and cohort clustering with the time of 1990 to 2013, and provided the basis for public health response [30]. Studies from Ghana [31] and American [32] about Mycobacterium tuberculosis strains or genotypes also found the spatio-temporal characteristics based on this method, which could guide the formulation of TB control and prevention policies. Though there were several studies in China have explored the distribution of TB, they were only restricted to a certain province or certain area, or the nationwide research just at the provincial level not the prefecture level. Research of a single province could find the specific clusters in that province but cannot get the clustering conditions in the whole country. And as for the nationwide research, we can get the clustering provinces in the whole country but cannot focus on a more specific area of the province. In this study, Geographic Information System (GIS) and Kulldorff’s scan statistical analysis were used to examine the changing patterns and clusters of TB between 2005 and 2015 in 340 prefectures in the mainland of China. The results detected a significantly spatial, temporal, and space-time clustering distribution of TB. Furthermore, spatio-temporal clusters of SS+ TB and SS- TB were also found in this research.

The spatial scanning results indicated that the spatial clusters existed in every year from 2005 to 2015 as well as the entire 11 years. The distribution of the clustering regions were similiar in each year, which meant the most likely clusters mainly included five prefectures of southwestern Xinjiang and seven prefectures of southeastern Guangdong. The main reasons of the high TB risk in five prefectures (Kashi, Aksu, Shihezi, Hotan and Kirgiz Autonomous Prefecture) of Xinjang are relatively poor economy, high proportion of minorities and underdeveloped medical level [33, 34]. It was no doubt that Guangdong is a developed and prosperous province, but the immigrants accounts for a large part of the population especially in the seven prefectures (Guangzhou, Shenzhen, Zhuhai, Foshan, Huizhou, Dongguan and Zhongshan). For example, approximately three fourths population in Shenzhen are immigrants and the reported TB incidence in Shenzhen are 381.3 per 100 000 from 2006 to 2010. Immigrants maybe one of the most important factors of the high TB incidence in Gungdong Province [35]. The secondary clusters were scattered in almost every province in every year, but both the number of clusters and the reported TB incidences showed a downward trend in recent years owing to the effective prevention and control measures. In short, the five prefectures in Xinjiang and seven prefectures in Guangdong remain the key areas for TB prevention and control in the future. Other clustering areas also should take appropriate measures corresponding to the specific reasons to control TB, and achieve the “End TB Strategy” in China earlier.

The temporal scanning results displayed the high risk month of TB in every year from 2005 to 2015 and the entire 11 years. The epidemic season of TB in the mainland of China for each year were basically the same, which mainly concentrated in the spring and the early summer between February to June. This result was consistent with other previous studies in the United States [28], northern India [36], Wuhan City [26] and Yunnan Province [10] in China. In winter, the reduction on exposure to ultraviolet from sunlight and the deficiency of vitamin D, the increase of serious pollution days, more time in indoor activities with poor ventilation [37, 38], all of those could raise the risk of TB infection. Additionally, the Chinese traditional festival called Spring Festival is another special reason that causes the delaying health-seeking. It usually takes some weeks even several month from TB infection to symptom onset and diagnosis [39]. That’s why the temporal clustering time were in spring and early summer. Of course we should keep a close eye on the peak period of TB so as to reduce the TB incidence in future. What’s more, the long term trend of reported TB in 2005–2015 showed a declining trendency on the whole, which indicated the achievements of TB have been made in recent years.

Spatio-temporal cluster analysis identified five prefectures in the southwestern of Xinjiang Uygur Autonomous Region from September 2012 to October 2015 were the most likely cluster. As a matter of fact, every year’s TB incidences in the southwestern of Xinjiang were very high. The clustering time were concentrated in September 2012 to October 2015, which indicated that prevention and control measures in this region still need to be strengthened not only in the distant past, but also in recent years. Other secondary clusters distributed in the western China, northeastern China and scattered in the center China mainly from March 2005 to June 2009, which meant these clusters were once the high TB incidence regions but took a favourable turn in recent years. The spatio-temporal cluster analysis about SS+ reported TB and SS- reported TB also found that prefectures in the southwestern of Xinjiang were the most likely clusters which was the same as the results of all the TB cases analysis, but the clustering time of SS+ reported TB was from 2005 to 2010 while the clustering time of SS- reported TB was mainly from 2010 to 2015. Although prevention and control of TB, such as directly observed treatment strategy (DOTS) have made great progress, we still need to attach much importance to both SS+ TB and SS- TB. As for the strong infectious SS+ TB, the clustering results could guide us to prevent and discover the outbreak and epidemic of TB. The TB incubation period of children is short, so if known the aggregation characteristics of SS+ TB, we located the clustering time and space, and make early corresponding measures. As for the SS- TB, though it doesn’t have the strong infectivity, it has great potential hazards. Moreover, SS- TB is difficult to diagnose because no bacteria can be seen in the smear and on L-J medium. So one of the most challenging questions is how to improve the diagnostic sensitivity and accuracy of SS- TB. There are many new techniques have arisen in recent years due to the development of bio-sensing techniques and the device platform for TB detection. Advancements in transduction and nanotechnology in biosensors such as optical detection techniques and electrochemical detection techniques could improve the sensitivity and specificity in detecting biomarkers in complex sample matrices (urine, serum, saliva, blood) using little amount of sample, and show remarkable features in TB diagnosis compared to the conventional techniques [40]. The development of the device platform including non-molecular techniques and molecular techniques such as interferon-gamma (IFN-γ) release assays (IGRA), loop-mediated isothermal amplification (LAMP), GeneXpert analyser are emerged to ensure the fulfillment of unmet requirements in TB diagnostics such as rapidity, affordability, simplicity, precision and high sensitivity [41]. With the development of new technology, more methods will be used to diagnose SS- TB.

This study was subject to some limitations. First, we conducted spatio-temporal scan statistic to detect clusters in different space and periods of time but this method only relies on circular spatial scanning and cylinder space-time scanning, and doesn’t allow for irregular space. Second, other ecological or individual level factors affected TB incidence such as low economic level [42], poor living environment [43], backward educational and medical conditions [34], exposure to ambient air pollution [38], age, sex, smoking and so on have not been taken into account in the present study. Third, the TB risk may be underestimated because of the missing cases’ report in some areas and the prefecture is not the smallest unit of administrative regionalization.


This study identified significant spatial, temporal and spatio-temporal clusters of reported TB cases at the prefecture level in the mainland of China from 2005 to 2015. The southwest of Xinjiang Uygur Autonomous Region and the southeast of Guangdong Province were the most likely clustering areas while the spring and early summer were the most likely clustering time. The spatio-temporal clustering results showed that the clustering time of SS+ TB was concentrated before 2010 while SS- TB was mainly concentrated after 2010. In a word, our result is helpful in prioritizing resource assignment in high-risk periods and high-risk areas, and to formulate powerful strategy to prevention and control TB.



China Information System for Disease Control and Prevention



SS + : 

Sputum smear-positive


Sputum smear-negative


World Health Organization


Relative risk


Log-likelihood ratios


Geographic Information System


Directly observed treatment strategy


Human immunodeficiency virus


Interferon-gamma (IFN-γ) release assays


Loop-mediated isothermal amplification



This study was supported by the China Center for Disease Control and Prevention. The authors would like to thank colleagues of the TB program for their assistance in data collection. We thank the editors and anonymous reviewers for their helpful remarks.


The research was supported by the National S&T Major Project (Grant No. 2014ZX10004005–001).

Availability of data and materials

We do not wish to share our data. The data only can be acquired under the consent of the China Center for Disease Control and Prevention.

Authors’ contributions

ML, WL, and XG have designed the research and tools, oversaw the study. QL, YL, YM and WF analyzed the data. ML and CH have written the manuscript. WL, EA, XL, and WW assisted with the editing of the paper and provided critical comments. YZ provided the source data of the study and provided essential comments. All authors read and approved the final manuscript, and accepted the accountability for all aspects of the work.

Ethics approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Authors’ Affiliations

Department of Epidemiology and Health Statistics, School of Public Health, Capital Medical University, Beijing, 100069, China
Beijing Municipal Key Laboratory of Clinical Epidemiology, Capital Medical University, Beijing, 100069, China
Chinese Center for Disease Control and Prevention, Beijing, 102206, China
Department of Mathematics and Statistics, La Trobe University, Melbourne, 3086, Australia
School of Medical Sciences and Health, Edith Cowan University, WA6027, Perth, Australia
National Tuberculosis Clinical Laboratory of China, Beijing Chest Hospital, Capital Medical University, Beijing, 101149, China
Beijing Tuberculosis and Thoracic Tumour Research Institute, Beijing, 101149, China


  1. World health organization. Title of subordinate document. In: Global tuberculosis report 2017. World health organization; 2017. Accessed 1 Oct 2017.Google Scholar
  2. Wang L, Zhang H, Ruan Y, Chin DP, Xia Y, Cheng S, et al. Tuberculosis prevalence in China, 1990-2010; a longitudinal analysis of national survey data. Lancet. 2014;383(9934):2057–64.View ArticleGoogle Scholar
  3. Ge E, Zhang X, Wang X, Wei X. Spatial and temporal analysis of tuberculosis in Zhejiang Province, China, 2009-2012. Infect Dis Poverty. 2016;5:11.View ArticleGoogle Scholar
  4. Du HW, Wang Y, Zhuang DF, Jiang XS. Temporal and spatial distribution characteristics in the natural plague foci of Chinese Mongolian gerbils based on spatial autocorrelation. Infect Dis Poverty. 2017;6(1):124.View ArticleGoogle Scholar
  5. Rao HX, Zhang X, Zhao L, Yu J, Ren W, Zhang XL, et al. Spatial transmission and meteorological determinants of tuberculosis incidence in Qinghai Province, China: a spatial clustering panel analysis. Infect Dis Poverty. 2016;5(1):45.View ArticleGoogle Scholar
  6. Mahara G, Wang C, Huo D, Xu Q, Huang F, Tao L, et al. Spatiotemporal pattern analysis of scarlet fever incidence in Beijing, China, 2005-2014. Int J Environ Res Public Health. 2016;13:1.Google Scholar
  7. Xia J, Cai S, Zhang H, Lin W, Fan Y, Qiu J, et al. Spatial, temporal, and spatiotemporal analysis of malaria in Hubei Province, China from 2004-2011. Malar J. 2015;14:145.View ArticleGoogle Scholar
  8. Abbas T, Younus M, Muhammad SA. Spatial cluster analysis of human cases of Crimean Congo hemorrhagic fever reported in Pakistan. Infect Dis Poverty. 2015;4:9.View ArticleGoogle Scholar
  9. Li L, Xi Y, Ren F. Spatio-temporal distribution characteristics and trajectory similarity analysis of tuberculosis in Beijing, China. Int J Environ Res Public Health. 2016;13(3).View ArticleGoogle Scholar
  10. Huang L, Li XX, Abe EM, Xu L, Ruan Y, Cao CL, et al. Spatial-temporal analysis of pulmonary tuberculosis in the northeast of the Yunnan province, People's Republic of China. Infect Dis Poverty. 2017;6(1):53.View ArticleGoogle Scholar
  11. Rao H, Shi X, Zhang X. Using the Kulldorff's scan statistical analysis to detect spatio-temporal clusters of tuberculosis in Qinghai Province, China, 2009-2016. BMC Infect Dis. 2017;17(1):578.View ArticleGoogle Scholar
  12. Cao K, Yang K, Wang C, Guo J, Tao L, Liu Q, et al. Spatial-temporal epidemiology of tuberculosis in mainland China: an analysis based on Bayesian theory. Int J Environ Res Public Health. 2016;13(5).View ArticleGoogle Scholar
  13. Malhotra S, Zodpey SP, Chandra S, Vashist RP, Satyanaryana S, Zachariah R, et al. Should sputum smear examination be carried out at the end of the intensive phase and end of treatment in sputum smear negative pulmonary TB patients? PLoS One. 2012;7(11):e49238.View ArticleGoogle Scholar
  14. Ma MJ, Yang Y, Wang HB, Zhu YF, Fang LQ, An XP, et al. Transmissibility of tuberculosis among school contacts: an outbreak investigation in a boarding middle school, China. Infect Genet Evol. 2015;32:148–55.View ArticleGoogle Scholar
  15. Fang Y, Zhang L, Tu C, Ye D, Fontaine R, Ma H, et al. Outbreak of pulmonary tuberculosis in a Chinese high school, 2009-2010. J Epidemiol. 2013;23(4):307–12.View ArticleGoogle Scholar
  16. Chen W, Xia Y, Li X, Zhou L, Li C, Wan K, et al. A tuberculosis outbreak among senior high school students in China in 2011. J Int Med Res. 2012;40(5):1830–9.View ArticleGoogle Scholar
  17. Huo XN, Zhang WW, Sun DF, Li H, Zhou LD, Li BG. Spatial pattern analysis of heavy metals in Beijing agricultural soils based on spatial autocorrelation statistics. Int J Environ Res Public Health. 2011;8(6):2074–89.View ArticleGoogle Scholar
  18. Jones SG, Kulldorff M. Influence of spatial resolution on space-time disease cluster detection. PLoS One. 2012;7(10):e48036.View ArticleGoogle Scholar
  19. Zhao F, Cheng S, He G, Huang F, Zhang H, Xu B, et al. Space-time clustering characteristics of tuberculosis in China, 2005-2011. PLoS One. 2013;8(12):e83605.View ArticleGoogle Scholar
  20. Alemu K, Worku A, Berhane Y. Malaria infection has spatial, temporal, and spatiotemporal heterogeneity in unstable malaria transmission areas in Northwest Ethiopia. PLoS One. 2013;8(11):e79966.View ArticleGoogle Scholar
  21. Kulldorff M. Title of subordinate document. In: SaTScan™ user guide for version 9.3. National Cancer Institute; 2014. Accessed 8 Oct 2014.Google Scholar
  22. Ma Y, Yin F, Zhang T, Zhou XA, Li X. Selection of the maximum spatial cluster size of the spatial scan statistic by using the maximum clustering set-proportion statistic. PLoS One. 2016;11(1):e0147918.View ArticleGoogle Scholar
  23. Wang LY, Zhang WY, Ding F, Hu WB, Soares Magalhaes RJ, Sun HL, et al. Spatiotemporal patterns of Japanese encephalitis in China, 2002-2010. PLoS Negl Trop Dis. 2013;7(6):e2285.View ArticleGoogle Scholar
  24. Tango T, Takahashi K. A flexible spatial scan statistic with a restricted likelihood ratio for detecting disease clusters. Stat Med. 2012;31(30):4207–18.View ArticleGoogle Scholar
  25. Tango T, Takahashi K. A flexibly shaped spatial scan statistic for detecting clusters. Int J Health Geogr. 2005;4:11.View ArticleGoogle Scholar
  26. Yang X, Duan Q, Wang J, Zhang Z, Jiang G. Seasonal variation of newly notified pulmonary tuberculosis cases from 2004 to 2013 in Wuhan, China. PLoS One. 2014;9(10):e108369.View ArticleGoogle Scholar
  27. Xu Z, Hu W, Zhang Y, Wang X, Zhou M, Su H, et al. Exploration of diarrhoea seasonality and its drivers in China. Sci Rep. 2015;5:8241.View ArticleGoogle Scholar
  28. Willis MD, Winston CA, Heilig CM, Cain KP, Walter ND, Mac Kenzie WR. Seasonality of tuberculosis in the United States, 1993-2008. Clin Infect Dis. 2012;54(11):1553–60.View ArticleGoogle Scholar
  29. Xu Z, Hu W, Su H, Turner LR, Ye X, Wang J, et al. Extreme temperatures and paediatric emergency department admissions. J Epidemiol Community Health. 2014;68(4):304–11.View ArticleGoogle Scholar
  30. Roth D, Otterstatter M, Wong J, Cook V, Johnston J, Mak S. Identification of spatial and cohort clustering of tuberculosis using surveillance data from British Columbia, Canada, 1990-2013. Soc Sci Med. 2016;168:214–22.View ArticleGoogle Scholar
  31. Yeboah-Manu D, Asare P, Asante-Poku A, Otchere ID, Osei-Wusu S, Danso E, et al. Spatio-temporal distribution of Mycobacterium tuberculosis complex strains in Ghana. PLoS One. 2016;11(8):e0161892.View ArticleGoogle Scholar
  32. Seraphin MN, Lauzardo M, Doggett RT, Zabala J, Morris JG Jr, Blackburn JK. Spatiotemporal clustering of Mycobacterium tuberculosis complex genotypes in Florida: genetic diversity segregated by country of birth. PLoS One. 2016;11(4):e0153575.View ArticleGoogle Scholar
  33. Wei W, Yuan-Yuan J, Ci Y, Ahan A, Ming-Qin C. Local spatial variations analysis of smear-positive tuberculosis in Xinjiang using geographically weighted regression model. BMC Public Health. 2016;16(1):1058.View ArticleGoogle Scholar
  34. Du J, Pang Y, Liu Y, Mi F, Xu S, Li L. Survey of tuberculosis hospitals in China: current status and challenges. PLoS One. 2014;9(11):e111945.View ArticleGoogle Scholar
  35. Li X, Li T, Tan S. Males, ages ≥45 years, businessperson, floating population, and rural residents may be considered high-risk groups for tuberculosis infection in Guangzhou, China: a review of 136,394 tb confirmed cases. Rev Inst Med Trop Sao Paulo. 2013;55(5):366–8.View ArticleGoogle Scholar
  36. Thorpe LE, Frieden TR, Laserson KF, Wells C, Khatri GR. Seasonality of tuberculosis in India: is it real and what does it tell us? Lancet. 2004;364(9445):1613–4.View ArticleGoogle Scholar
  37. Wingfield T, Schumacher SG, Sandhu G, Tovar MA, Zevallos K, Baldwin MR, et al. The seasonality of tuberculosis, sunlight, vitamin D, and household crowding. J Infect Dis. 2014;210(5):774–83.View ArticleGoogle Scholar
  38. You S, Tong YW, Neoh KG, Dai Y, Wang CH. On the association between outdoor PM2.5 concentration and the seasonality of tuberculosis for Beijing and Hong Kong. Environ Pollut. 2016;218:1170–9.View ArticleGoogle Scholar
  39. Naranbat N, Nymadawa P, Schopfer K, Rieder HL. Seasonality of tuberculosis in an eastern-Asian country with an extreme continental climate. Eur Respir J. 2009;34(4):921–5.View ArticleGoogle Scholar
  40. Zhou L, He X, He D, Wang K, Qin D. Biosensing technologies for Mycobacterium tuberculosis detection: status and new developments. Clin Dev Immunol. 2011;2011:193963.View ArticleGoogle Scholar
  41. Dheda K, Ruhwald M, Theron G, Peter J, Yam WC. Point-of-care diagnosis of tuberculosis: past, present and future. Respirology. 2013;18(2):217–32.View ArticleGoogle Scholar
  42. Wubuli A, Xue F, Jiang D, Yao X, Upur H, Wushouer Q. Socio-demographic predictors and distribution of pulmonary tuberculosis (TB) in Xinjiang, China: a spatial analysis. PLoS One. 2015;10(12):e0144010.View ArticleGoogle Scholar
  43. Lai PC, Low CT, Tse WS, Tsui CK, Lee H, Hui PK. Risk of tuberculosis in high-rise and high density dwellings: an exploratory spatial analysis. Environ Pollut. 2013;183:40–5.View ArticleGoogle Scholar


© The Author(s). 2018