Estimating the basic reproduction number for single-strain dengue fever epidemics
© Khan et al.; licensee BioMed Central Ltd. 2014
Received: 26 September 2013
Accepted: 6 March 2014
Published: 7 April 2014
Dengue, an infectious tropical disease, has recently emerged as one of the most important mosquito-borne viral diseases in the world. We perform a retrospective analysis of the 2011 dengue fever epidemic in Pakistan in order to assess the transmissibility of the disease. We obtain estimates of the basic reproduction number R0 from epidemic data using different methodologies applied to different epidemic models in order to evaluate the robustness of our estimate.
We first estimate model parameters by fitting a deterministic ODE vector-host model for the transmission dynamics of single-strain dengue to the epidemic data, using both a basic ordinary least squares (OLS) as well as a generalized least squares (GLS) scheme. Moreover, we perform the same analysis for a direct-transmission ODE model, thereby allowing us to compare our results across different models. In addition, we formulate a direct-transmission stochastic model for the transmission dynamics of dengue and obtain parameter estimates for the stochastic model using Markov chain Monte Carlo (MCMC) methods. In each of the cases we have considered, the estimate for the basic reproduction number R0 is initially greater than unity leading to an epidemic outbreak. However, control measures implemented several weeks after the initial outbreak successfully reduce R0 to less than unity, thus resulting in disease elimination. Furthermore, it is observed that there is strong agreement in our estimates for the pre-control value of R0, both across different methodologies as well across different models. However, there are also significant differences between our estimates for the post-control value of the basic reproduction number across the two different models.
In conclusion, we have obtained robust estimates for the value of the basic reproduction number R0 associated with the 2011 dengue fever epidemic before the implementation of public health control measures. Furthermore, we have shown that there is close agreement between our estimates for the post-control value of R0 across the different methodologies. Nevertheless, there are also significant differences between the estimates for the post-control value of R0 across the two different models.
KeywordsEpidemiology Dengue fever Statistical inference Stochastic model Markov chain Monte Carlo
Please see Additional file 1 for translations of the abstract into the six official working languages of the United Nations.
Global incidence of dengue has seen a striking increase over the past few decades [1, 2]. The infectious disease is now endemic in more than a hundred tropical and subtropical countries worldwide [1–3]. With an estimated 50–100 million cases and nearly 10,000- 20,000 deaths annually, dengue ranks second to Malaria amongst deadly mosquito-borne diseases [1, 2, 4–6]. The disease is caused by one of four virus serotypes (strains) of the genus Flavivirus[2, 3, 7]. Most infected individuals suffer from dengue fever, a severe flu-like illness characterized by high fever, which is not usually a threat to mortality [2, 8]. The symptoms of the disease last one to two weeks, after an initial incubation period of about 4–7 days . Some infected individuals however, develop dengue hemorrhagic fever (DHF) resulting in bleeding, low levels of blood platelets and blood plasma leakage, or dengue shock syndrome (DSS) resulting in extremely low blood pressures. The risk associated with DHF and DSS is considerably higher, with mortality ranging from 5–15% [3, 5, 9, 10].
There is evidence of dengue epidemics occurring in North America, Asia and Africa in the late 18 th century . Up until the middle of the 20 th century however, incidences of dengue fever have been rare . Nonetheless, since the 1970’s, there has been a marked increase in the number of dengue cases, as well as the frequency and severity dengue epidemics, with the WHO claiming a 30-fold increase in the incidence of dengue between 1960 and 2010 [2, 3, 8]. Factors such as population growth, rapid urbanization and increase international travel are often cited as having contributed to this dramatic increase . Dengue is currently endemic in nearly 110 countries in Southeast Asia, the Americas, Africa and the Eastern Mediterranean . The WHO estimates that nearly 2.5 billion people are at risk of contracting the disease. Furthermore, nearly 50–100 million cases and almost 20,000 deaths due to more severe forms of dengue fever are reported globally every year, making dengue one of the deadliest mosquito-transmitted diseases [1, 2, 4–6].
Dengue is transmitted to humans through mosquito bites. Female mosquitos of the Aedas genus, primarily Aedes aegypti, acquire the dengue virus through a blood meal from infected humans [2, 11]. The dengue virus has an incubation period of about 7–10 days in the vector, and is then spread to susceptible humans who are bitten by the infected mosquito . The virus also has an incubation period of 4–7 days in the host . While vectors never recover from infection with the dengue virus, the infection in hosts lasts only about one to two weeks . Hosts that recover from infection with one serotype of the dengue virus gain life-long immunity from that serotype but only temporary and partial immunity to other serotypes [2, 4, 12–14]. This partial cross-immunity is the cause of antibody-dependent enhancement (ADE) in the setting of a secondary infection with a different serotype of DENV (Dengue Virus). ADE is hypothesized to be one factors leading to DHF and DSS, the more severe form of dengue disease [4, 12, 13, 15]. In this study however, we will consider infection involving only a single serotype of the dengue virus.
About 80% of individuals suffering from a primary infection with DENV are asymptomatic or display only a mild, uncomplicated fever [2, 8]. A much smaller proportion of infected individuals suffer from dengue hemorrhagic fever and dengue shock syndrome [3, 5, 9]. As mentioned previously however, risk of DHF and DSS is associated primarily with secondary infection with a heterologous serotype of DENV [4, 12, 13, 15]. In general, the course of infection of dengue can be divided into three separate phases: febrile, critical and recovery. The febrile phase, which is rarely life threatening, is marked by the sudden onset of high fever, rash, headaches and muscle and joint pains, which lead to the alternative name "breakbone fever" for dengue disease . While most individuals then progress to the recovery phase, a small fraction of infected individuals instead progress to the critical phase of the disease. This phase lasts for one or two days and is marked by low blood pressure, leakage of blood plasma from the capillaries and decreased blood supply to organs. Severe cases of these symptoms are associated with DHF and DSS and the mortality in this phase of the disease is estimated to be as high as 5–15% [3, 5, 8, 9].
Over the past several years, a number of deterministic mathematical models have been proposed to analyze the transmission dynamics of dengue in urban communities [5, 11–17]. L. Esteva and C. Vargas  have investigated the coexistence of two serotypes of dengue virus using a deterministic ODE model. Moreover, Ferguson et al.  have investigated the effects of ADE on the transmission of multiple serotypes of dengue virus. In addition, Garba et al.  have shown the existence of a backward bifurcation in a standard incidence ODE model for a single strain of dengue virus. Garba et al.  have also explored the effects of cross-immunity on the transmission dynamics of two strains of dengue virus. Similarly, H. Wearing and P. Rohani  have investigated the effects of both ADE and cross immunity on multiple serotypes of dengue virus. Finally, Chowell et al.  have estimated the basic reproduction number for dengue using spatial epidemic data.
In addition, over the past few decades, several stochastic epidemic models for the spread of infectious diseases have also been proposed and analyzed [19–27]. An important qualitative difference between deterministic and stochastic epidemic models in general is the asymptotic dynamics . Furthermore, stochastic models also allow for the possibility of disease extinction in finite time and therefore the expected time to disease extinction can be calculated [19, 28, 29]. It is also observed that stochastic models better capture the uncertainty and variability that is inherent in real-life epidemics due to factors such as the unpredictability of person-to-person contact [27, 29]. L. J. S. Allen [28, 29] has explored the utility of stochastic epidemic models by comparing them with deterministic models. Despite, the utility of stochastic models, however, very little stochastic modeling has been performed for the transmission dynamics of dengue virus (see  and the references therein).
The purpose of this study is to estimate the transmissibility of the dengue virus during the 2011 dengue fever epidemic in Pakistan using epidemic data in the form of the cumulative number of reported cases of dengue. We will employ three different techniques, applied to two different models and compare the results across both the different statistical inference methodologies as well as the different models. The first approach, based on the earlier work of Cintron-Arias et al. , will involve fitting a deterministic epidemic model for the transmission of dengue to the epidemic data using an ordinary least squares (OLS) scheme implemented using the built-in optimization toolbox in MATLAB and applied in the context of an appropriate statistical model. The second method, also based on the recent work of Cintron-Arias et al. , will use a generalized least squares (GLS) scheme to fit the same deterministic model to the epidemic data. Furthermore, both approaches will also be applied to a different direct-transmission model for the transmission dynamics of dengue. Finally, the third approach will involve the formulation of a direct transmission stochastic epidemic model for dengue. We shall then use Markov chain Monte Carlo techniques to obtain a probability distribution for the model parameters.
A simple but effective measure of the transmissibility of an infectious disease is given by the basic reproduction number R0, defined as the total number of secondary infections produced by introducing a single infective in a completely susceptible population . For vector-borne diseases such as malaria and dengue, R0 is the number of secondary cases produced by a single infectious vector introduced in a completely susceptible host and vector population. In general, for simple epidemic models, if R0 is greater than unity, an epidemic will occur while if R0 is less than unity, an outbreak will most likely not occur. Thus, the value of R0 can be used to determine the intensity of control measures that need to be implemented in order to contain the epidemic.
The estimation of the basic reproductive number is generally an indirect process because the model parameters that R0 depends on are difficult or impossible to determine directly. The general methodology used therefore, attempts to fit an epidemic model to available epidemic data in order to estimate the model parameters. These parameters are then used to estimate the basic reproduction number R0. The current study is based on this methodology.
We have performed a retrospective analysis of the 2011 dengue fever epidemic in Pakistan and obtained estimates of the basic reproduction number R0, from epidemic data using three different methods. R0, defined as the total number of secondary infections produced by introducing a single infective in a completely susceptible population, is a simple but effective measure of the transmissibility of an infectious disease. In each case it was observed that the value of R0 was initially well in excess of unity, leading to the observed epidemic outbreak. Some weeks after the initial outbreak however, control measures were successfully implemented that reduced the value of R0 to less than unity, thus resulting in disease elimination.
Several methods have been proposed for the estimation of R0, both for deterministic as well as for stochastic models. These methods depend upon the mathematical model of the disease as well the nature of the data. In the case of deterministic compartmental models, least squares fit to the data has been widely used to estimate the model parameters [18, 30, 32]. For stochastic models likelihood based techniques have been used by several authors [33, 34]. We consider two ODE based deterministic models and a Continuous Time Markov Chain (CTMC) based stochastic model. Using least squares estimation for the deterministic models and a Markov Chain Monte Carlo based approach for the stochastic model, we compare the value of R0 obtained using the different models. We note that this is the first such study performed for the Dengue Epidemic in Pakistan.
The first inference methodology involved fitting a deterministic ODE model for the transmission dynamics of single-strain dengue to the epidemic data using a basic ordinary least squares (OLS) scheme in the context of a statistical model which assumed longitudinally constant variance for the epidemic data. An a priori more realistic methodology was used to fit the deterministic ODE model to obtain estimates of the model parameters using a generalized least squares (GLS) scheme which made use of a statistical model that assumed that variances associated with the observation process were directly proportional to the measurement values. One of the questions we tried to address was whether or not the variances were strongly dependent on the observations. Finally, we formulated a discrete-time, direct transmission, stochastic model for the spread of dengue virus and used Markov chain Monte Carlo (MCMC) methods to perform Bayesian inference and estimate the basic reproduction number.
We observe that the estimates for the basic reproduction number R0 before the implementation of control measures are in excellent agreement for the same model across different methodologies. Similarly, across different models there is a very slight but nonetheless statistically insignificant difference in our estimates of the pre-control basic reproduction number. We therefore conclude that our estimation of the pre-control value of R0 is quite robust, both across different methodologies as well as across different models. This leads us to believe that the noise does not depend significantly on the data. Furthermore, agreement in our estimates across models indicates that both the vector-host model as well as the direct-transmission model can be used to accurately capture the disease dynamics of actual dengue epidemics before the implementation of control measures.
While there is also close agreement in our estimates for the basic reproduction number R0 after the implementation of control measures across different methodologies, there is nonetheless significant difference between the post-control estimates of R0 across the vector-host and direct-transmission models. Specifically, R0 is estimated to be significantly larger in value when using the direct-transmission model as opposed to the vector-host model. We conjecture that this might be due to the fact that the direct-transmission model makes use of an approximation, which involves solving for the equilibrium value of the vector force of infection. Thus, while the vector force of infection rises and peaks for the vector-host model, before settling to its equilibrium value, it is in effect equal to the smaller equilibrium value for the direct-transmission model. Therefore, since the vector force of infection for the direct transmission model is, in effect, smaller for the time period after the implementation of control measures, it results in a larger estimate of the basic reproduction number in order to produce a ‘best-fit’ for the observed epidemic data.
In conclusion, we have attempted to assess the transmissibility of the dengue virus during the 2011 dengue fever epidemic in Pakistan by estimating the basic reproduction number R0 both before and after the implementation of public health control measures. Our estimates for the pre-control value of R0 are in close agreement both across different methodologies and the different models. Furthermore, the post-control estimates are also in close agreement across the different methodologies. There is however, a significant increase in the estimates of the post-control value of R0 obtained using the direct-transmission model compared to estimates obtained using the vector-host model.
Methods and materials for statistical inference using the vector-host model
The vector-host epidemic Model
The model is a deterministic vector-host ODE model that assumes a homogeneous mixing of the host (human) and vector (mosquito) populations. The total human population at time t, denoted by N(t), is divided into four mutually exclusive classes comprising of susceptible humans S H (t), exposed humans E H (t), infected humans I H (t) and recovered humans R H (t). It is assumed that individuals who recover from infection with a particular serotype of Dengue gain lifelong immunity to it . Similarly, the total vector population at time t is denoted by N V (t) and is divided into three mutually exclusive classes comprising susceptible of susceptible vectors S V (t), exposed vectors E V (t) and infected vectors I V (t). It is assumed that vectors (mosquitoes) infected with a particular serotype of Dengue never recover . We also modify the original model of Garba et al.  by assuming that exposed humans and exposed vectors do not transmit the disease.
The model assumes that the susceptible human population S H (t) has a constant recruitment rate Π H and natural death rate μ. Susceptible individuals are infected with Dengue virus (due to contact with infected vectors) at a rate λ H and thus enter the exposed class E H . The exposed population E H (t) is depleted at the natural death rate μ. Additionally, exposed individuals develop symptoms and move into the infected class I H at a rate σ H . The infected population I H (t) is depleted via the natural death rate μ, the disease-induced death rate δ H and the recovery rate of infected individuals τ H . Finally, the recovered population R H (t) decreases due to the natural death rate μ.
Similarly, the susceptible vector population S V (t) has a constant recruitment rate Π V and a natural death rate μ V . Susceptible vectors are infected with Dengue virus (due to effective contact with infected humans) at a rate λ V and thus move to the exposed vector class E V . The exposed vector class E V (t) is depleted at the natural death rate μ V . In addition exposed vectors develop symptoms and move to the infected vector class I V at a rate σ V . Infected vectors, in addition to the natural death rate μ die at a disease induced death rate δ V .
Description of the variables of the vector-host model ( 1.1)
N H (t)
Total host population
S H (t)
Population of susceptible hosts
E H (t)
Population of exposed hosts
I H (t)
Population of infected hosts
R H (t)
Population of recovered hosts
N V (t)
Total vector population
S V (t)
Population of susceptible vectors
E V (t)
Population of exposed vectors
I V (t)
Population of infected vectors
Prior to August 2011, there were three reported cases of dengue fever, all occurring several months before the actual epidemic. This leads us to conclude that these were isolated incidents and were not directly related to the epidemic itself. Furthermore, nearly 87% of all dengue infections were caused by a single serotype (DEN2) of the dengue virus. This justifies our use of a single-strain epidemic model as opposed to dengue models that incorporate the effects of cross-immunity and ADE.
In order to calculate R0, we require the values of several parameters used in model (1.1). Furthermore, we require knowledge of the initial conditions that will be used to simulate trajectories of the model (1.1).
Description of the parameters of the vector-host model ( 1.1)
Host recruitment rate
140 week -1
Vector recruitment rate
28000 week -1
Host disease-induceddeath rate
0.0035 week -1
Vector disease-induceddeath rate
negligible week -1
Latency period forexposed hosts
1 week 
Recovery time for infectedhosts
Latency period forexposed vectors
C H V
Effective contact rate
The recruitment rate for the susceptible host population depends on the demographics of the urban population that is being considered. This study uses epidemic data collected during the 2011 Dengue Epidemic in Punjab, Pakistan. Therefore, in the absence of concrete estimates, the host recruitment rate has been chosen so as to allow for a realistic steady state host population.
The effective contact rate C HV , which is a measure of the rate at which contact between an infective and a susceptible individual occurs and the probability that such contact will lead to an infection, is extremely difficult to determine directly. Most previous studies have used assumed values for the effective contact rate [11–13]. Thus, it is not possible to directly estimate the basic reproduction number. Consequently, we will adopt an indirect approach, similar to previous studies such as  and , by first finding the value of the parameter C HV for which the model (1.1) has the best agreement with the epidemic data, and then using the resultant parameter values to estimate R0.
As mentioned before, for the purpose of simulating model (1.1) we require knowledge of the initial conditions. It is possible to consider the initial conditions (S H (0),E H (0),I H (0),R H (0),S V (0),E V (0),I V (0)) as model parameters along with the effective contact rate C H V and estimate values for all parameters. Such a technique, however, produces slightly unreliable results. This is explained by the fact that the available epidemic data is restricted to the cumulative number of dengue cases reported, while the optimization schemes that we will employ produces estimates for eight variables. There are thus too many degrees of freedom and the ‘best-fit’ may result in unrealistic estimates for the initial conditions. We will therefore use reasonable estimates for the initial conditions and restrict ourselves to optimizing only the effective contact rate C HV .
Initial conditions used when applying the OLS and GLS methodology to the vector-host model ( 1.1)
S H (0)
E H (0)
I H (0)
R H (0)
S V (0)
E V (0)
I V (0)
In the following sections we will employ different methods to estimate the parameter ω = C HV by minimizing the difference between the predictions of model (1.1) and the epidemic data.
Ordinary Least Squares Estimation
We will first attempt to estimate the effective contact rate by fitting the best trajectory of model (1.1) to the epidemic data using an ordinary least squares (OLS) scheme, implemented using the fminsearch function in the built-in Optimization Toolbox in MATLAB. This will allow us to estimate the parameter ω and thus calculate the basic reproduction number R0.
where t∗ is the time at which control measures were first implemented. For the purpose of this study and in view of no concrete information being available in this regard, we have assumed that t∗ = 8 weeks.
For the purpose of this section we shall employ the notation and methodology developed in . Essentially, we will employ inverse problem methodology to obtain estimates of ω = C HV by minimizing the difference between the observed weekly cumulative number of recovered host individuals and the model predictions using a ordinary least squares (OLS) criterion. This will be done in the context of an appropriate statistical model.
where t0 denotes the time of start of the statistical process and the time of each observation is given by and ordered as t0 < t1 < t2 … .. < t N .
It is clear that the statistical estimator ω OLS is a random variable since each error term ε j is a random variable. Furthermore, the estimator attempts to minimize the distance between the observed weekly cumulative number of recovered host individuals and the predictions of model (1.1). A subsequent detailed description of how to obtain the probability distribution associated with ω OLS is given in . Our goal in the current study will be to obtain the mean of the probability distribution of ω OLS .
Uncertainty and sensitivity analysis of R 0
As mentioned previously, the basic reproduction number R0 for the deterministic model (1.1) is given by (1.2) Thus, the value of R0 depends on the variables C HV ,Π H ,μ,σ H ,δ H ,τ H ,Π V ,σ V ,μ V and δ V . While deterministic models implicitly assume that the model parameters are not stochastic in nature, an element of uncertainty is always associated with estimates of these parameters due to factors such as natural variation, errors in measurements and lack of measuring techniques. In general, uncertainty analysis quantifies the degree of confidence in the parameter estimates by producing 95% confidence intervals (CI) which can be interpreted as intervals containing 95% of future estimates when the same assumptions are made and the only noise source is observation error. Additionally, sensitivity analysis identifies critical model parameters and quantifies the impact of each input parameter on the value of an output. In this section, we shall perform uncertainty and sensitivity analysis of the basic reproduction number R0. A detailed description of the history and methodology of uncertainty and sensitivity analysis is given in .
Assumed probability distributions for the parameters of the model ( 1.1) used in the sensitivity and uncertainty analysis
Generalized least squares estimation
The Ordinary Least Squares Estimation (OLS) scheme we employed in the previous section assumed that the variances associated with the epidemic observations were longitudinally constant and not dependent on the values of the observations. This may not be a realistic assumption especially if the epidemic data is influenced by a source of non-constant systematic error such as under-reporting. Indeed, under-reporting of dengue cases has been well documented in previous studies such as  and .
If indeed the epidemic data that is being used in the current study is influenced by under-reporting then the assumption of constant variances associated with the observations is not correct since observation errors will now be proportional to the size of the measurement. Hence, we must use a statistical model, which assumes longitudinally non-constant and model dependent variances for the epidemic observations. In this section therefore, we will attempt to estimate the effective contact rate by fitting the best trajectory of model (1.1) to the epidemic data using a generalized least squares (GLS) scheme. An excellent discussion on the use of the OLS and GLS scheme and different statistical models depending on the assumptions regarding the error present in the observation process is given in .
Once again we shall employ the notation and methodology developed in . Apart from the assumptions of the statistical model, as before, we will assume that all reported cases recovered from the infection after a time lag of two weeks and that therefore, the epidemic data, after a lapse of two weeks, represents the total recovered host population. Furthermore, we will again assume that the effective contact rate C HV is a function of time. Thus, mathematically, Eq. 2.1 where t∗ is the time at which control measures were first implemented. As before, we have assumed that t∗ = 8 weeks.
where t0 denotes the time of start of the statistical process and the time of each observation is given by and ordered as t0 < t1 < t2 ….. < t N .
The rest of the analysis is similar to the method outlined in the previous section and follows easily.
Methods and materials for statistical inference using the direct-transmission model
The direct transmission epidemic model
Several existing studies on the transmission dynamics of dengue use a direct transmission SEIR model . The direct transmission models can be obtained using an approximation to vector-host models such as model (1.1). First, it is assumed that the vector force of infection can be approximated by solving for the equilibrium values of the vector population compartments. Furthermore, it is assumed that the susceptible vector population is approximately a linear multiple of the total host population. These two assumptions effectively result in a rescaling of the host effective contact rate C HV of model (1.1) into a direct transmission contact parameter β. Using the aforementioned approximation, we formulate a standard incidence, direct transmission SEIR model. More details of the approximation are given in .
Description of the variables of the direct-transmission model ( 3.1)
Total host population
Population of susceptible hosts
Population of exposed hosts
Population of infected hosts
Population of recovered hosts
Description of the parameters of the direct-transmission model ( 3.1)
Host disease-induced death rate
Latency period for exposed hosts
Recovery time for infected hosts
Effective contact rate for the direct transmission model
As discussed previously, the existing literature on dengue fever provides excellent estimates of all the parameters of Model (3.1) with the exception of the contact rate β. Hence, our aim will be to estimate the contact rate β using statistical inference and thereby estimate the basic reproduction number R0. The basic methodology used for both the OLS and GLS schemes will be similar to the process outlined in the previous sections.
where t∗ is the time at which control measures were first implemented. For the purpose of this study and in view of no concrete information being available in this regard, we have assumed that t∗ = 6 weeks. We observe that β(t) is most likely not a continuous function of time t. An alternative definition of the transmission rate β(t), as a continuous function of time, is given in .
The stochastic model and Markov Chain Monte Carlo (MCMC)
In this section we formulate a stochastic, direct-transmission, discrete-time, (S)usceptible, (E)xposed, (I)nfected and (R)ecovered/(R)emoved (SEIR) model for the transmission dynamics of dengue virus. We then use standard Markov chain Monte Carlo (MCMC) methods to perform Bayesian Inference on the epidemic data to obtain estimates of the basic reproduction number R0. As mentioned in , a number of studies exist on the transmission dynamics of dengue that assume direct-transmission. Moreover, a simple approximation can be used to reduce a vector-host model for dengue virus to a direct transmission model (see  and the references given therein for more details). The purpose of using a direct-transmission model is to make stochastic inference computationally tractable. For the purpose of this study, we will broadly follow the procedure outlined in .
Stochastic model formulation
Here, and are the time dependent transmission rate, the mean latency period and the mean infectious period respectively. Thus, in model (4.1) the transitions from one compartment to another are formulated as an exponentially distributed stochastic movements. The probability that each individual will stay in a specific compartment for a time period h is given by e xp(Π h), where Π is the compartment specific movement rate. The binomial distributions in (4.2) are then obtained by summing over the individual Bernoulli trials for every individual in the compartment. It is assumed that each trial is independent and identical for every member of the compartment.
Similar to the previous section, we will assume that the contact rate β(t) is a function of time t. Thus, mathematically, Eq. (3.2) where t∗ is the time at which control measures were first implemented. As before, we have assumed that t∗ = 8 weeks.
where τ∗ denotes the time at which observations of the epidemic have finished.
Thus, based on the available epidemic data, we have complete knowledge of both C and D but no knowledge of B. This lack of knowledge will be a major cause of uncertainty in our analysis. Nevertheless, we will attempt to estimate R0 for both the time period before control measures are implemented and the time period after control measures are implemented using our knowledge of both C and D.
where f1,f2 and f3 are the binomial transition probabilities given in (4.1) and (4.2), conditioned on Θ and all the epidemic data represented by B, C and D up until time t. Therefore, the maximum likelihood estimator for the parameter vector Θ, and by extension for β(t) and R0 can be obtained by maximizing the expression in (4.4).
According to model (4.1), the time series for S(t),E(t),I(t) and R(t) can be obtained using B(t),C(t) and D(t). Unfortunately as mentioned previously, B(t) is unknown since the process of infection is not observed. Hence, we must also impute the values of B(t). These values can then be used to construct the time series for S(t) and E(t).
where π(Θ) is the prior distribution. Thus, our MCMC algorithm will sample from the conditional probability distributions π(Θ|B,C,D) and π(B|Θ,C,D) to produce samples from the required distribution π(Θ,B|C,D). In short, our general algorithm will proceed as follows
Initialize the set B using any appropriate initial vector.
Since, C and D are known, construct the time series for S(t),E(t),I(t) and R(t).
Initialize the parameter vector Θ.
Update B using the conditional distribution π(B|Θ,C,D).
Reconstruct the new time series for S(t),E(t),I(t) and R(t).
Update Θ using the conditional distribution π(Θ|B,C,D).
Repeat steps 4-6 until the Markov chain has converged and subsequently, the required samples have been obtained.
To sample from π(B|Θ,C,D) one can use the conditional binomial distribution for B, making sure that the choice is consistent with the final size and length of the epidemic. This is however computationally very inefficient as most of the draws would be rejected due to the consistency condition. To avoid this issue we condition the proposal on the observed extinction time, following the method described in  for computationally efficient sampling. π(Θ|B,C,D) is updated using a random walk proposal.
Inference from the observed dengue data
An important question that arises at this point pertains to the meaning and significance of the basic reproduction number R0 for the stochastic SEIR model. As mentioned previously and as discussed in detail in , the basic reproduction number R0 for the deterministic SEIR model is essentially a threshold quantity which determines the possibility of an outbreak of the disease. Thus, for the deterministic SEIR model, if R0 is less than unity there is no epidemic while if R0 is greater than unity there will be a disease epidemic.
Unfortunately, the threshold dynamics of the stochastic SEIR model are not the same. It can be proven that in contrast to the deterministic model, the stochastic SEIR model predicts disease extinction regardless of the value of R0. This results in difficulty regarding the interpretation of R0 as a threshold quantity. Therefore, it is tempting to ask the question: what is the importance of R0 in the stochastic SEIR model? An answer to this question may be conjectured (but not proven) by referring to . It is proven in  that for the stochastic SI model, on average no epidemic will occur if R0 < 1, while for R0 > 1 there is a finite probability that an endemic quasi-equilibrium will develop. We conjecture that this result also holds true for the stochastic SEIR model and that it can therefore be used to explain the significance of R0 as a threshold quantity for the stochastic SEIR model.
Posterior mean of the contact rate and basic reproduction number for the Stochastic direct-transmission model ( 4.1)
3.0650 week -1
0.6318 week -1
R0 before control measures
R0 after control measures
The authors would like to acknowledge and thank the Punjab Disaster Management Authority for providing the data used for the research work being presented in this article.
- Ranjit S, Kissoon N:Dengue hemorrhagic fever and shock syndromes. Pediatr Crit Care Med. 2011, 12: 90-100. 10.1097/PCC.0b013e3181e911a7.View ArticlePubMedGoogle Scholar
- World Health Organization: dengue and severe dengue fact sheet. 2012, [http://www.who.int/mediacentre/factsheets/fs117/en/],
- Gubler DJ:Dengue and dengue hemorrhagic fever. Clin Microbiol Rev. 1998, 11 (3): 480-496.PubMed CentralPubMedGoogle Scholar
- Halstead S, Nimmannitya S, Cohen S:Observations related to pathogenesis of dengue hemorrhagic fever. IV. Relation of disease severity to antibody response and virus recovered. Yale J Biol Med. 1970, 42 (5): 311-322.PubMed CentralPubMedGoogle Scholar
- Kautner I, Robinson MJ, Kuhnle U:Dengue virus infection: epidemiology, pathogenesis, clinical presentation, diagnosis, and prevention. J Pediatr. 1997, 131 (4): 516-524. 10.1016/S0022-3476(97)70054-4.View ArticlePubMedGoogle Scholar
- Shekhar C:Deadly dengue: new vaccines promise to tackle this escalating global menace. Chem Biol. 2007, 14 (8): 871-872. 10.1016/j.chembiol.2007.08.004.View ArticlePubMedGoogle Scholar
- Holmes EC, Twiddy SS:The origin, emergence and evolutionary genetics of dengue virus. Infect Genet Evol. 2003, 3: 19-28. 10.1016/S1567-1348(03)00004-2.View ArticlePubMedGoogle Scholar
- Whitehorn J, Farrar J:Dengue. Br Med Bull. 2010, 95: 161-173. 10.1093/bmb/ldq019.View ArticlePubMedGoogle Scholar
- Gubler D, Kuno G: Dengue and Dengue Hemorrhagic Fever. 1997, London: CAB INTERNATIONALView ArticleGoogle Scholar
- Kawaguchi I, Sasaki A, Boots M:Why are dengue virus serotypes so distantly related? Enhancement and limiting serotype similarity between dengue virus strains. Proc R Soc Lond B Biol Sci. 2003, 270 (1530): 2241-2247. 10.1098/rspb.2003.2440.View ArticleGoogle Scholar
- Garba SM, Gumel AB:Abu Bakar MR: Backward bifurcations in dengue transmission dynamics. Math Biosci. 2008, 215: 11-25. 10.1016/j.mbs.2008.05.002.View ArticlePubMedGoogle Scholar
- Garba S, Gumel A:Effect of cross-immunity on the transmission dynamics of two strains of dengue. Int J Comput Math. 2010, 87 (10): 2361-2384. 10.1080/00207160802660608.View ArticleGoogle Scholar
- Wearing HJ, Rohani P:Ecological and immunological determinants of dengue epidemics. Proc Natl Acad Sci. 2006, 103 (31): 11802-11807. 10.1073/pnas.0602960103.PubMed CentralView ArticlePubMedGoogle Scholar
- Esteva L, Vargas C:Coexistence of different serotypes of dengue virus. J Math Biol. 2003, 46: 31-47. 10.1007/s00285-002-0168-4.View ArticlePubMedGoogle Scholar
- Ferguson N, Anderson R, Gupta S:The effect of antibody-dependent enhancement on the transmission dynamics and persistence of multiple-strain pathogens. Proc Natl Acad Sci. 1999, 96 (2): 790-794. 10.1073/pnas.96.2.790.PubMed CentralView ArticlePubMedGoogle Scholar
- Esteva L, Vargas C:A model for dengue disease with variable human population. J Math Biol. 1999, 38 (3): 220-240. 10.1007/s002850050147.View ArticlePubMedGoogle Scholar
- Esteva L, Vargas C:Analysis of a dengue disease transmission model. Math Biosci. 1998, 150 (2): 131-151. 10.1016/S0025-5564(98)10003-2.View ArticlePubMedGoogle Scholar
- Chowell G, Diaz-Dueñas P, Miller J, Alcazar-Velazco A, Hyman J, Fenimore P, Castillo-Chavez C:Estimation of the reproduction number of dengue fever from spatial epidemic data. Math Biosci. 2007, 208 (2): 571-589. 10.1016/j.mbs.2006.11.011.View ArticlePubMedGoogle Scholar
- Allen LJ:An introduction to stochastic epidemicmodels. Mathematical Epidemiology, Volume 1945 of Lecture Notes in Mathematics. Edited by: Brauer F, Driessche P, Wu J. 2008, Springer Berlin Heidelberg, 14197 Berlin Germany, 81-130.Google Scholar
- Keeling MJ, Ross JV:On methods for studying stochastic disease dynamics. J R Soc Interface. 2008, 5 (19): 171-181. 10.1098/rsif.2007.1106.PubMed CentralView ArticlePubMedGoogle Scholar
- Bailey NT:A simple stochastic epidemic. Biometrika. 1950, 37: 193-202. 10.1093/biomet/37.3-4.193.View ArticlePubMedGoogle Scholar
- Allen LJ, Flores DA, Ratnayake RK, Herbold JR:Discrete-time deterministic and stochastic models for the spread of rabies. Appl Math Comput. 2002, 132 (2): 271-292.View ArticleGoogle Scholar
- Weiss GH, Dishon M:On the asymptotic behavior of the stochastic and deterministic models of an epidemic. Math Biosci. 1971, 11 (3): 261-265.View ArticleGoogle Scholar
- Tuite AR, Tien J, Eisenberg M, Earn DJ, Ma J, Fisman DN:Cholera epidemic in Haiti, 2010: using a transmission model to explain spatial spread of disease and identify optimal control interventions. Ann Intern Med. 2011, 154 (9): 593-601. 10.7326/0003-4819-154-9-201105030-00334.View ArticlePubMedGoogle Scholar
- Allen L, Driessche P:Stochastic epidemic models with a backward bifurcation. Math Biosci Eng. 2006, 3 (3): 445-View ArticlePubMedGoogle Scholar
- de Souza DR, Tomé T, Pinho ST, Barreto FR, de Oliveira MJ:Stochastic dynamics of dengue epidemics. Phys Rev E. 2013, 87: 012709-View ArticleGoogle Scholar
- Spencer S: Stochastic epidemic models for emerging diseases. PhD thesis. 2008, University of NottinghamGoogle Scholar
- Allen LJ: An Introduction to Stochastic Processes with Applications to Biology. 2003, New Jersey: Pearson EducationGoogle Scholar
- Allen LJ, Burgin AM:Comparison of deterministic and stochastic SIS and SIR models in discrete time. Math Biosci. 2000, 163: 1-33. 10.1016/S0025-5564(99)00047-4.View ArticlePubMedGoogle Scholar
- Cintrón-Arias A, Castillo-Chávez C, Bettencourt LM, Lloyd AL, Banks H:The estimation of the effective reproductive number from disease outbreak data. Math Biosci Eng. 2009, 6 (2): 261-282.View ArticlePubMedGoogle Scholar
- Van den Driessche P, Watmough J:Reproduction numbers and sub-threshold endemic equilibria for compartmental models of disease transmission. Math Biosci. 2002, 180: 29-48. 10.1016/S0025-5564(02)00108-6.View ArticlePubMedGoogle Scholar
- Chowell G, Hengartner N, Castillo-Chavez C, Fenimore F, Hyman J:The basic reproductive number of Ebola and effects of public health measures: the cases of Congo and Uganda. J Theor Biol. 2004, 229: 119-126. 10.1016/j.jtbi.2004.03.006.View ArticlePubMedGoogle Scholar
- Lekone PE, Finkenstädt BF:Statistical inference in a stochastic epidemic SEIR model with control intervention: Ebola as a case study. Biometrics. 2006, 62 (4): 1170-1177. 10.1111/j.1541-0420.2006.00609.x.View ArticlePubMedGoogle Scholar
- O’Neill P, Roberts GO:Bayesian inference for partially observed stochastic epidemics. J R Statisitcal Soc A. 1999, 162: 121-129. 10.1111/1467-985X.00125.View ArticleGoogle Scholar
- Sanchez MA, Blower SM:Uncertainty and sensitivity analysis of the basic reproductive rate: tuberculosis as an example. Am J Epidemiol. 1997, 145 (12): 1127-1137. 10.1093/oxfordjournals.aje.a009076.View ArticlePubMedGoogle Scholar
- Suaya JA, Shepard DS, Beatty ME:Dengue: burden of disease and costs of illness. TDR. Report of the Scientific Working Group Meeting on Dengue. 2006, Geneva Switzerland: World Health Organization, 35-49.Google Scholar
- Beatty ME, Beutels P, Meltzer MI, Shepard DS, Hombach J, Hutubessy R, Dessis D, Coudeville L, Dervaux B, Wichmann O, Margolis HS, Kuritsky JN:Health economics of dengue: a systematic literature review and expert panel’s assessment. Am J Trop Med Hyg. 2011, 84 (3): 473-488. 10.4269/ajtmh.2011.10-0521.PubMed CentralView ArticlePubMedGoogle Scholar
- Banks HT, Davidian M, Jr Samuels JR, Sutton KL: An Inverse Problem Statistical Methodology Summary. 2009, 3994 AK Houten NetherlandView ArticleGoogle Scholar
- Jacquez JA, O’Neill P:Reproduction numbers and thresholds in stochastic epidemic models I. Homogeneous populations. Math Biosci. 1991, 107 (2): 161-186. 10.1016/0025-5564(91)90003-2.View ArticlePubMedGoogle Scholar
This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.