Skip to main content

Who benefits from increased service utilisation? Examining the distributional effects of payment for performance in Tanzania



Payment for performance (P4P) strategies, which provide financial incentives to health workers and/or facilities for reaching pre-defined performance targets, can improve healthcare utilisation and quality. P4P may also reduce inequalities in healthcare use and access by enhancing universal access to care, for example, through reducing the financial barriers to accessing care. However, P4P may also enhance inequalities in healthcare if providers cherry-pick the easier-to-reach patients to meet their performance targets. In this study, we examine the heterogeneity of P4P effects on service utilisation across population subgroups and its implications for inequalities in Tanzania.


We used household data from an evaluation of a P4P programme in Tanzania. We surveyed about 3000 households with women who delivered in the last 12 months prior to the interview from seven intervention and four comparison districts in January 2012 and a similar number of households in 13 months later. The household data were used to generate the population subgroups and to measure the incentivised service utilisation outcomes. We focused on two outcomes that improved significantly under the P4P, i.e. institutional delivery rate and the uptake of antimalarials for pregnant women. We used a difference-in-differences linear regression model to estimate the effect of P4P on utilisation outcomes across the different population subgroups.


P4P led to a significant increase in the rate of institutional deliveries among women in poorest and in middle wealth status households, but not among women in least poor households. However, the differential effect was marginally greater among women in the middle wealth households compared to women in the least poor households (p = 0.094). The effect of P4P on institutional deliveries was also significantly higher among women in rural districts compared to women in urban districts (p = 0.028 for differential effect), and among uninsured women than insured women (p = 0.001 for differential effect). The effect of P4P on the uptake of antimalarials was equally distributed across population subgroups.


P4P can enhance equitable healthcare access and use especially when the demand-side barriers to access care such as user fees associated with drug purchase due to stock-outs have been reduced.


Payment for performance (P4P) is a supply-side financing strategy which involves financial incentives being paid to health workers and/or facilities for reaching pre-defined performance targets. This approach started in high-income countries (HICs) with the aim of improving health care quality [24, 64, 65]. P4P is also increasingly being used in low- and middle-income countries (LMICs) to improve quality and use of health services, as well as to strengthen health systems [31, 57, 89]. The evidence base on the effectiveness of P4P is growing and suggests mixed effects with notable improvements for some incentivised indicators [9, 11, 17, 24, 26, 35, 61, 69, 73, 77].

However, most evaluations focus on average effects and pay little attention to distributional effects across provider or population subgroups [51]. There is, however, a growing awareness that average effects may mask important heterogeneous programme effects [12, 13, 19, 22, 38, 41, 51]. This study examines the heterogeneity of P4P effects on service utilisation across population subgroups. The overall goal is to display heterogeneous treatment effects, and specifically to check if the effects on population subgroups will reduce or enhance exiting inequalities in access to and utilisation of health care services.

Inequalities in access to and use of health services in favour of wealthier populations are still prevalent in many settings, with the greatest inequalities in the poorest settings [8, 15, 52, 56, 60, 68, 78, 79, 82, 84]. Factors referred to as “social determinants of health” such as economic status, education, location and age [21, 54, 60, 87], mostly drive these inequalities. From a theoretical point of view, it is hard to know how P4P will affect pre-existing inequalities. However, P4P can reduce inequalities in access to healthcare, for example, by encouraging providers to extend services to underserved groups (e.g. by reducing financial barriers to access care) in a bid to meet performance targets [31, 57]. On the other hand, P4P could also enhance inequalities in access to healthcare if providers cherry-pick the easier-to-reach patients in order to meet their performance targets [40].

Studies in HICs have found differential effects of P4P on healthcare quality between socioeconomic groups in favour of wealthier populations (pro-rich) but this effect declined over time. These studies have not found any differential effect with respect to age, sex and ethnicity [2, 14, 24, 80]. Evidence from LMICs is more limited and varied across service types [63]. For example, the effect of P4P on institutional delivery rates was greater among wealthier groups (pro-rich) in most settings [17, 46, 77] but there was an indication that it was greater among poorer groups (pro-poor) in Tanzania[11]. The effect of P4P on institutional deliveries was greater among women with health insurance in Rwanda [46] or a maternity care voucher in Cambodia [77] than their counterparts. The effect of P4P on family planning coverage was greater among wealthier groups (pro-rich), in Rwanda [46], and the effect on immunisation coverage was greater among poorer groups (pro-poor), in Burundi [17]. However, studies based on Rwanda Demographic Health Survey (DHS) data reported no differential effect by socioeconomic groups on the use of maternal care [62] and on child curative care seeking [72].

To date, most studies on differential effects of P4P have disaggregated the effect of P4P across population economic status particularly in LMICs, with little attention to other social determinants (e.g. education, occupation, and age), which are also known to affect the use of health services [4, 60], including maternal health services [30, 32, 71]. The assessment of programme differential effects across various social determinants in a broad perspective is crucial to inform universal access policies [28, 53, 60], and may help to understand how different service users are affected by a programme such as P4P [63]. In this paper, we examine the differential effect of P4P on service utilisation in Tanzania across a variety of population subgroups by stratified analyses according to various social determinants.

This paper proceeds as follows. The next section presents the conceptual framework, followed by the description of the P4P programme in Tanzania. The other sections include the methods and analysis, followed by the results, discussion and conclusion.

Conceptual framework

P4P programmes give providers incentives to change their behaviour to improve the quality of care in order to enhance utilisation and obtain financial rewards [66]. Based on this logic P4P can improve average service utilisation and the distribution of improved utilisation across population subgroups through the supply-side response (how providers respond to incentives) and the resulting demand-side response that triggers (how patients respond to supply side changes).

Supply-side response

To meet performance targets aimed at increasing the quantity of services provided, providers are likely to adopt strategies to attract more patients to facilities [31, 57]. One such strategy could be to make services more affordable [57], for example by reducing user fees, or by reducing drug stock-outs, avoiding patients having to procure drugs privately [10, 11]. Another strategy could be to improve responsiveness to service users, for example, by being kinder during service delivery [11]. However, providers might also attempt to cherry-pick patients or focus on easy-to-reach populations (i.e. underserved but easily reached) in order to meet the performance targets [25, 40], leaving the hard-to-reach (i.e. poorest with greatest need) underserved. In fact, providers may need to exert greater effort and time to serve the hard-to-reach [37]. The efficiency gains in that case can be reached but at the expenses of inequity [47].

Demand-side responses

According to Andersen’s behavioural model of healthcare utilisation [3, 4], the use of health services is a function of patient’s propensity to use services (predisposing factors), factors that facilitate or impede access and use (enabling factors), as well as perceived need for healthcare (need factors). These factors among others are also social determinants of health [21, 54, 74]. The interactions between a P4P programme (supply-side response) and social determinants (demand-side factors) may affect the use and distribution of health services. For example, reduced financial barriers to access care, resulting from provider response to incentives, may stimulate demand especially for poor and/or uninsured individuals, since they are more responsive to a change in healthcare costs consistent with demand theory [33, 49]. Demand for health services may also increase if the quality of care supplied is improved [1]; for example, through increased drug availability and better interpersonal care [10, 11]. Better-off populations (e.g. wealthier, educated, and urban residents) may also benefit more from quality improvements simply because they use services more than their counterpart populations [8, 15, 21, 32, 54, 68, 81].

Despite the potential interactions between the demand and supply-side response to P4P, the health care sector does not operate like a classic free market [6, 61]. For example, the demand-side response may be weak when some demand-side barriers to access care (e.g. cultural and information barriers) are unaffected by the supply-side response to incentives [27, 48, 61, 88].

P4P in Tanzania

In 2011, the Ministry of Health and Social Welfare (MoHSW) in Tanzania with support from the Government of Norway introduced a P4P scheme as a pilot in Pwani region. The scheme aimed to improve maternal and child health (MCH) and inform the national P4P roll out. Pwani is one of 30 regions in the country and has seven districts with more than 209 health facilities. It has a population of just over a million [59]. All health facilities providing MCH services in the region were eligible to implement the P4P scheme. The P4P scheme involved a series of performance targets for facilities that were set in relation to the coverage of specific services (e.g. institutional delivery) or for care provided during a service (e.g. uptake of antimalarials during antenatal care) (Table 1), as described in more detail elsewhere [11, 18]. Performance was rewarded based on two methods of target setting: single and multiple thresholds targets. The strategies to reach performance targets were left to the discretion of the health workers at the individual facilities. District and regional managers were also eligible to receive performance payouts based on the performance of the facilities in their district or region.

Table 1 Service indicators and performance targets for facilities implementing P4P in Tanzania

The extent to which facilities were successful in achieving performance targets determined the level of bonus payout they would receive as part of the programme. Full payment was made if 100% of a given target was achieved, and 50% of payment was made for 75% < 100% achievement, while no payment was made for lower levels of performance. The maximum payout if all targets were fully attained was USD 820 per cycle for dispensaries; USD 3220 for health centres and USD 6790 for hospitals. The payouts were additional to the funding facilities receive to cover operational costs and salaries of health workers. Incentive payouts at the facility-level included bonuses to staff (equivalent to 10% of their monthly salary if all targets were fully attained) and funds that could be used for facility improvement or demand creation initiatives (10% of the total in hospitals and 25% in lower level facilities). District and regional managers received bonus payments of up to USD 3000 per cycle.

To determine whether performance targets were met, performance data were compiled by facilities and verified by the P4P implementing agency every six months (one cycle) before distributing payouts.

The P4P programme was the subject of a process and impact evaluation. The impact evaluation showed a significant positive effect on two out of eight incentivised service indicators: institutional delivery rate and provision of antimalarial during antenatal care [11]. P4P was also associated with a number of process changes such as increased availability of drugs and supplies, increased supportive supervision, a reduced chance of paying user fees, and greater provider kindness during delivery care [5, 10, 11, 55].


Study design

Our study used data from a controlled before and after evaluation study of the P4P scheme in Pwani region, Tanzania, described elsewhere [11, 18]. All seven districts in Pwani region (intervention arm), and four districts from Morogoro and Lindi regions (comparison arm) were sampled. The comparison districts were selected to be comparable to intervention districts in terms of poverty and literacy rates, the rate of institutional deliveries, infant mortality, population per health facility, and the number of children under one year of age per capita [18]. Baseline data collection was done in January 2012, with a follow-up survey 13 months later.

Sampling and data source

In the intervention arm, we included all 6 hospitals and 16 health centres that were eligible for the P4P scheme, and a random sample of 53 eligible dispensaries. A similar number of facilities were included in the comparison arm. Facilities were randomly sampled amongst those where P4P was implemented and matching comparison facilities were selected based on facility level of care, ownership, staffing levels, and case load [18]. To assess maternal and child health service utilisation in the population, we randomly sampled 20 households of women from the catchment area of each health facility who had delivered in the 12 months prior to the survey. In total, we surveyed 3000 households with eligible women in both arms at baseline, and a similar number in the follow-up survey. The household survey also collected information on maternal background characteristics (e.g. age, marital status, education occupation, religion, and number of births), and household characteristics (e.g. household size, health insurance status, and ownership of assets and housing particulars for assessing the household socioeconomic status).

Outcome variables

Our outcome variables include the two incentivised services which we know from prior analysis improved significantly as a result of P4P: institutional deliveries and uptake of two doses of intermittent preventive treatment (IPT2) for malaria during antenatal care [11]. These were measured as binary outcomes for whether a woman gave birth in a health facility and received IPT2 during antenatal care, respectively.

Generation of subgroups for distributional analyses

To examine the distribution of P4P effects on these two outcomes, we generated population subgroups based on individual and household-level characteristics, according to Andersen’s behavioural model of healthcare utilisation [3, 4]. In this study we only considered predisposing and enabling factors since data on perceived illness was not available. “Perceived illness” could also be argued to be of less relevance for maternal service utilisation outcomes, since study participants were largely healthy.

Subgroups of predisposing factors include: marital status (married vs. none), maternal age (15–49) years (below vs. above the median age of 25), education (no education vs. primary level/above), occupation (farmer vs. non-farmer), religion (Muslim vs. non-Muslim), number of births/parity (parity 1 vs. parity 2/above), and household size (below vs. above the median size of 5 members). Subgroups of enabling factors include: health insurance status (any insurance vs. none), place of residence (rural vs. urban district), and household wealth status subgroups. The wealth subgroups were generated from wealth scores derived by the principal component analysis based on 42 items of household characteristics and asset ownership (Appendix 1: Table 5) [29, 83]. The household wealth scores were generated separately for baseline and follow-up samples, since participants differed over time. Households were ranked by wealth scores from poorest (low score) to least poor and classified into three-equal sized groups (terciles): poorest, middle and least poor. Subgrouping based on five-equal sized groups (quintiles) were also generated to examine the sensitivity of the findings to different wealth subgroupings.

Statistical analysis

We first compared the sample means of individual and household-level characteristics at baseline between intervention and comparison arms, and assessed whether the differences between arms were statistically significant by using t-tests. We then assessed the distribution of service utilisation outcomes at baseline across population subgroups by estimating the utilisation gap (i.e. a difference in average service use between two subgroups) [87]. We used t-tests to test whether the utilisation gaps were significantly different from zero.

To examine whether the effects of P4P on outcomes differed across population subgroups, we first performed subgroup analyses to identify the P4P effect on each subgroup, and then tested the significance of differential effects between subgroups through analysing the interaction effect. We identified the average effect of P4P on service utilisation by using a linear difference-in-differences regression model. This model compares the changes in outcomes over time between participants in the intervention and comparison arms as specified in Eq. (1):

$$ {Y}_{ijt}={\beta}_0+{\beta}_1\left(P4{P}_j\times {\delta}_t\right)+{\beta}_2{\delta}_t+{\beta}_3{X}_{ijt}+{\gamma}_i+{\varepsilon}_{ijt} $$

where Y ijt is the utilisation outcome (institutional deliveries or uptake of IPT2) of individual i in facility j’s catchment area and at time t. The intervention dummy variable P4P j takes the value 1 if a facility is in the intervention arm and 0 if it is in the comparison arm. The unobserved time invariant facility characteristics γ j were controlled for through facility fixed-effects estimation; and included δ t  for year fixed effects. We also controlled for individual and household-level covariates X ijt (age, education, occupation, religion, marital status, parity, insurance status, household size, and household wealth status) as potential confounders. The error term is ε ijt . We clustered the standard errors at the facility level, or facility catchment area, to account for serial correlation of ε ijt at the facility level. The effect of P4P on utilisation for each subgroup is given by β1.

To test the significance of an eventual differential effect across subgroups, we included a three-way interaction term between the average treatment effect (P4P j  × δ t ) and a subgrouping variable G i  (based on predisposing and enabling factors). The associated two-order interaction terms were also included in the model. The coefficient of interest is β4 which indicates the differential effect of P4P across subgroups as shown in Eq. (2):

$$ {Y}_{ijt}={\beta}_0+{\beta}_1\left(P4{P}_j\times {\delta}_t\right)+{\beta}_2{\delta}_t+{\beta}_3{X}_{ijt}+{\beta}_4\left(P4{P}_j\times {\delta}_t\times {G}_{ijt}\ \right)+{\beta}_5\left(P4{P}_j\times {G}_{ijt}\right)+{\beta}_6\left({G}_{ijt}\times {\delta}_t\right)+{\gamma}_j+{\varepsilon}_{ijt} $$

The use of the difference-in-difference approach to estimate the effect of P4P on outcomes relies on the key identifying assumption that the trends in outcomes would be parallel across study arms in the absence of the intervention [41]. While this can never be formally tested, we supported the assumption by verifying that the pre-intervention trends in utilisation outcomes at the household level were parallel across study arms as described elsewhere [11]. By surveying women who had delivered in the past 12 months at baseline, four longitudinal outcomes were generated and used to verify the assumption: share of institutional deliveries, caesarean section deliveries, women who breastfeed within one hour of birth, and women who paid for delivery care.

We further performed several robustness checks. First, we re-estimated the P4P differential effect by using wealth quintiles instead of wealth terciles to examine whether the results were sensitivity to wealth group classification. We also generated wealth status subgroups for each study arm and re-estimated the P4P differential effect by arm-based wealth subgroups to avoid the pre-existing baseline imbalance in wealth status between arms. Second, we re-estimated the regression model by including three-way interactions with categorical variable which gives multiple subgroups (e.g. education levels, occupation categories, parity groups and age groups) instead of interactions with binary variables (e.g. married vs. none). Third, we applied a non-linear logit model instead of linear model because of binary outcome variables. Fourth, we clustered the standard errors at the district level instead of facility level and used a bootstrapping method to adjust for the small number of clusters [20]. All the analyses were performed by using STATA version 13.


The majority of individual and household characteristics were similar across intervention and comparison arms at baseline (Table 2). Exceptions were women in the intervention arm, who were more likely to be married, non-farmers, and Muslim; and their households were more likely to be poor than their counterparts in the comparison arm.

Table 2 Baseline individual woman and household characteristics by study arms

The baseline rates of institutional deliveries in both arms were significantly lower for women in the poorest and middle wealth households, and for women who were illiterate, farmers, with parity greater than one than for their counterpart women (Table 3). The rate of institutional deliveries was also higher among intervention women with health insurance and from smaller households, as well as among urban women in the comparison arm than among their counterparts. The baseline uptake of IPT2 was generally similar across arms and population subgroups, except married women in the comparison arm, who were more likely to receive IPT2 than unmarried women (Table 3).

Table 3 Baseline levels of service utilisation by subgroups across study arms

P4P significantly increased the rate of institutional deliveries among women in the poorest and in the middle wealth status households, but not among women in the least poor households (Table 4). However, when compared with the least poor subgroup, the effect of P4P was only marginally greater among women in the middle wealth status households only (p = 0.094 for differential effect) (Table 4). The effect of P4P on institutional deliveries was also significantly higher among women in rural districts compared to women in urban districts (p = 0.028 for differential effect), and among uninsured than insured women (p = 0.001 for differential effect). There were no differential effects of P4P on institutional deliveries among other subgroups, and no differential effects of P4P on the IPT2 outcome across any population subgroups (Table 4).

Table 4 Effect of P4P on service utilisation outcomes by population subgroups

Our results were generally consistent following robustness checks. When we used wealth quintiles instead of terciles, the effect of P4P on deliveries was significantly higher in lower quintiles (indication of pro-poor) compared to the effect in the top quintile (least poor), but the results on IPT2 remained the same (Appendix 2: Table 6). When we used the arm-based wealth subgroups, the differential effect by quintiles on both outcomes remained broadly unchanged, but the differential effect by terciles on deliveries disappeared and appeared marginally for IPT2 (Appendix 2: Table 6). The effect of P4P on both outcomes remained equally distributed across categorical subgroups of education, occupation, parity and age (Appendix 3: Table 7). Some changes in the results were noted with the use of a logit model, the pro-middle wealth and pro-rural effect on deliveries disappeared but all other results including the pro-uninsured effect remained the same (Appendix 4: Table 8). When standard errors were clustered at the district-level instead of at facility-level, the differential effect on deliveries by health insurance and wealth status disappeared, and women from larger households increased institutional deliveries more than their counterparts, but all other results including the pro-rural effect remained unchanged (Appendix 5: Table 9).


This study examined the distribution of P4P effects on service utilisation outcomes across population subgroups in Tanzania. This is the first study in LMICs to examine who is really benefiting from the effects of P4P across a broad range of population characteristics which aligns with the social determinants of health framework. We found that P4P increased institutional deliveries more among women in middle wealth status households, among the uninsured, and among women living in rural areas than among wealthier, insured, and urban residing women. However, these differential effects were sensitive to the analytical specifications used during the robustness checks. The effect of P4P on IPT2 was equally distributed across population subgroups, and was robust across various analytical specifications. Our results show a declining trend in inequality to access institutional deliveries since service use improved most for subgroups which initially showed low utilisation rates; while the absence of inequality in uptake of IPT2 at baseline maintained after the introduction of P4P.

The greater impact of P4P on the use of institutional deliveries among women in the middle wealth households and uninsured than wealthier and insured respectively, is likely in part due to the increased adherence to user fee exemption policy among public facilities as well as the improved availability of drugs, minimising the need to pay for drugs in private pharmacies [5, 10, 11, 27, 39, 43, 45, 85, 86, 90]. The worse-off groups which experienced a greater P4P effect were also more responsive to a change in healthcare costs [33, 49]. This is consistent with our conceptual framework and demand theory, whereby the supply-side responses of reducing the financial barriers to access delivery care in turn stimulated the demand-side responses on service utilisation mostly among the disadvantaged population.

The finding that the increased uptake of IPT2 was similar across population subgroups may be explained by the already almost universal access to one antenatal care visit in Tanzania (above 97%) [11, 75, 76]. In an effort to achieve the IPT2 target, providers likely encouraged women to return for subsequent antenatal care visits to receive at least two doses of IPT. This represents a relatively easy task for most providers because continuation of care needs less effort than its initiation [34]. Although the provision of IPT is within the control of providers, it also depends on the available stock of antimalarial drugs for IPT. Another reason for the lack of differential effect on IPT2 may have been the pre-existing balance in the uptake of IPT2 across population subgroups at baseline. This is the first study to examine whether P4P had a differential effect on the uptake of IPT for malaria during antenatal care in LMICs. In Burundi, Bonfrer et al. [17] examined the differential effect of P4P on other contents of antenatal care and found a pro-rich effect on blood pressure measurement and a lack of differential effect on the uptake of anti-tetanus vaccination across socioeconomic groups.

The pro-middle wealth effect of P4P on institutional deliveries, as an indication of being pro-poor, is contrary to the pro-rich effect on deliveries reported in Burundi [17], Rwanda [46] and Cambodia [77]. The pro-rich effect in Cambodia was attributed to the lack of effective demand among the poorest women due to user fees [77]; whereas in Burundi it was attributed to other costs like transport because the user fees for deliveries were removed prior to P4P [17]. However, a pilot study in Burundi [16] and a study using demographic and health survey (DHS) data in Rwanda [62] found no differential effect on deliveries by household wealth status; and the results in the later study were attributed to low and uniform coverage of services at baseline. In the Democratic Republic of Congo providers implementing P4P negotiated user fees with communities and raised revenues without hurting the poorest [73], but the equity effects of this approach were not assessed empirically. Further evidence of a pro-poor effect of P4P has been shown on immunisation services in Burundi [17], and on quality of care improvement in high-income countries especially in the United Kingdom [2, 14, 23, 24, 80].

Moreover, our study found that institutional deliveries improved more in rural than in urban areas, while there was no differential effect on institutional deliveries by place of residence in Rwanda [62]. In Rwanda, the minimal number of urban clusters compared to rural clusters were thought to limit the power to detect the differential effect by place of residence [62], while our study had a slightly higher number of urban clusters compared to Rwanda (i.e. 28 versus 22 urban clusters). In the United Kingdom, the effect of P4P on quality of care was greater in urban areas than in rural areas [36, 42], while there was no differential effect of P4P on quality of care by rural–urban area in the United States [67].

We found a greater P4P effect on institutional deliveries among uninsured women, whereas a greater effect on deliveries was found among women with health insurance in Rwanda [46] and a maternity care voucher in Cambodia [77]. The findings from Rwanda and Cambodia were attributed to reduced financial barriers to access care [46, 77], and this could be the case with a stronger enforcement of fee exemptions in Tanzania [11].

However, another study in Rwanda based on DHS, as nationally representative data, found no differential effect on deliveries by health insurance status [62]. A greater P4P effect on deliveries among uninsured women in Tanzania, is partly because the baseline institutional delivery rate was already higher among insured than uninsured women in the intervention arm. A further reason could be that uninsured women were more responsive to reduced healthcare costs compared to insured women who were already covered. It is also likely that the statistical power to detect the effect among women with insurance was limited because few women are insured in Tanzania [58], compared to other countries like Rwanda [50, 70].

Furthermore, we found a similar distribution of institutional delivery rates and IPT2 uptakes across age groups prior to P4P, and the effect of P4P was equally distributed across age groups, which is contrary to P4P studies in high-income countries as they found inequalities in quality of care across age groups existed and persisted after the introduction of P4P [2, 14, 24, 80].

Overall our findings imply that when P4P results in supply side responses that reduce demand-side barriers to accessing care, it can enhance equity in service utilisation. P4P also appears less likely to show a differential effect when there is a similar level of service utilisation in a given indicator across population subgroups prior to an intervention. This study supports the argument that P4P can enhance equity in access for services where there is a pre-existing inequity in coverage, and where efforts to remove the demand-side financial barriers to access care have been made [28, 31, 44, 57, 86]. Thus, to ensure P4P reduces inequities in access to care, policy makers should consider introducing complementary measures to reduce demand-side access barriers. P4P is likely to be most effective at reducing inequities in settings where they offer free health services or there is high coverage of pre-payment schemes.

To make progress towards universal health coverage and achieve sustainable development goal three especially in LMICs, more efforts are needed to stimulate demand for and supply of healthcare services [57, 86, 90]. Further insights on how supply and demand side interventions interact and complement each other to affect outcomes are needed. Moreover, because the social determinants of health as sources of inequalities emerge from different sectors, strategies within the health sector alone cannot reduce inequalities in access and use of health services [21, 54].

This study has a number of limitations. First, our study may have been underpowered to detect the effect of P4P in some groups, for example among insured women and urban residents, possibly due to the more limited sample size within sub groups. Second, our results of differential effects on deliveries by wealth status, health insurance and place of residence, were not consistent across all analytical specifications used in robustness checks (i.e. non-linear model, and district level clustering of standard errors). However, the differential effects on deliveries for other subgroups of social determinants, and differential effects on IPT2, were robust to all analytical specifications used. Third, our finding that P4P reduces inequalities in service utilisation might be reflective of a regression to the mean principle (a random fluctuation rather than a true causal effect) because of having a short term evaluation [7]. Lastly, we restricted our distributional analysis to the outcomes which improved significantly under P4P. Although the inequalities in service use may happen with an outcome which showed insignificant P4P effect on average, our focus was limited to how the increased average utilisation effects were distributed across population subgroups.


In Tanzania, the effect of P4P on institutional deliveries was greater among women in middle wealth households, in rural areas and among the uninsured women than their counterparts. P4P effect on the uptake of IPT2 was equally distributed across population subgroups. Our finding suggests that P4P can enhance equitable healthcare access and use especially when the financial barriers to access care are reduced or removed.



Demographic Health Survey


Intermittent preventive treatment


Low- and middle-income countries


Maternal and child health


Ministry of Health and Social Welfare


Payment for performance


United States dollar


  1. Alderman H, Lavy V. Household responses to public health services: cost and quality tradeoffs. The World Bank Research Observer. 1996;11:3–22.

    Article  Google Scholar 

  2. Alshamsan R, Majeed A, Ashworth M, Car J, Millett C. Impact of pay for performance on inequalities in health care: systematic review. J Health Serv Res Policy. 2010;15:178–84.

    Article  PubMed  Google Scholar 

  3. Andersen R. A behavioral model of families' use of health services. In: A behavioral model of families' use of health services; 1968.

    Google Scholar 

  4. Andersen R, Newman JF. Societal and individual determinants of medical care utilization in the United States. Milbank Mem Fund Q Health Soc. 1973;51:95–124.

    Article  CAS  PubMed  Google Scholar 

  5. Anselmi L, Binyaruka P, Borghi J. Understanding causal pathways within health systems policy evaluation through mediation analysis: an application to payment for performance (P4P) in Tanzania. Implement Sci. 2017;12:10.

    Article  PubMed  PubMed Central  Google Scholar 

  6. Arrow KJ. Uncertainty and the welfare economics of medical care. Am Econ Rev. 1963;53:941–73.

    Google Scholar 

  7. Barnett AG, van der Pols JC, Dobson AJ. Regression to the mean: what it is and how to deal with it. Int J Epidemiol. 2005;34:215–20.

    Article  PubMed  Google Scholar 

  8. Barros AJ, Ronsmans C, Axelson H, et al. Equity in maternal, newborn, and child health interventions in countdown to 2015: a retrospective review of survey data from 54 countries. Lancet. 2012;379:1225–33.

    Article  PubMed  Google Scholar 

  9. Basinga P, Gertler PJ, Binagwaho A, et al. Effect on maternal and child health services in Rwanda of payment to primary health-care providers for performance: an impact evaluation. Lancet. 2011;377:1421–8.

    Article  PubMed  Google Scholar 

  10. Binyaruka P, Borghi J. Improving quality of care through payment for performance: examining effects on the availability and stock-out of essential medical commodities in Tanzania. Tropical Med Int Health. 2017;22:92–102.

    Article  Google Scholar 

  11. Binyaruka P, Patouillard E, Powell-Jackson T, et al. Effect of paying for performance on utilisation, quality, and user costs of health Services in Tanzania: a controlled before and after study. PLoS One. 2015;10:e0135013.

    Article  PubMed  PubMed Central  Google Scholar 

  12. Bitler MP, Gelbach JB, Hoynes HW. What mean impacts miss: distributional effects of welfare reform experiments. Am Econ Rev. 2006;96:988–1012.

    Article  Google Scholar 

  13. Bitler MP, Gelbach JB, Hoynes HW. Distributional impacts of the self-sufficiency project. J Public Econ. 2008;92:748–65.

    Article  Google Scholar 

  14. Boeckxstaens P, Smedt DD, Maeseneer JD, Annemans L, Willems S. The equity dimension in evaluations of the quality and outcomes framework: a systematic review. BMC Health Serv Res. 2011;11:209.

    Article  PubMed  PubMed Central  Google Scholar 

  15. Boerma JT, Bryce J, Kinfu Y, Axelson H, Victora CG. Mind the gap: equity and trends in coverage of maternal, newborn, and child health services in 54 countdown countries. Lancet. 2008;371:1259–67.

    Article  PubMed  Google Scholar 

  16. Bonfrer I, Soeters R, Van de Poel E, et al. Introduction of performance-based financing in burundi was associated with improvements in care and quality. Health Aff (Millwood). 2014a;33:2179–87.

    Article  Google Scholar 

  17. Bonfrer I, Van de Poel E, Van Doorslaer E. The effects of performance incentives on the utilization and quality of maternal and child care in Burundi. Soc Sci Med. 2014b;123:96–104.

    Article  PubMed  Google Scholar 

  18. Borghi J, Mayumana I, Mashasi I, et al. Protocol for the evaluation of a pay for performance programme in Pwani region in Tanzania: a controlled before and after study. Implement Sci. 2013;8:80.

    Article  PubMed  PubMed Central  Google Scholar 

  19. Brand JE, Xie Y. Who benefits most from college? Evidence for negative selection in heterogeneous economic returns to higher education. Am Sociol Rev. 2010;75:273–302.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Cameron AC, Miller DL. A practitioner’s guide to cluster-robust inference. J Hum Resour. 2015;50:317–72.

    Article  Google Scholar 

  21. CSDH. 2008. Closing the gap in a generation: health equity through action on the social determinants of health.

    Google Scholar 

  22. Djebbari H, Smith J. Heterogeneous impacts in PROGRESA. J Econ. 2008;145:64–80.

    Article  Google Scholar 

  23. Doran T, Fullwood C, Kontopantelis E, Reeves D. Effect of financial incentives on inequalities in the delivery of primary clinical care in England: analysis of clinical activity indicators for the quality and outcomes framework. Lancet. 2008;372:728–36.

    Article  PubMed  Google Scholar 

  24. Eijkenaar F, Emmert M, Scheppach M, Schoffski O. Effects of pay for performance in health care: a systematic review of systematic reviews. Health Policy. 2013;110:115–30.

    Article  PubMed  Google Scholar 

  25. Ellis RP, McGuire TG. Hospital response to prospective payment: moral hazard, selection, and practice-style effects. J Health Econ. 1996;15:257–77.

    Article  CAS  PubMed  Google Scholar 

  26. Engineer CY, Dale E, Agarwal A, et al. Effectiveness of a pay-for-performance intervention to improve maternal and child health services in Afghanistan: a cluster-randomized trial. Int J Epidemiol. 2016;45:451–9.

    Article  PubMed  Google Scholar 

  27. Ensor T, Cooper S. Overcoming barriers to health service access: influencing the demand side. Health Policy Plan. 2004;19:69–79.

    Article  PubMed  Google Scholar 

  28. Evans DB, Hsu J, Boerma T. Universal health coverage and universal access. Bull World Health Organ. 2013;91:546.

    Article  PubMed  PubMed Central  Google Scholar 

  29. Filmer D, Pritchett LH. Estimating wealth effects without expenditure data--or tears: an application to educational enrollments in states of India. Demography. 2001;38:115–32.

    CAS  PubMed  Google Scholar 

  30. Fort AL, Kothari MT, Abderrahim N. Postpartum care: levels and determinants in developing countries. In: USAID; Calverton. Maryland: Macro International Inc.; 2006.

    Google Scholar 

  31. Fritsche G, Soeters R, Meessen B. Performance-based financing toolkit. Washington DC: The World Bank; 2014.

    Book  Google Scholar 

  32. Gabrysch S, Campbell OM. Still too far to walk: literature review of the determinants of delivery service use. BMC Pregnancy Childbirth. 2009;9:34.

    Article  PubMed  PubMed Central  Google Scholar 

  33. Gertler P, Locay L, Sanderson W. Are user fees regressive?: the welfare implications of health care financing proposals in Peru. J Econ. 1987;36:67–88.

    Article  Google Scholar 

  34. Gertler P, Vermeersch C. Using performance incentives to improve health outcomes. In: NBER working paper no. 19046; 2013.

    Google Scholar 

  35. Gillam SJ, Siriwardena AN, Steel N. Pay-for-performance in the United Kingdom: impact of the quality and outcomes framework: a systematic review. Ann Fam Med. 2012;10:461–8.

    Article  PubMed  PubMed Central  Google Scholar 

  36. Gravelle H, Sutton M, Ma A. 2008. Doctor behaviour under a pay for performance contract: further evidence from the quality and outcomes framework. CHE Research Paper 34.

    Google Scholar 

  37. Gwatkin DR. How much would poor people gain from faster progress towards the millennium development goals for health? Lancet. 2005;365:813–7.

    Article  PubMed  Google Scholar 

  38. Heckman JJ, Smith J, Clements N. Making the most out of programme evaluations and social experiments: accounting for heterogeneity in programme impacts. Rev Econ Stud. 1997;64:487–535.

    Article  Google Scholar 

  39. Idd A, Yohana O, Maluka SO. Implementation of pro-poor exemption policy in Tanzania: policy versus reality. Int J Health Plann Manag. 2013;28:e298–309.

    Article  Google Scholar 

  40. Ireland M, Paul E, Dujardin B. Can performance-based financing be used to reform health systems in developing countries? Bull World Health Organ. 2011;89:695–8.

    Article  PubMed  PubMed Central  Google Scholar 

  41. Khandker SR, Koolwal GB, Samad HA. Handbook on impact evaluation: quantitative methods and practices. Washington DC: The World Bank; 2010.

    Google Scholar 

  42. Kontopantelis E, Buchan I, Reeves D, Checkland K, Doran T. Relationship between quality of care and choice of clinical computing system: retrospective analysis of family practice performance under the UK's quality and outcomes framework. BMJ Open. 2013;3(8):1–11.

    Article  Google Scholar 

  43. Kruk ME, Mbaruku G, Rockers PC, Galea S. User fee exemptions are not enough: out-of-pocket payments for 'free' delivery services in rural Tanzania. Tropical Med Int Health. 2008;13:1442–51.

    Article  Google Scholar 

  44. Kutzin J. Health financing for universal coverage and health system performance: concepts and implications for policy. Bull World Health Organ. 2013;91:602–11.

    Article  PubMed  PubMed Central  Google Scholar 

  45. Lagarde M, Palmer N. The impact of user fees on health service utilization in low- and middle-income countries: how strong is the evidence? Bull World Health Organ. 2008;86:839–48.

    Article  PubMed  PubMed Central  Google Scholar 

  46. Lannes L, Meessen B, Soucat A, Basinga P. Can performance-based financing help reaching the poor with maternal and child health services? The experience of rural Rwanda. Int J Health Plann Manag. 2016;31:309–48.

    Article  Google Scholar 

  47. Le Grand J. Equity versus efficiency: the elusive trade-off. Ethics. 1990;100:554–68.

    Article  Google Scholar 

  48. Li J, Hurley J, DeCicca P, Buckley G. Physician response to pay-for-performance: evidence from a natural experiment. Health Econ. 2014;23:962–78.

    Article  PubMed  Google Scholar 

  49. Litvack JI, Bodart C. User fees plus quality equals improved access to health care: results of a field experiment in Cameroon. Soc Sci Med. 1993;37:369–83.

    Article  CAS  PubMed  Google Scholar 

  50. Lu C, Chin B, Lewandowski JL, et al. Towards universal health coverage: an evaluation of Rwanda Mutuelles in its first eight years. PLoS One. 2012;7:e39282.

    Article  CAS  PubMed  PubMed Central  Google Scholar 

  51. Markovitz AA, Ryan AM. Pay-for-performance: disappointing results or masked heterogeneity? Med Care Res: Rev; 2016.

    Google Scholar 

  52. Marmot M. Social determinants of health inequalities. Lancet. 2005;365:1099–104.

    Article  PubMed  Google Scholar 

  53. Marmot M. Universal health coverage and social determinants of health. Lancet. 2013;382:1227–8.

    Article  PubMed  Google Scholar 

  54. Marmot M, Friel S, Bell R, Houweling TA, Taylor S. Closing the gap in a generation: health equity through action on the social determinants of health. Lancet. 2008;372:1661–9.

    Article  PubMed  Google Scholar 

  55. Mayumana I, Borghi J, Anselmi L, Mamdani M, Lange S. Effects of payment for performance on accountability mechanisms: evidence from Pwani, Tanzania. Soc Sci Med. 2017;179:61–73.

    Article  PubMed  Google Scholar 

  56. McIntyre D, Thiede M, Birch S. Access as a policy-relevant concept in low- and middle-income countries. Health Econ Policy Law. 2009;4:179–93.

    Article  PubMed  Google Scholar 

  57. Meessen B, Soucat A, Sekabaraga C. Performance-based financing: just a donor fad or a catalyst towards comprehensive health-care reform? Bull World Health Organ. 2011;89:153–6.

    Article  PubMed  Google Scholar 

  58. Mtei G, Makawia S, Masanja H. Monitoring and evaluating progress towards universal health coverage in Tanzania. PLoS Med. 2014;11:e1001698.

    Article  PubMed  PubMed Central  Google Scholar 

  59. NBS. Tanzania population and housing census: population distribution by administrative areas 2012. Dar es Salaam: National Bureau of Statistics (NBS); 2013.

    Google Scholar 

  60. O'Neill J, Tabish H, Welch V, et al. Applying an equity lens to interventions: using PROGRESS ensures consideration of socially stratifying factors to illuminate inequities in health. J Clin Epidemiol. 2014;67:56–64.

    Article  PubMed  Google Scholar 

  61. Petersen LA, Woodard LD, Urech T, Daw C, Sookanan S. Does pay-for-performance improve the quality of health care? Ann Intern Med. 2006;145:265–72.

    Article  PubMed  Google Scholar 

  62. Priedeman Skiles M, Curtis SL, Basinga P, Angeles G. An equity analysis of performance-based financing in Rwanda: are services reaching the poorest women? Health Policy Plan. 2013;28:825–37.

    Article  PubMed  Google Scholar 

  63. Renmans D, Holvoet N, Orach CG, Criel B. Opening the 'black box' of performance-based financing in low- and lower middle-income countries: a review of the literature. Health Policy Plan. 2016;31:1297–309.

    Article  PubMed  Google Scholar 

  64. Roland M. Linking physicians' pay to the quality of care--a major experiment in the United kingdom. N Engl J Med. 2004;351:1448–54.

    Article  CAS  PubMed  Google Scholar 

  65. Rosenthal MB, Fernandopulle R, Song HR, Landon B. Paying for quality: providers' incentives for quality improvement. Health Aff (Millwood). 2004;23:127–41.

    Article  Google Scholar 

  66. Rosenthal MB, Frank RG. What is the empirical basis for paying for quality in health care? Med Care Res Rev. 2006;63:135–57.

    Article  PubMed  Google Scholar 

  67. Ryan AM, Blustein J. The effect of the MassHealth hospital pay-for-performance program on quality. Health Serv Res. 2011;46:712–28.

    Article  PubMed  PubMed Central  Google Scholar 

  68. Say L, Raine R. A systematic review of inequalities in the use of maternal health care in developing countries: examining the scale of the problem and the importance of context. Bull World Health Organ. 2007;85:812–9.

    Article  PubMed  PubMed Central  Google Scholar 

  69. Scott A, Sivey P, Ait Ouakrim D, et al. The effect of financial incentives on the quality of health care provided by primary care physicians. Cochrane Database Syst Rev. 2011(9):Cd008451.

  70. Sekabaraga C, Diop F, Soucat A. Can innovative health financing policies increase access to MDG-related services? Evidence from Rwanda. Health Policy Plan. 2011;Suppl 2(26):ii52–62.

    Google Scholar 

  71. Simkhada B, Teijlingen ER, Porter M, Simkhada P. Factors affecting the utilization of antenatal care in developing countries: systematic review of the literature. J Adv Nurs. 2008;61:244–60.

    Article  PubMed  Google Scholar 

  72. Skiles MP, Curtis SL, Basinga P, Angeles G, Thirumurthy H. The effect of performance-based financing on illness, care-seeking and treatment among children: an impact evaluation in Rwanda. BMC Health Serv Res. 2015;15:375.

    Article  PubMed  PubMed Central  Google Scholar 

  73. Soeters R, Peerenboom PB, Mushagalusa P, Kimanuka C. Performance-based financing experiment improved health care in the Democratic Republic of Congo. Health Aff (Millwood). 2011;30:1518–27.

    Article  Google Scholar 

  74. Solar O, Irwin A. A conceptual framework for action on the social determinants of health. Social determinants of health. In: Discussion paper 2 (policy and practice); 2010.

    Google Scholar 

  75. TDHS. Tanzania demographic and health survey 2010. Dar es Salaam: National Bureau of Statistics (NBS); 2011.

    Google Scholar 

  76. TDHS. Tanzania demographic and health survey and malaria indicator survey 2015–16. Dar es Salaam: National Bureau of Statistics (NBS); 2016.

    Google Scholar 

  77. Van de Poel E, Flores G, Ir P, O'Donnell O. Impact of performance-based financing in a low-resource setting: a decade of experience in Cambodia. Health Econ. 2016;25:688–705.

    Article  PubMed  Google Scholar 

  78. van Doorslaer E, Masseria C, Koolman X. Inequalities in access to medical care by income in developed countries. CMAJ. 2006;174:177–83.

    Article  PubMed  PubMed Central  Google Scholar 

  79. Van Doorslaer E, Wagstaff A, Van der Burg H, et al. Equity in the delivery of health care in Europe and the US. J Health Econ. 2000;19:553–83.

    Article  CAS  PubMed  Google Scholar 

  80. Van Herck P, De Smedt D, Annemans L, et al. Systematic review: effects, design choices, and context of pay-for-performance in health care. BMC Health Serv Res. 2010;10:247.

    Article  PubMed  PubMed Central  Google Scholar 

  81. Victora CG, Barros AJ, Axelson H, et al. How changes in coverage affect equity in maternal and child health interventions in 35 countdown to 2015 countries: an analysis of national surveys. Lancet. 2012;380:1149–56.

    Article  PubMed  Google Scholar 

  82. Victora CG, Requejo JH, Barros AJ, et al. Countdown to 2015: a decade of tracking progress for maternal, newborn, and child survival. Lancet. 2016;387:2049–59.

    Article  PubMed  Google Scholar 

  83. Vyas S, Kumaranayake L. Constructing socio-economic status indices: how to use principal components analysis. Health Policy Plan. 2006;21:459–68.

    Article  PubMed  Google Scholar 

  84. Wagstaff A. Poverty and health sector inequalities. Bull World Health Organ. 2002;80:97–105.

    PubMed  PubMed Central  Google Scholar 

  85. WHO. Equitable access to essential medicines: a framework for collective action. Geneva: World Health Organization; 2004.

    Google Scholar 

  86. WHO. The world health report: health systems financing: the path to universal coverage. Geneva: World Health Organization; 2010.

    Google Scholar 

  87. WHO. Handbook on health inequality monitoring: with a special focus on low- and middle-income countries. Geneva: World Health Organization; 2013.

    Google Scholar 

  88. Witter S, Fretheim A, Kessy FL, Lindahl AK. Paying for performance to improve the delivery of health interventions in low- and middle-income countries. Cochrane Database Syst Rev. 2012(2):Cd007899.

  89. Witter S, Toonen J, Meessen B, et al. Performance-based financing as a health system reform: mapping the key dimensions for monitoring and evaluation. BMC Health Serv Res. 2013;13:367.

    Article  PubMed  PubMed Central  Google Scholar 

  90. Yates R. Universal health care and the removal of user fees. Lancet. 2009;373:2078–81.

    Article  PubMed  Google Scholar 

Download references


We would like to thank the whole P4P evaluation research team, including data collectors and field coordinators. We thank Elinor Bartle for the language editing and Ottar Mæstad for reviewing the paper. We also acknowledge the support from all field respondents, and various officials such as Regional Medical Officers, District Medical officers, Health facility in-charges, and P4P implementing agency.


The Government of Norway funded the data collection for the program evaluation that was used in this paper (grant numbers: TAN-3108 and TAN 13/0005. and the UK Department for International Development (DFID) as part of the Consortium for Research on Resilient and Responsive Health Systems (RESYST) supported the data analysis and writing of this paper. This study is part of a PhD thesis at the University of Bergen for Peter Binyaruka, who is financially supported by the Norwegian State Education Loan Fund. The funding bodies had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Availability of data and materials

The data have been uploaded into a data repository. The DOI URL for the dataset is:

Author information

Authors and Affiliations



PB was involved in designing this sub-study with JB and oversaw data collection, and analysed the data. JB designed this sub-study and the impact evaluation within which it is embedded. PB wrote the first draft of the manuscript. BR, GT, JB involved in data interpretation, presentation, and revision of the manuscript. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Peter Binyaruka.

Ethics declarations

Ethics approval and consent to participate

Ethical approval for the evaluation study was obtained from the Institutional Review Board of the Ifakara Health Institute (approval number: 1BI1IRB/38) and the Ethics Review Board of the London School of Hygiene & Tropical Medicine. Study participants provided written consent to participate in this study, requiring them to sign a written consent form that was read out to them by the interviewers. This consent form was reviewed and approved by the ethics committees prior to the start of the research.

Consent for publication

Not applicable.

Competing interests

The authors of this manuscript have the following competing interests: two authors (PB and JB) were funded by the Government of Norway to undertake the data collection associated with this research. The Government of Norway also funded the P4P programme in Pwani region of Tanzania. The funder of the study had no role in data analysis, data interpretation, or writing of the manuscript.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.



Appendix 1

Table 5 Items used to construct household wealth status score

Appendix 2

Table 6 Effect of P4P on service utilisation by different categories of wealth status and by arm-based wealth subgroups

Appendix 3

Table 7 Effect of P4P on service utilisation by subgroups for categorical variables


Table 8 Effect of P4P on service utilisation by subgroups –using the non–linear logit model

Appendix 5

Table 9 Effect of P4P on service utilisation by subgroups –using district-level clustering of Standard Errors

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (, which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated.

Reprints and Permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Binyaruka, P., Robberstad, B., Torsvik, G. et al. Who benefits from increased service utilisation? Examining the distributional effects of payment for performance in Tanzania. Int J Equity Health 17, 14 (2018).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:


  • Inequality
  • Equity
  • Social determinants of health
  • Universal coverage
  • Distributional effects
  • Healthcare financing
  • Pay for performance
  • Tanzania