)XOO�SRSXODWLRQ 7RS���� 7RS��� .H\��$YHUDJH�DQQXDO�SHU�DGXOW�UHDO�LQFRPH�JURZWK�UDWH�IURP������WR������LV������� (VWLPDWHV�FRPELQH�VXUYH\��ILVFDO�DQG�QDWLRQDO�DFFRXQWV�GDWD�
$QQXDO�UHDO�QDWLRQDO�LQFRPH�JURZWK�SHU�GHFDGH 1DWLRQDO�LQFRPH�JURZWK�LQ�,QGLD�����������
5
data sources and methodology, section 3 presents our key findings, section 4 briefly discusses their policy relevance and section 5 concludes.
1 ENTERING THE DIGITAL AGE WITHOUT INEQUALITY DATA
1.1 Economic policy shifts since the 1980s
Over the past thirty years, the Indian economy went through profound evolutions. In the late seventies, India was recognized as a highly regulated economy with socialist planning. From the 1980s onwards, a large set of liberalization and deregulation reforms were implemented. In this context, it is unfortunate that Indian authorities stopped in 2000 publishing income tax tabulations, which represent a key source of data to track consistently the evolution of top incomes.
Under Prime Minister Jawaharlal Nehru (in power from 1947 to 1964), India was a statist, centrally directed and regulated economy. Transport, agriculture and construction sectors were owned and administered by the Central Government, commodity prices were regulated and the country had important trade barriers. Nehru's followers, including Indira Gandhi's (1966-77 and 1980- 1984) prolonged these policies and implemented a highly progressive tax system. In the early 1970s, the top marginal income tax rate reached record high levels (up to 97.5%).
From the mid 1980s onwards, liberalization and trade openness became recurrent themes among Indian policymakers. The Seventh Plan (1985-1990), led by Rajiv Gandhi (1984-1989), promoted the relaxation of market regulation, with increased external borrowing and increased imports. The tax system was also gradually transformed, with top marginal income tax rates falling to 50% in the mid-1980s. In the late 1980s, when India faced a balance of payment crisis, it called for International Monetary Fund assistance. Financial support was conditioned to structural reforms which pushed forward the deregulation and liberalization agenda.
What came to be known as the first set of economic reforms (1991-2000) placed the promotion of the private sector at the heart of economic policies, via denationalizations, disinvestment of the public sector, deregulation (dereservation and delicencing of public companies and industries)2. These reforms were implemented both by the Congress government of N. Rao (1991-1996) and its successors, including the conservative Janata Party government of A. Vajpayee (1998-2004). The reforms were prolonged after 2000, under the 10th and
2 Economic policies also seeked to rationalize the public sector, its branches now had to pursue the objectives of profitability and efficiency. The opening of imports, exchange rate floating regime and banking, capital market opening were also implemented.
6
subsequent five-year plans. These plans ended government fixation of petrol, sugar or fertilizer prices and led to further privatizations, in the agricultural sector in particular.
The impacts of these reforms in terms of growth has been praised by public authorities. Real per adult national income growth, which has more sense from the point of view of individual incomes than commonly used GDP3, significantly increased after the reforms. It was 0.7% in the 1970s, 2.5% in the 1980s, 2.0% in the 1990s and 4.4% since 2000 (Figure 1). However, little is known on the distributional characteristics of post-2000 growth.
1.2 The income inequality data gap
Public debate over liberalization policies largely focused on their macroeconomic impacts (Ramaswami, Kotwal, Wadhwa, 2011) and on the impacts on poverty, with a substantial reduction in poverty rates4 (World Bank, 2017; Deaton & Dreze, 2002; Deaton & Kozel, 2005). How the Indian economy fared in terms of inequality has been arguably less discussed. This can partly be explained by a lack of consistent data on the distribution of incomes or wealth for the recent period. Some evidence suggesting a rise in income inequality in India after the turn of the century can however be found in NSSO surveys and other sources available in openly-available sources. Figure 2 presents the share of total consumption attributable to the top 20% of consumers, available online from the World Bank and United Nations WIDER World Income Inequality Database (UN-WIDER WIID). The data shows a decrease in top quintile consumption share from the fifties to the seventies from around 43% to 40% and an increase thereafter (in line with Banerjee and Piketty findings) to close to 44%. There are important irregularities with the data, but the overall "U-shape" trend seems relatively consistent5.
3 Net national income is equal to GDP minus depreciation of fixed capital plus net foreign incomes. 4 The share of Indians under the $1.9 poverty line went from 45.9% in 1993 to 21.2% in 2011 (PovcalNet, 2017) 5 As discussed below, income surveys sources exist for 2005 and later years; in particular data from the National Council for Applied Economic Research (NCAER) and from the Inter University Consortium for Applied Political and Social Sciences Research (ICPSR. These data sources however do not enable comparison before and after 2000.
7
Figure 2 - Top 20% consumption share from NSSO surveys
Source: Authors’ computations using data from United Nations WIDER Income Inequality Database
and World Bank India Database (based upon NSSO surveys)
The shortcomings of household survey data in monitoring the evolution of
inequality are well known; because of underreporting and undersampling issues, surveys fail to properly capture inequality dynamics at the top of the distribution (Atkinson and Piketty, 2007, 2010). What is more, NSSO surveys only focus on consumption rather than income and the distributional dynamics of these two concepts can differ notably. In addition, the relatively limited magnitude of the changes observed in NSSO data calls for care in the interpretation of such results. Consumption data available through surveys constitutes part of the evidence, but are not sufficient to inform debates on Indian inequality.
Other data sources, such as Forbes' Indian Rich lists, suggest an important increase in the wealth of the richest Indians after 2000 (see
Figure 3). The wealth of the richest Indians reported in Forbes' India Rich List, amounted to less than 2% of National income in the 1990sn, but increased substantially throughout the 2000s, reaching 10% in 2015 and with a peak of 27% before the 2008-9 financial crisis. Such data suggests a rise in wealth inequality levels throughout the post-2000 period, but does not enable a consistent analysis of income inequality over the long run. This is confirmed by simple simulations using a fixed normalized wealth distribution and taking into account rising average nominal wealth over the period (unfortunately Indian wealth data is very limited so it is difficult to go further).
38 40
42 44
46 %
T ot
al c
on su
m pt
io n
1950 1960 1970 1980 1990 2000 2010 Year
Data from United Nations WIDER World Income Inequality Database and World Bank India Database.
Top 20% share in total consumption in India, 1951-2011
8
Figure 3 - Wealth of richest Indians in Forbes' Rich List
Source: Authors' computations based upon Forbes billionaire rankings and WID.world national income
data.
The recent release of income tax tabulations by the Income Tax Department
for the post 2011 period does, however, allow for a more consistent analysis of the dynamics of income in India since the turn of the century.
2 DATA SOURCES AND METHODOLOGY
We present the data used to produce series on the evolution of income for the entire distribution from 1951 to 2014 (period covered by both household surveys and tax data, as well as national accounts) and for the evolution of incomes of the top 1% share and above from 1922 to 2014 (period covered by tax data and national accounts only, with no survey data prior to 1951).
2.1 Description of the different data sources
2.1.1 Tax data
The Indian Income Tax Department released tax tabulations for the fiscal years 1922-1923 to 1998-1999, and interrupted the publication in 2000. After several public calls for more democratic transparency over Indian inequality data6, the ITA released tax tabulation for years 2011-12 to 2013-14. All these tabulations report the number of taxpayers and the gross and returned income for a large number of
6 See for instance http://www.bbc.com/news/world-asia-india-36186116
0 10
20 30
% N
at io
na l I
nc om
e
1990 1995 2000 2005 2010 2015 Year
Wealth of richest Indians reported in Forbes' rich list. National Income data from wid.world.
Wealth of richest Indians in Forbes List, 1988-2015
9
income brackets7. Gross income corresponds to pre-tax income before certain deductions are applied to compute returned income8. Tax units are defined as individuals or Hindu Undivided Families (HUF, family clusters allowed to file their income jointly). The number of HUF represented roughly 20 % of tax returns in the interwar period, 5% in 1990 and less than 2.5% in 2011.9
The exact reason why Indian tax administration stopped publishing data in 2000 remains unknown. One potential explanation for this is the change in the sampling method employed in the late 1990s, with a resulting loss in the precision of estimates. Indeed, official tax tabulations were based on the entire population until the early 1990s - or based on stratified samples with sampling rates close to 100 percent for top incomes as is the case in most OECD countries, but seem to be based on uniform samples of all tax returns after this period and up to 2000 (Banerjee and Piketty, 2005). The latter method led to less precise results10. Another potential explanation for the halt in tax reporting could just be the lack of interest in income statistics and inequality (which given the rise in top income shares observed from mid 1980s to 2000 seems rather surprising).
Interestingly enough, the number income tax payers in India has increased substantially over the past decades. Less than 0.5% of the population filing tax returns up to the 1950s, between 0.2 and 1% over the period between 1960 to 1990, before a substantial increase thereafter; from 1% to close to 3% in the late 1990s and more than 6% in the latest period (Figure 4)11. This increase over twenty years is impressive, yet comparatively, the current figure is similar to the levels observed in France and in the USA in the late 1910s, and much lower than the levels observed in the interwar period (about 10-15%) and in the decades following World War 2 (50% or more) in these two countries (Piketty, 2001; Piketty and Saez, 2003). With revenues from income tax equivalent to approximately 2% of GDP, India receives more revenue than China (1%), but significantly less than other emerging countries
7 According to the Income Tax Department, a number of tax payers paid their taxes but did not file returns in fiscal years 2011- 2013. In order to take into account these individuals, we assumed that they fell in the lower income tax brackets. We tested alternative assumptions: i) assuming they are fully representative of other income filers and ii) assuming they all fall in the lowest taxable bracket. These alternative assumptions have very limited impact on our final results. Minor corrections were also done to raw tax data and mainly pertain to the clubbing of brackets in some years as the average income was incompatible with the bracket they were categorized. In such rare cases, we club erroneous brackets in the lower bracket. Year 1997 was removed altogether, as data is erroneous. 8 Deductions are defined at chapter VI of the Income Tax Act. They include premiums of annuity plans, equity fund investments, medical or health insurance, certain forms of donations, etc. Focusing on gross income is more accurate in terms of pre-tax income and is also less impacted by changes in the definitions of deductions. Income losses (such as business income losses) have to be adjusted while computing Gross Total Income as per Income Tax law. Note that imputed rent for owner occupied dwellings were included in Income tax computations before 1986 and removed afterwards. More precisely, post 1986 tax data excludes imputed rent for first residence, but not for secondary residences. 9 One should note that the Indian income tax data is entirely based upon individual income. This corresponds to equal-split income (ie. income shared among spouses) only if we assume that all tax-payers are either single or married to other tax-payers falling in the same bracket, which strictly speaking cannot be true. This implies that our estimates tend to over-estimate inequality as compared to the equal-split benchmark and to under-estimate inequality as compared to the individualistic benchmark. If and when we access to micro-level Indian tax data, we will be able to refine this analysis and compute separate equal-split and individualistic series. 10 For year 1997, see Appendix 2. 11 This figure includes estimated tax payers who did not file returns post-2011. They represent approximately 30% of the number of tax payers, according to the Income Tax Department.
10
such as Brazil and Russia (4%), and South Africa and the OECD countries (9%) (OECD, 2017).
Figure 4 - Evolution of the proportion of income-tax taxpayers in India
Source: Authors' computations using data from Indian Income Tax Departement and UN population
data.
2.1.2 NSSO consumption data
The NSSO, led by the Ministry of Statistics and Program Implementation started an all-India consumer household expenditure survey (AIHS) after its independence in 1947. The first round of the AIHS was carried out in 1951 and surveys were then conducted on an annual basis. The size of rounds varies since the quinquennial AIHS has a larger sampling of about 120 000 households and five times less for smaller other rounds. The reach of the quinquennial survey is extensive in terms of consumption items (ranging from daily used food, clothing to durable goods and services such as construction, education and healthcare). NSSO surveys however do not measure individual or household incomes12, in part because agricultural and business incomes are judged to be volatile and assumed to be much less reliably measured than consumption.
12 The Employment Unemployment Surveys report wages for the working-age population, but other sources of income are not covered.
0 2
4 6
8 S
ha re
o f t
ot al
a du
lt po
pu la
tio n
( % )
1920 1930 1940 1950 1960 1970 1980 1990 2000 2010 Year
Data from Indian Income Tax Department and WID.world population estimates.
Number of taxpayers in India, 1922-2014
11
Since the first survey rounds, NSSO produced 30 days reference period estimates. This period is known as the Universal Reference Period. Post-1990, concerns were raised about the sensitivity of the reference period on the estimates and NSSO started publishing alternative reference periods (7 days and 365 days). As Deaton and Kozel (2005) note, shorter recall periods tend to lead to higher consumption estimates. However, experiments carried out with different reference periods by the NSSO working group concerned concluded that there is no clear superiority of a period over another. We thus use the Universal Reference Period. This choice is also motivated by the fact that the 30 days period is the only one that is consistent throughout the entire period of analysis (1951-2014).
For recent years (1983 to 2011) we use quinquennial rounds 38 (1983), 43 (1987-88), 50 (1993-94), 55 (1999-2000), 61(2004-05), 66 (2009-10). Micro data at the household level was obtained from the NSSO. For earlier rounds (rounds 3 to 32), for which we could not access micro data files, we use the Poverty and Growth in India Database of the World Bank (Ozler et al., 1996) which provides rural and urban per capita consumption tabulations for a dozen quantile groups for years 1951 to 1978. All rounds and corresponding years used are summarized in Appendix 3, along with the summary statistics of each round. We describe in section 2.2.2 the procedure used to infer the full distribution of income from these surveys and how we interpolate missing years.
2.1.3 National Accounts data
From 1950 to the present day, we use GDP data from WID.world, based on National Accounts Statistics (NAS) from 1971 to 2013, on World Bank (after 2013) and on Maddison (2007) from 1950 to 197013. WID.world then performs its own computations to infer Net Foreign Income and Consumption of Fixed Capital (Blanchet and Chancel, 2016). Before 1950, we use historical National Income growth rates from Sivasubramonian (2000).
A well know puzzle in Indian statistics (Deaton and Kozel, 2005; CSO, 2008) pertains to the difference in survey consumption growth rates and national accounts growth rates, particularly during the recent period. Figure 5 shows the total growth rate of Net National Income and Household Final Consumption Expenditure from NAS and personal consumption from NSSO, from 1983 to 2011. According to NAS, national income grew at 475% and household consumption grew at slightly more than 300%, while NSSO data indicates that household consumption grew at 200%.
13 In the 1990s we observe noticeable differences between real GDP growth estimates obtained from UN SNA and those reported by the World Bank (see Appendix 1).
12
Figure 5 - Cumulated growth rates according to NAS and NSSO
Source: Authors' computations using national accounts and NSSO data. Several reasons have been put forward to explain this gap, including (i) population coverage (it is different between NSSO and NAS, since Non Profit Institutions Serving Households and homeless individuals are not covered by NSSO surveys); (ii) valuation and integration of certain types of services in survey questionnaires (it was argued that the treatment of cooked meals served by employers to employees leads to underestimation of the total value of services consumed by households in the NSSO surveys (CSO, 2008) while other services such as financial intermediation that are particularly important among top earners, are not included in survey estimates (Sundaram and Tendulkar, 2005); (iii) imputed rents (while the NAS incorporates imputed rents, NSSO surveys do not14); (iv) consistency of National Accounts estimates (Kulshreshtha and Kar, 2005) ; (v) under-reporting and under-sampling of top incomes in survey data (Banerjee and Piketty, 2005). We should stress from the outset that we do not pretend to solve this complex issue. The divergence probably involves several, if not all of the factors above cited.
14 When correcting for imputed rents the Central Statistical Organization (2008) finds a large and growing share of total consumption remains unexplained.
0 10
0 20
0 30
0 40
0 50
0
To ta
l g ro
w th
(% )
Total real growth rate in India, 1983-2011
National income (Nat. Accounts) Household consumption (Nat. Accounts)
Household consumption (NSSO)
13
What we seek here is to better estimate the fraction of the difference that can be explained by the absence of top earners in survey data. We do not think that this factor alone can explain the entire gap, as it has sometime been suggested (Lakner and Milanovic, 2015).
2.1.4 IHDS income and consumption survey
The Inter University Consortium for Applied Political and Social Sciences Research (ICPSR), based at the University of Michigan, provides access to the India Human Development Survey (IHDS), conducted in 2005 and 2011-12 among more than 40 000 households from rural and urban areas. The survey provides information at the household level on both income and consumption. Consumption related questions were designed so as to match the NSSO questionnaire, using similar item categories and similar referencing periods. The definition of income in the IHDS survey includes all sources of income: labour income (wages and pensions), capital income (rents, interests, dividends, capital gains) as well as mixed (or business) incomes15. Government benefits, reported in the survey, are excluded from the analysis for consistency with tax tabulations; our focus is pre-tax income.
The IHDS is one of the very few surveys estimating both consumption and income in India. This is particularly useful as it enables a tentative reconstruction of NSSO unobserved income levels, using IHDS information. We describe this methodology in section 2.2.2. IHDS micro data is also openly available via the ICPSR website, which makes it particularly convenient16.
2.1.5 UN statistics population data
We define the theoretical population of tax payers as the total number of
adult individuals in India. We use adult population data from UN Population Prospects (2015) from 1950 to today. UN Population prospects provide 5-year age range annual population tables, based on national census and their own estimation procedures. The adult population is defined as the number of individuals over age 20. Before 1950, we use total population estimates from Sivasubramonian (2000) and reconstruct the adult population using total population growth rates given by the same author.
15 Imputed rents are not included in IHDS survey. They are not taken into account in NSSO data, nor in tax data after 1986. 16 We were not able to access the micro files of the National Council for Applied Economic Research's National Income and Expenditure Survey, done in 2004-5 and 2010-11.
14
2.2 Methodology
2.2.1 Estimation of top fiscal incomes
Following Banerjee and Piketty (2005), we first reconstruct top income thresholds and levels, using generalized Pareto interpolation techniques. The main methodological difference with Banerjee and Piketty lies in the use of generalized Pareto interpolation techniques (Blanchet, Fournier and Piketty, 2017) rather than standard Pareto distributions. Generalized Pareto interpolation17 allows for the recovery of the distribution based on tax tabulations without the need for parametric approximations. This method has demonstrated its ability to produce very precise results and also has the advantage of generating smooth estimates of the distribution, i.e. generating a differentiable quantile function and a continuous density, while other methods introduce kinks around the thresholds used as inputs for the tabulation.
The generalized Pareto interpolation procedure generates 127 generalized percentiles, namely p0p1, p1p2, ..., p99p100, corresponding to 100 fractiles of the distribution. The top fractile is split into 10 deciles (p99.0 p99.1, p99.1 p99.2,..., p99.9p100), its top decile itself split in ten deciles (p99.90 p99.91, p99.91 p99.92, ..., p99.99 p100), the tenth decile again split in ten deciles (p99.990p99.991, p99.991 p99.992, ..., p99.999p100). The top generalized percentile thus corresponds to the top 0.001% of the population. As shown in Figure 4, tax data in India is only reliable above the p94 threshold for the recent period and above the p99.9 threshold when we go backwards in time.
2.2.2 Estimation of bottom survey incomes
One of the main difficulties of our exercise is related to the fact that NSSO does not include questions on individual and/or household income. Our strategy consists of using observed income-consumption profiles in IHDS data to reconstruct income profiles from NSSO consumption data. We first estimate income and consumption levels for each generalized percentile of the distribution of income and consumption given by IHDS data. For each survey and each percentile of the distribution, we construct observed income-consumption ratios α1p=yp/cp, with yp and cp respectively with a mean income and consumption within quantile p. We call this strategy A1. To obtain a theoretical income-consumption profile over percentiles, we take average of years 2005 and 2011-12. In practice, the two profiles differ only marginally. We then construct two alternative ratios, α2p and α0p, referred to as strategies A2 and A0 respectively. In strategy A2, we assume 17 Available online at www.wid.world/gpinter
15
that α2p= 1 for α1p≤1 and α2p=α1p otherwise. This second strategy is equal to assuming no negative savings rates among the poor. In strategy A0, we define α0p=(α1p+α2p)/2 for α1p≤1. This strategy assumes that there can be negative savings rates, remittances or household transfers, but that the true αp value lies between strategy A1 and strategy A2. Income consumption ratios for the different strategies are presented in Appendix 4. We find that these different strategies have no effect on the trends we observe and a limited impact on top share estimates, as we show in section 3.4.
The choice of these different strategies indeed impacts on the estimated share of total savings in the economy. In strategy A1 total savings are close to 0, which seems too low compared to the current rate of savings in India (about 30%). This figure is close to 5% in strategy A0 and approximately 10% in strategy A2. These values are more or less constant throughout the entire period covered whereas in National accounts they move from about 10% in the 1960s to 30% today. However, using strategy A0 and factoring in top incomes in the analysis allows us to find an aggregate savings rate of the same order of magnitude as those observed today (see Appendix 5).
2.2.3 Interpolating survey and tax data for missing years.
Our objective is to produce yearly estimate for the full distribution from
1951 to 2014. Given that survey or tax data is not available for all years, it is necessary to interpolate tax and/or survey data for a certain number of years. In order to do so, we interpolate missing years using a constant growth rate between known intervals t and t+N18.
As described in sections 2.1.2 and 2.1.4, we have two available sources for the estimation of survey income for the years 2005 and 20010-11, NSSO and IHDS. However, the trends observed in the surveys are somehow divergent. The ratio of reconstructed NSSO total income to total personal income from national accounts decreases, while the ratio of IHDS total income to total personal income from national accounts is stable. The choice of one or the other source of data has implications on our final inequality statistics: using IHDS means for the estimation of the bottom of the distribution (strategy B1) yields a lower rise in top income shares than when using the NSSO survey (strategy B2). However, using NSSO mechanically accentuates the rise in top shares over the period and the strategy B1
18 In practice, for each average income at percentile p of the survey (or tax) distribution, we define ypt+1=ypt×g where g=(ypt+N/ypt)1/N, with g the growth rate, ypt+1 the average income at percentile p and year t+1.
16
is therefore used as our benchmark, as it represents the conservative approach. That said, we cannot rule out strategy B2, if we believe NSSO surveys are consistent throughout the entire period covered. We provide results for strategy B2 in the data appendix.
Between 2000 and 2011, we do not observe any tax statistics, but we do observe survey data in 2005 and in 2011. Survey data is not satisfactory to track the dynamics of top incomes, but it is better than no data at all. We thus estimate the growth rates of each percentile between 1999 and 2005 on the basis of their evolution observed in the survey distribution. The resulting estimates show the top 10% share evolving in the same direction between 2005 and 2011 in our final results as in the survey. We see this strategy as the best we can have with the available data at hand.
2.2.4 Combination of tax and survey data
Several strategies can be used to correct for missing top incomes in survey data. These include the modification of the weights assigned to top earners in household surveys, the addition of extra observations of top earners or the multiplication of income levels at the top (Burkhauser et al, 2016), and each has its own strengths and weaknesses. We think that an acceptable method should be consistent, in producing distributions with plausible statistics, in particular, the shape of inverted Pareto beta coefficients curves should be relatively smooth. The method followed should also be transparent, in so-much as it should provide a statistical outcome that could be anticipated from an economic perspective; survey inequality should in principle increase when we factor in top fiscal incomes. Furthermore, a simple strategy would also be better than a complex one.
Our preferred strategy is to assume that surveys are reliable from the bottom of the distribution up to a certain percentile and that tax data is reliable after another. In practice, this amounts to multiplying income of the top percentiles in the survey by a certain factor, given by tax data. More precisely: we suppose that survey data is reliable from p0 to p1 - this means that between p0 and p1, averages and thresholds are given by the distribution of interpolated (estimated) survey income. In our benchmark scenario, which we refer to as strategy C1, p1=p90. We also test alternative ranges: (i) p1=p95, which we refer to as strategy C2 and (ii) p1=p80, referred to as strategy C3. As shown in section 3.4, these different strategies have no impacts on the recent and long term income trends observed in India and have only a moderate impact on income concentration levels.
17
We then suppose that tax data is reliable from a certain percentile, p2, up to
the top of the distribution. p2 is given by the population share under the first
taxable bracket observed in the tax data. This value varies from p2=99.9 in the 1950s
to p2=95.5 in the 2010s19. Therefore, our strategy implies that averages and
thresholds for all percentiles above p2 are given by the distribution interpolated
from observed tax data. Appendix 5 gives the precise value of p2 for each year.
Between p1 and p2, we test several strategies for the progression of income levels and thresholds at a given point of time. We define a convex junction profile (strategy D1), a linear profile (strategy D2) and a concave profile (strategy D3). We adopt D1 (convex profile) as our benchmark strategy as it corresponds to the profile observed for recent years, for which we have more observed fiscal data at the top; more than 6% of the population against 0.1% for the earlier period (see Appendix 6). We find that these different strategies have negligible impacts on top share results. In fact, the bulk of the correction we apply to survey incomes occurs above p2, not between p1 and p2.
2.2.5 From total fiscal income to national income
Total fiscal income is the total personal income that would be reported by
individuals or tax units, if all of them reported their revenues to the tax administration. In the case of India, we do not observe this value because of the limited tax base. One way to recover it, following Atkinson (2007), is to start from the sum of primary incomes obtained by households reported in national accounts and operate a series of deductions and additions towards a definition closer to taxable income. This is the approach followed by Banerjee and Piketty (2005) and appears appropriate given that their focus was restricted to top incomes only. By construction, total fiscal income evolves at the same rate as pre-tax national income under this approach
The other approach consists of reconstructing total fiscal income via the combination of top fiscal incomes and observed (or estimated) survey income, as we detailed in the previous section. This is equivalent to assuming that tax data give true fiscal incomes for individuals over p2 and that estimated survey data gives the true fiscal incomes for individuals below p1. In this approach, reconstructed fiscal income and total national income can evolve at a different
19 In Indian tax files, there is a non-negligible proportion of filers falling below the first taxable bracket.
18
pace. Over the years, we observe a growing gap between reconstructed total income from surveys and total national income (see Appendix 7). This divergence is the repercussion of the gap between household consumption surveys and national accounts discussed in section 2.1.3. We show in section 3.4 that we can account for a non-negligeable share of this gap after the combination of survey and tax data , but that a large part of the difference remains unexplained.
In order to produce income estimates comparable to other countries, we chose to rescale our fiscal income estimates to match total pre-tax national income from national accounts. In practice, we preserve the distribution obtained from the combination of tax and survey data and simply rescale average and threshold levels of all percentile groups by a yearly factor so that we match total national income.
In further work, we intend to distribute retained earnings to the top of the distribution following the DINA guidelines (Alvaredo et al, 2016). This would most likely increase the level of inequality in the recent period, since the growth of retained earnings is likely to be concentrated among top earners. The amount by which our results would vary presumably remains limited though. Indeed, assuming retained earnings are equal to 10% of national income, distributing half of them to the top 1% would increase its share by about 1 percentage point.
2.2.6 Definition of a benchmark scenario
The combination of our different strategies defines 54 scenarios (3 A scenarios x 2 B scenarios x 3 C scenarios x 3 D scenarios). We stress that most of the combinations of scenarios among these 54 possibilities can be a priori justified, and as such, we provide results for all corresponding series in our data appendix. We see our benchmark scenario (A0B1C1D1) as being at the same time plausible and conservative compared to most of the scenarios tested, as top income shares are lower than in most scenario and also increase at a slower rate over the recent decades. Robustness tests are presented in section 3.4.
3 RESULTS
3.1 Sharp rise in top income shares since the mid-1980s
Our results exhibit a strong rise in top income shares since the mid-1980s. In our benchmark estimation scenario, the share of national income attributable to the top top 1% reached 21.7% of national income in 2013-14, up from 6.2% in 1982-1983 (see Figure 6). This is the highest level recorded since the establishment of the income tax in 1922. The top 1% share of national income was at 13% of national income in 1922-23 and increased to 20.7% in 1939-40, at the dawn of World War II.
19
It then dramatically decreased to 10.3% in 1949-50 and further decreased from the late 1960s to the early 1980s.
Figure 6 - Top 1% income share in India, 1922-2014
Source: Authors' computations using tax and survey data and national accounts.
As expected, the top 0.1% income share dynamics exhibit a similar pattern in our benchmark scenario (see Figure 7). Top 0.1% earners captured 8.6% of total income in 2013-2014. This only slightly below its pre-independence peak of 1939-40 (8.9%). The top 0.1% then saw a strong drop during World War II (down to 5.5% in 1944-45), followed by a continued reduction up to 1982-83 (when it reached 1.7%). From 1983-84 onwards, the share of national income accruing to the top 0.1% rose almost continuously. Figure 7 - Top 0.1% income share in India, 1922-2014
5 10
15 20
25 %
T ot
al in
co m
e
1920 1930 1940 1950 1960 1970 1980 1990 2000 2010 Year
Per adult pretax national income. Systematic combination of tax, survey and national accounts data. Benchmark scenario displayed (A0B1C1D1).
Top 1 % income share in India : 1922 - 2014
20
Source: Authors' computations using tax and survey data and national accounts.
Looking at the 0.01% earners (Figure 8), we also observe a strong increase in their share of national income since the mid 1980s, reaching 3.8% in 2013-2014, up from 0.4% in 1982-83. In 1941-42, the top 0.01% earned 3.8% of total income. The share of national income earned by the top 0.001% share is presented in the Appendix and also display a sharp since the mid-1980s. Figure 8 - Top 0.01% income share in India, 1922-2014
2 4
6 8
10 %
T ot
al in
co m
e
1920 1930 1940 1950 1960 1970 1980 1990 2000 2010 Year
Per adult pretax national income. Systematic combination of tax, survey and national accounts data. Benchmark scenario displayed (A0B1C1D1).
Top 0.1 % income share in India : 1922 - 2014
21
Source: Authors' computations using tax and survey data and national accounts.
3.2 Fall in Middle 40% and bottom 50% shares
We now turn to post-1951 results, which we have for the entire distribution of income. Figure 9 shows the mirror evolution of top 10% share in total income and middle 40% share (i.e. individuals above the bottom 50% earners and below the top 10%). In the mid-fifties, the top 10% and the middle 40% held about 40% of total income each, the share of the middle 40% progressively increased from the mid-fifties to 1982-83, reaching 46% of total income. It then decreased afterwards. At the turn of the Millenium, the top 10% and the middle 40% groups captured exactly the same amount, 40%. However, by 2013-14, the middle 40% share had fallen to an historically low level of 29.6%. Figure 9 - Top 10% vs. Middle 40%: 1951-2014
0 1
2 3
4 %
T ot
al in
co m
e
1920 1930 1940 1950 1960 1970 1980 1990 2000 2010 Year
Per adult pretax national income. Systematic combination of tax, survey and national accounts data. Benchmark scenario displayed (A0B1C1D1).
Top 0.01 % income share in India : 1922 - 2014
22
Source: Authors' computations using tax and survey data and national accounts.
The dynamics for the bottom 50% of the income distribution exhibit a similar pattern to that of the middle 40% (Figure 10). Bottom 50% share of national income increases from 19% in 1955-56 to 23.6% in 1982-1983, but then decreases sharply and almost continuously thereafter (20.6% in 2000-2001 and 14.9% in 2013-14).
30 35
40 45
50 55
% T
ot al
in co
m e
1950 1960 1970 1980 1990 2000 2010
Year
topshare_A0B1C1D1 middle40_A0B1C1D1 Per adult pretax national income. Systematic combination of tax, survey and national accounts data. Benchmark scenario displayed (A0B1C1D1).
1951-2014 Top 10 % vs. Middle 40 % income shares in India
23
Figure 10 - Bottom 50% income share: 1951-2014
Source: Authors' computations using tax and survey data and national accounts.
3.3 Total growth rates by income group
We compare total growth rates across the full distribution of incomes over the 1980-2014 period and compare these results in perspective to other countries available in the WID.world database, namely China, France and the USA. We also provide global growth estimates for the corresponding global groups.