1 Introduction

Digitalization promises to improve tax enforcement due to the acceleration of data collection that enables tax administrations to monitor transactions in real time (Jacobs, 2017). As a result, digitizing tax collection is gaining noticeable popularity in tax policy debates and is attracting increasing interest among academics. In this study, we examine the effect of the transition from paper-based to electronic invoicing (e-invoicing) on cross-border value-added tax (VAT) fraud. For this purpose, we use the Italian e-invoicing system, which became mandatory for almost all transactions between resident entities as of January 1, 2019.

Many non-European countries, especially within Latin America and Asia, implemented digitized transaction processes, i.e. through mandatory business-to-business (B2B) e-invoicing to monitor economic processes. In the European Union (EU), Italy is the first country to have introduced such a system on a mandatory basis for B2B and B2C (business-to-customer) transactions.Footnote 1 Italy has undertaken the introduction of e-invoicing on its own, i.e. without specific coordination with other EU Member States. Hence, the scope is limited to the national level. Therefore, we ask the question whether enhanced domestic tax enforcement capabilities have a significant deterrent effect on cross-border VAT fraud that accounts for a bulk of overall VAT gapsFootnote 2 within the EU (European Commission, 2016; Frunza, 2016; Braml & Felbermayr, 2021).Footnote 3

VATFootnote 4 as the main form of consumption tax is implemented in about 170 countries. In the countries of the Organisation for Economic Co-operation and Development (OECD), it generates approximately one-third of all tax revenue (OECD, 2020). This type of consumption tax has the potential to create high revenue at relatively low administrative and economic costs.Footnote 5 However, it is also prone to fraud as firms themselves collect the tax on behalf of the state. The tax is payable by the acquirer to the supplier, while the latter is obliged to forward the received VAT to the tax authorities after deducting input tax paid on own purchases. The damage resulting from organized VAT fraud, under which the supplier does not remit the received tax from the acquirer, is partially reduced if the right to deduct the input VAT for the supplier is refused and is thus limited to the tax amount on his or her profit margin (“value added”). However, the fraudster's plunder and thus the VAT loss increase significantly if the fraudster is able to avoid paying the input VAT. Zero-rated cross-border transactions open up this possibility. The fraudster imports goods from another EU Member State without VAT, sells them with VAT on the domestic market and disappears with the gross amount received.Footnote 6 Due to the disappearance, the fraudster is called “missing trader” and the straightforward name of this scheme is ‘missing trader intra-Community’ (MTIC) fraud.Footnote 7 Based on the cross-border element, recent studies have shown that the product-specific gap between the export reported by the exporting country and the corresponding import reported by the importing country (trade data gap, TDG), serves as an indicator of cross-border VAT fraud (Braml & Felbermayr, 2021; Bussy, 2020; Stiller & Heinemann, 2019, 2023).Footnote 8

With e-invoicing, the risk of fraud detection increases since the invoice has to be sent electronically via a system of the tax administration that enables quicker cross-checks between VAT claimed and paid. The penalties imposed for not using the e-invoicing system, as well as the refusal to deduct VAT when the purchaser knew or ought to have been aware of the existence of VAT fraud, provide an incentive for honest businesses to avoid suspicious transactions. If e-invoicing prevents the fraudster’s domestic supplies, the fraudster imports less or no more. As a result, declared exports and undeclared imports decrease or honest importers replace the fraudster, increasing declared imports. In both cases, the TDG declines.

Therefore, we exploit a difference-in-differences model accounting for potential omitted variable bias including unit and time fixed effects. We obtain data on Italy's trade with the remaining EU countries for all products at the level of the 8-digit code of the Combined Nomenclature (CN), 12 months before and after the introduction of e-invoicing on January 1, 2019. As the control group, we use products that fall under the previously introduced reverse charge mechanism (RCM) and therefore should not be subject to VAT fraud. RCM applies to B2B transactions and is a VAT blocking mechanism under which the buyer is obliged to pay the VAT to the tax authorities instead of paying it to the supplier. Thus, the VAT does not come under control of the fraudster. Recent empirical studies confirm the fraud-reducing effect of the RCM (Buettner & Tassi, 2023; Bussy, 2020; Stiller & Heinemann, 2019, 2023). However, we provide additional empirical evidence for the effect of the RCM with regard to the Italian implementation.

We identify the difference in the TDG before and after the reform between products potentially not affected by e-invoicing (RCM products) as the control group and all remaining products (non-RCM products) as the treatment group.Footnote 9 We find that the introduction of e-invoicing is associated with a significant decrease in cross-border VAT fraud expressed by the TDG. The results hold when we replace the control group (RCM products in Italy) with non-RCM products in EU countries that did not adopt e-invoicing and had the highest VAT gaps in 2018 according to Poniatowski et al. (2020), i.e. Greece, Lithuania, Romania and Slovakia. Furthermore, we find a significant decrease in the TDG if we narrow down the treatment group to make treated products more similar to RCM products.

Using a simple back-of-the-envelope calculation, we estimate that e-invoicing tackled cross-border VAT fraud in a range from € 2.2 billion to € 2.6 billion from 2018 to 2019. Our results are close to the Italian estimates of an increase in VAT revenue in 2019 of about € 1.7 billion to € 2.1 billion euros (Italian Ministry of Economy & Finance, 2020). However, our model is able to isolate the effect of cross-border VAT fraud. Given that Italy detected around € 1 billion in cross-border fraud in 2019 and 2020 (European Commission, 2021a), e-invoicing has not completely extinguished fraud.

Our findings contribute to a better assessment of the impact of e-invoicing on cross-border VAT fraud, confirm the significant share of this fraud in total revenue losses and underpin the suitability of the TDG as an indicator for cross-border VAT fraud. Since administrative costs in relation to the system are low (running cost of up to € 20 million a year) a domestic e-invoicing system provides a promising way to tackle cross-border VAT fraud in other countries.

Tax research on digital tools, including e-invoicing systems, focuses on the potential to improve tax compliance and collection in developing countries (Alonso et al., 2021; Bellon et al., 2022; Bérgolo et al., 2018; Fan et al., 2020; Hernandez & Robalino, 2018; Lee, 2016; Mascagni et al., 2021; Ramirez et al., 2018; Templado & Artana, 2018) as well as cost implications for both tax administrations and firms (Giannotti et al., 2019). Although theoretical considerations on the use of digital tools against VAT fraud in Europe go back a long time (see e.g. Ainsworth, 2006), empirical studies on the tax fraud-reducing effect of digitalization are scarce. Most recently, Kitsios et al. (2022) conducted an empirical study that examines the impact of digitalization efforts on cross-border VAT fraud using aggregated trade data. They confirm that digitalization correlates with lower tax fraud. However, their analysis focus on the relationship between aggregated trade data within the EU and the Online Service Index conducted by the United Nations as proxy for digitalization efforts. Such a highly generalized index cannot disentangle single digital measures. Moreover, cross-border tax fraud is a product-specific phenomenon that can be studied only to a limited extent with aggregated data.

As part of the annual VAT gap study for all EU Member States, Poniatowski et al. (2022) find a statistically significant negative correlation between the VAT gap and (digital) reporting obligations, including VAT listing, Standard Audit File-Tax, real time and e-invoicing. However, this estimation aims to identify the overall impact of digital reporting obligations in the EU rather than single measures. Nevertheless, it shows the importance of improved tax reporting.

Against this background, the implication of certain digitalization measures with regard to cross-border tax fraud has been insufficiently examined. Such empirical evidence is essential to evaluate ongoing implementation efforts and to support tax policy in future debates as the digitalization of tax administrations become increasingly important. This demonstrates the example of the current debate on a harmonized e-invoicing system in Europe (European Commission, 2020). While especially in Latin America, e-invoicing is attested to have considerable anti-fraud potential (Barreix & Zambrano, 2018), the question remains how it affects the case of cross-border VAT fraud in Europe. We address this research gap by examining the introduction of e-invoicing in Italy in 2019 on B2B and B2C transactions using gaps in double-reported trade data between Italy and the remaining EU countries at the most detailed product code level of the CN.

With this paper, we contribute to the ongoing empirical research on the examination of measures against VAT fraud and its impact on tax revenues using trade data gaps as fraud proxy (Braml & Felbermayr, 2021; Bussy, 2020; Stiller & Heinemann, 2019, 2023). In this sense, we also contribute more broadly to the overall literature on the analysis of the TDG as a cross-border fraud indicator (Fisman & Wei, 2004; Javorcik & Narciso, 2008, 2017; Mishra et al., 2008; Stoyanov, 2012).

Additionally, our paper contributes to the emerging empirical research on the relationship between digitalization and tax fraud. Kitsios et al. (2022), Strango (2021) and Poniatowski et al. (2022) find that higher digitalization of tax reporting obligations is correlated with less (cross-border) tax fraud. All these papers, however, focus on aggregated country-level data and proxies for general digitalization efforts. We extend this literature stream i.e. by using disaggregated product-level data and a single reform, uncovering the impacts of digitalization on tax fraud on a more detailed level.

The rest of this paper is structured as follows. In Sect. 2, we provide the institutional background of the e-invoicing system in Italy and formulate our hypothesis in conjecture with the definition of our proxy for cross-border VAT fraud. In Sect. 3, we present the data and in Sect. 4 the identification strategy. Section 5 is devoted to the presentation and discussion of the main results. Section 6 addresses robustness checks and in Sect. 7, we describe and perform a quantification of the fraud. Section 8 concludes. We provide additional heterogeneity analyses in Section B in Appendix.

2 Hypothesis development

2.1 Reform background and theoretical considerations

In 2019, the obligation to send invoices electronically via the Italian exchange system (Sistema di Interscambio; SdI) came into force for the vast majority of Italian firms carrying out B2B and B2C transactions.Footnote 10 E-invoices fully replaced paper-based invoices for taxpayers with an annual turnover of more than € 65 thousand. According to the Italian Ministry of Economy and Finance (2018a), this threshold intends to cover 80% of all taxable persons in Italy. The initial start of the system dates back to June 2014, where e-invoicing became mandatory for transactions with ministries, tax agencies and national security agencies. From March 2015, all business-to-government (B2G) supplies were integrated into the system. This was followed by a voluntary adoption of e-invoicing for B2B in 2017. From July 2018, the use of the SdI was mandatory for the sale of fuels and became binding to all B2B and B2C transactions from January 1, 2019 (Italian Revenue Agency, 2021).

Italy enforces the system mainly by the imposition of different penalties when not using the SdI. These include the refusal of the input VAT deduction when no confirmed e-invoice is sent through the system and additional monetary fines.Footnote 11 The seller has to send the invoice file (Fattura PA) to the SdI, so that the tax authority acquires the information contained in the e-invoices in real time. However, in the first instance, the SdI only checks if the formal requirements are met. In a second step, the e-invoice data are transmitted to the tax authority that stores the e-invoices and uses automated and integrated processes to cross-check the consistency between the VAT declared and paid and also with other cross-border anti-fraud information sources (European Council, 2018; European Commission, 2021).Footnote 12 If the system accepts the formal validity, the seller obtains a receipt, while the buyer receives the invoice. Only through this procedure, the invoice is regarded as such for purposes of VAT and the acquirer can deduct the input VAT. Therefore, the taxable buyer should be sensitive to require an e-invoice before transferring the gross amount to the seller.Footnote 13

Moreover, fines are imposed when the SdI is not applied. The fines range between 90% and 180% of the VAT. As an exception, the regulations allowed taxpayers to avoid these fines if the e-invoice was uploaded to the SdI until the 15th of the following month during the first half of 2019.Footnote 14 Since then, an e-invoice has to be sent directly to the SdI to avoid the penalties.

In order to theoretically assess the compliance effects of e-invoicing, we distinguish between two broad types of non-compliance. On the one hand, non-compliant firms, in particular those under-reporting sales or over-reporting costs, and on the other hand the organized MTIC fraud. We focus on the latter, but presenting a short theoretical framework for both to justify how our model is able to isolate the effects on cross-border VAT fraud conceptually.

When an invoice has been sent through the SdI, the tax authority receives the transaction-based information shortly after and can perform cross-checks between taxpayers. This limits non-compliant firms to adjust their accounting records afterward. Therefore, keeping a certain level of tax evasion is thus very likely to be costlier after the reform, as these practices can potentially be exposed more quickly. In addition to increased costs of evasion, several benefits result from the system. Namely, the automation of invoice retention obligations, lower cost per invoice compared to paper invoices, streamlining of accounting processes and the availability of real-time accounting data (Italian Revenue Agency, 2021). Shedding light on the effect of switching from paper-based to electronic invoicing, Bellon et al. (2022) find for Peru that firms indeed increase reported sales, purchases and VAT liabilities on average. These results are stronger among small firms since they tend to be less compliant. For the introduction of e-reportingFootnote 15 of sales in Ethiopia, Mascagni et al. (2021) find that reported sales increase; however, firms also adjust reported cost upward. Therefore, curbing the positive tax collection effect by the reform, which nevertheless showed a net positive effect. Moreover, Fan et al. (2020) find a significant increase in VAT revenues after the introduction of digitally encrypted invoices in China.

In contrast to the non-compliance behaviour described above, organized cross-border fraud is likely to react differently to increasing digitalization. VAT fraudsters might hardly profit from any of the structural benefits resulting from the process digitalization. The reform confronts them with increased costs of fraud that can jeopardize their activities. Note that MTIC fraud differs from cases in which seller and buyer have an incentive to under- or over-report sales and costs, or even consensually carry out transactions without invoicing. Fraudsters make profits from the VAT collected that is not remitted to the tax authority. Regardless of whether the buyer is involved in the fraud, the invoice sent to the buyer determines the success. If the buyer is involved, the right to deduct the input VAT is essential to keep an overall profit from the scheme for the criminal organization.Footnote 16 In the event that the buyer is unaware, the fraudster has to pretend to be a compliant firm, as an invoice and inconspicuous transaction circumstances are central for the buyer to obtain an input tax deduction. The imposed penalties arising from not applying the electronic system should increase the incentive of taxpayers even further to take care not making business with fraudsters. Thus, the use of the SdI should increase the costs of fraudsters in each case (selling to involved or uninvolved firms) since the fraudster has to provide an unsuspicious e-invoice through the SdI for that a registration has to take place. Compared to paper invoices, electronic invoicing additionally poses a higher risk of detection for fraudsters, as the tax authority can cross-check the invoice data in real time.

2.2 E-invoicing and trade data gap

A growing literature that examines the effectiveness of measures against VAT fraud exploits discrepancies in double-reported trade data (Bussy, 2020; Kitsios et al., 2022; Stiller & Heinemann, 2019, 2023). Fisman and Wei (2004) first used these discrepancies to study tariff evasion on the product level between China and Hong Kong. This approach has found wide use in other studies related to tariff evasion (Javorcik & Narciso, 2008, 2017; Mishra et al., 2008; Stoyanov, 2012). In accordance to the vast literature, we define the ratio of exports to corresponding imports of product \(p\) at the 8-digit CN product level at time \(t\) from exporting country \(e\) to importing country \(i\) reported by country \(e\) and \(i\), respectively, as the trade data gap (TDG). Taking the natural logarithm on both sides leads to

$$\ln {\mkern 1mu} {\text{TDG}}_{{{\text{eipt}}}} = \ln {\mkern 1mu} {\text{Export}}_{{{\text{eipt}}}} - \ln {\mkern 1mu} {\text{Import}}_{{{\text{eipt}}}} = \ln \left( {\frac{{{\text{Export}}_{{{\text{eipt}}}} }}{{{\text{Import}}_{{{\text{eipt}}}} }}} \right)$$
(1)

Equation (1) implies positive ln TDG values for \(\frac{{{\text{Export}}}_{{\text{eipt}}}}{{{\text{Import}}}_{{\text{eipt}}}}>1\) (case with prevalent fraud) and negative values for \(\frac{{{\text{Export}}}_{{\text{eipt}}}}{{{\text{Import}}}_{{\text{eipt}}}}<1\), as well as the value zero for \(\frac{{{\text{Export}}}_{{\text{eipt}}}}{{{\text{Import}}}_{{\text{eipt}}}}=1\). Besides fraud, ln TDG can occur due to different valuations of exports and imports. Since exports are valued as free-on-board, while imports include also cost of insurance and freight, the latter should be slightly higher by default resulting in a slightly negative value (Eurostat, 2020).

European taxpayers operating across borders are generally obliged to report imports and exports not only in the domestic periodic VAT return but also in the Intrastat system. The application of the TDG as proxy for cross-border VAT fraud is based on the theoretical argument that the fraudster does not report imports in the Intrastat system, while the exporter does. Since the fraudsters import goods on a zero-VAT basis, i.e. without payable input VAT, there is no incentive to comply with the obligations to file tax returns and Intrastat declarations.Footnote 17 However, we only observe respective gaps if exporters report trade within the Intrastat systems, while fraudulent importers fail to do so. We rely on the assumption that exporters fulfill their reporting obligations. This assumption can be justified by the fact that the exporter does not have to be aware of the fraud. Even if involved, compliance with the declaration requirements could be used as an argument by the exporter to be unknowingly involved in the fraud in case of detection. Therefore, the exporter can claim the refund of the input tax. Such a line of reasoning does not help the fraudulent importer, as the tax due is not paid to the tax authority. The declaration of imports could possibly help the fraudster not to be detected immediately by the tax office. However, this strategy in the absence of tax payment can only work for a short time until the tax authority finds out that domestic VAT is not remitted.

Even though the implementation of Italy’s e-invoicing in 2019 targets domestic supplies and therefore has no direct impact on cross-border transactions. We expect a significant impact on TDG since e-invoicing increases the tax enforcement capabilities on domestic supplies and hence increases the costs for cross-border VAT fraud. As outlined above, the cross-border transaction is essential for the fraud scheme. If the fraudsters would acquire goods domestically, they would have to pay the input tax to the supplier and claim its refund from the tax authority. Importing the goods at zero rate from another EU Member State is less risky for the fraudsters. It has a liquidity advantage and allows them to charge a price lower than the net purchase price, as the fraudsters consider VAT as revenue, unlike the compliant taxpayers. Hence, they can undercut market prices for i.e. selling higher quantities (e.g. European Court of Auditors, 2019).

E-invoicing intends to increase compliance by non-compliant firms and to tackle cross-border fraud. We formulate the assumption that the TDG mainly reacts to the effect of e-invoicing on fraud instead of compliance changes by non-compliant firms. We argue that these firms are still incentivized to report an EU-import. Failure to declare the imports would preclude the deduction of the purchasing costs for income tax purposes. Against this background, we hypothesize:

Hypothesis

The introduction of mandatory e-invoicing in Italy significantly reduces the trade data gap and thus cross-border VAT fraud.

3 Data

We use Eurostat's freely accessible database,Footnote 18 which contains detailed information on exports and imports between EU Member States (intra-EU) for all goods distinguished by the 8-digit CN code, the most detailed level available. Data on intra-EU trade are based on Intrastat declarations by taxpayers exceeding the country-specific threshold (see Table 12 in Appendix) (Eurostat, 2020). To construct the TDG, we collect monthly data on traded products using the 8-digit CN code for intra-EU-imports to Italy from the 27 remaining EU countries reported by Italy and the corresponding intra-EU-exports reported by the remaining EU countries. For a robustness check, we extend this by analogous data for Greece, Lithuania, Romania and Slovakia as importing countries. The observation period ranges from January 2018 to December 2019, resulting in 12 months before and 12 months with mandatory e-invoicing in Italy (introduction of e-invoicing on January 1, 2019). Observations including the value of zero for exports and imports were omitted from the sample since our dependent variable requires nonzero values. We further exclude fuels from our baseline sample since these products were already subject to mandatory e-invoicing six months prior to the general introduction. Table 7 in Appendix presents the distribution of products across the product codes of our sample.

Table 1 presents the descriptive statistics for \({\text{ln TDG}}\) by treatment and control group. We use products falling under the reverse charge mechanism (RCM) as control group. These products should be unaffected by the fraud-reducing effect of the reform. We discuss the selection of this control group when we present the identification strategy below. We expect the mean \({\text{ln TDG}}\) to be (if at all slightly below) zero in case without fraud. The mean \({\text{ln TDG}}\) of the control group consisting of RCM products (TREAT = 0) before and after e-invoicing at 0.0266 and − 0.0332, respectively, is relatively stable and close to zero. In contrast, treatment products (TREAT = 1) show about ten times higher mean \({\text{ln TDG}}\) before e-invoicing (0.2624), which indicates potential fraud within this group. The respective mean of 0.0683 for TREAT in the period with e-invoicing is significantly lower and close to zero; however, it is still higher than its counterpart for the control group, indicating that some fraud activity could be left over. Nonetheless, these descriptive results give suggestive evidence that the mandatory e-invoicing system in Italy significantly affected the treatment group.

Table 1 Descriptive statistics of \({\text{ln TDG}}\)

4 Identification strategy

4.1 Empirical framework

According to our hypothesis, the application of the mandatory e-invoicing in Italy reduces cross-border VAT fraud. Thus, we estimate the following difference-in-differences model:

$$\ln {\text{TDG}}_{{{\text{ept}}}} = \gamma _{{{\text{ep}}}} + \lambda _{t} + \delta \left( {{\text{POST}}_{t} \times {\text{TREAT}}_{p} } \right) + \beta X_{{{\text{ept}}}} + \varepsilon _{{{\text{ept}}}}$$
(2)

\({{\text{POST}}}_{t}\) is a dummy equal to one from January 2019 on and zero otherwise. \({{\text{TREAT}}}_{p}\) is a dummy equal to one if a product \(p\) belongs to the treatment group and zero if the product is protected by RCM.Footnote 19

All non-RCM and non-FUELS products form the treatment group. As the control group, we use products that were most likely not affected by fraud in the run-up to the mandatory e-invoicing. Findings by Buettner and Tassi (2023), Bussy (2020), and Stiller and Heinemann (2019, 2023) provide theoretical and empirical support that the introduction of the RCM substantially tackles cross-border VAT fraud in the importing country as it excludes the fraudster from receiving the output tax. This domestic reverse charge procedure, implemented on certain products and services, shifts the liability to pay the VAT from the supplier to the buyer in B2B transactions. Therefore, fraudsters cannot take control over the VAT anymore, eliminating the incentive to trade with these products for fraudulent purposes. To provide additional evidence, we estimate the effect of the Italian RCM following the approach of Stiller and Heinemann (2023). For brevity, we refer to the description of the exercise and the results in Table 8 in Appendix. RCM significantly reduced the \({\text{ln TDG}}\) in the two main implementation events around April 2011 and May 2016 (see Table 8, Panel C, Column 3, Appendix) indicating a substantial decrease in cross-border VAT fraud.

Due to the hypothesized fraud-reducing effect of e-invoicing in Italy, we predict a negative coefficient \(\delta\) in Eq. (2). Our panel data enables us to include unit and time fixed effects. \({\gamma }_{{\text{ep}}}\) reflects unit fixed effects as exporting-country-8-digit CN code combinations and \({\lambda }_{t}\) represents time fixed effects as continuous month-year combinations.

\(X_{{{\text{ept}}}}\) is a vector of control variables that contains the variables \({\mathrm{THRESHOLD\ GAP}}_{{\text{et}}}\), \({\mathrm{REDUCED\ A}}_{p}\), \({\mathrm{REDUCED\ B}}_{p}\), \({\mathrm{REDUCED\ C}}_{p}\) and \({{\text{EURO}}}_{{\text{e}}}\). \({\mathrm{THRESHOLD\ GAP}}_{{\text{et}}}\) captures differences in reported exports and imports due to different thresholds for reporting obligations for these trade flows that each country is required to set within the Intrastat system (see for thresholds Table 12, Appendix).Footnote 20 The variables \({\mathrm{REDUCED\ A}}_{p}\), \({\mathrm{REDUCED\ B}}_{p}\) and \({\mathrm{REDUCED\ C}}_{p}\) are dummies equal to one if the VAT rate in Italy on the specific product \(p\) is reduced to 10%, 5% or 4%, respectively, and zero otherwise. These dummies serve to capture VAT rate effects.Footnote 21 If fraudsters take the VAT rate into account, as higher rates should technically increase their profits, reduced rate products should be unattractive for them. We therefore expect a lower \({\text{ln TDG}}\) for these products. Further, we include \({{\text{EURO}}}_{e}\) that serves to absorb differences in trade data that could occur due to currency conversion (Loschky, 2006). The variable drops as soon as unit fixed effects are included. The error term is represented by \({\varepsilon }_{{\text{ept}}}\). All variables with explanations are displayed in Table 10 (Appendix). See also Table 11 (Appendix) for descriptive statistics on all control variables.

4.2 Event study and parallel trends

Given our difference-in-differences approach, treatment and control groups must share similar pre-trends. We provide graphical and statistical evidence to test this assumption. Graphic A of Fig. 1 displays the simple mean values of \({\text{ln TDG}}\) for treatment and control group by each period. Treatment products show a significantly higher mean \({\text{ln TDG}}\) before the reform compared to control products. It is noteworthy that the control group exhibits stronger fluctuations than the treatment group. However, before the reform, both groups exhibit similar directions in terms of increases and decreases of \({\text{ln TDG}}\). In period 0 (January 2019) and 1 (February 2019), the treatment group shows a decreasing trend of \({\text{ln TDG}}\) in both months, while the control group increases. Beyond that, the treatment group lingers at a significantly lower level as prior to the reform, close to the level of the control group. A sharp decline of \({\text{ln TDG}}\) can also be seen already in the two months before the event. However, this occurred equally for both groups, potentially caused by reporting issues.

Fig. 1
figure 1

Development of ln TDG. Notes Graphic A shows the mean value of \({\text{ln TDG}}\) as defined in Eq. (1) for treatment (red) and control (blue) by each month within the 24-month observation window. Graphic B shows the event-study coefficients from Eq. (3). Grey lines indicate the 90% confidence interval (Color figure online)

To test the parallel trends assumption more formally and to obtain dynamic effects, we estimate an event study specification of Eq. (2) that reads

$$\begin{aligned} \ln {\text{TDG}}_{{{\text{ept}}}} = & \gamma_{{{\text{ep}}}} + \lambda_{t} + \mathop \sum \limits_{k\neq-1; k = - 12}^{ 11} \delta_{k} \left( {D^{k}_{t} \times {\text{TREAT}}_{{\text{p}}} } \right) + \beta X_{{{\text{ept}}}} + \varepsilon_{{\text{ept }}} \\ \end{aligned}$$
(3)

in which \({D}^{k}_{t}=1[t=Period_0+k]\) and thus includes dummies turning one when the reform is \(k\) months from the start of the reform in period \(k=0\). The period immediately prior to introduction (\(k=-1\)) is not included in the equation and represents the base period, which is set to zero by convention. This dynamic specification includes periods before (pre-trends) and after (dynamic effects) the introduction of e-invoicing. We expect \({\delta }_{k}\) to be around zero for \(k<0\) and negative for \(k\ge 0\). Graphic B of Fig. 1 presents the estimated event study coefficients. The picture reveals that with the exception of the periods close to the ends of the observation window, the coefficients are close to zero directly prior to the reform. In the first period of the event, the coefficient drops visibly and stays negative for the majority of the postreform periods.

We keep the observation window short so that other policy changes aiming at increasing compliance interfere minimally with pre- and post periods. However, we want to discuss briefly the implementation of certain other measures during the sample period.Footnote 22,Footnote 23 Before the SdI was technically ready to process cross-border invoice data in 2022, Italy first demanded so-called Spesometro declarations. From 2011 to 2018, Italian taxpayers were obliged to report invoice data including import and export information in a quarterly or bi-annual report. In 2019, the Esterometro replaced the system by implementing a mandatory monthly filing of VAT sales and purchases made to or acquired from non-resident businesses since the SdI did not include cross-border invoice data. However, these reports did not release taxpayers from the obligation to file Intrastat declarations.Footnote 24 In general, it cannot be ruled out that a shortening of the reporting period has an impact on cross-border VAT fraud. However, the planned introduction of Esterometro has been postponed to April 30, 2019. Moreover, the Italian Ministry of Economy and Finance has also extended filing deadlines of the predecessor regulation (Spesometro) for 2018.Footnote 25

5 Results

Table 2 presents our baseline results for \({\text{ln TDG}}\) displaying all variables included in the model. The coefficient of POST is negative, but statistically significant only when including unit or alternative time fixed effects and suggests a general decreasing trend of \({\text{ln TDG}}\) in Italy. TREAT, on the other side, shows positive coefficients throughout the specifications indicating that the treatment group suffered from higher fraud prior to the reform compared to the control group. This result confirms our rationale for identifying RCM products as a control group.

Table 2 Baseline results

The main variable of interest is the interaction of both variables. The corresponding coefficient is negative and statistically significant throughout the specifications (see Table 2, Columns 1 to 8). Noteworthy, fixed effects control for a large share of the variation as the adjusted R2 increases significantly after including unit fixed effects. Simultaneously, the coefficient of the interaction drops from − 0.136 in Column 3 to − 0.073 in Column 4 (see Table 2). The exclusion of the control variables does not change the results (see Table 2, Column 6).

For robustness, we modify unit and time fixed effects and include them on a higher hierarchy. We use exporter-4-digit HS codes instead of exporter-8-digit CN codes regarding unit fixed effects and quarter-years instead of month-years as time fixed effects. As expected, including the alternative set of fixed effects lowers the adjusted R2 since these fixed effects capture less variation. The interaction effect increases slightly in magnitude to − 0.103 (see Table 2, Columns 7 and 8). Nevertheless, we believe that the specification from Column 5 gives us the best estimate, adequately controlling for omitted variables and lets us observe the preferred within variation of exporter-8-digit CN codes combinations. Finding this robust negative effect throughout the specifications strongly supports our hypothesis that mandatory e-invoicing reduced cross-border VAT fraud in Italy. Regarding Column 5 (Table 2), the application of e-invoicing in Italy is associated with a reduction of the TDG by approximately 7%.

Our first control variable EURO is positively correlated with the dependent variable (see Table 2, Column 2; due to collinearity with unit fixed effects, the variable drops out from Column 3). This result could be explained by the fact that intra-Eurozone fraud avoids currency exchange risks and is therefore more lucrative. Concerning THRESHOLD GAP, the negative coefficient is plausible as the variable sets the reporting threshold for exports in relation to the reporting threshold for imports. An increase in this variable reflects a relative increase in non-reported exports to imports, which reduces \({\text{ln TDG}}\). Note that country-specific estimations for non-reportable trade below the thresholds are not included in the trade figures. REDUCED A and (in some cases) REDUCED C show a negative and statistically significant coefficient as well, indicating that reduced VAT rate products are less appealing to fraudsters. This result is reasonable since a lower tax rate reduces their profits. REDUCED B is omitted from all specification including fixed effects due to collinearity. We note that the control variables do not change the results in any way. However, we keep them throughout the regressions and robustness checks later on, as they might capture some specific fraud behavior that is not controlled for by fixed effects.

6 Robustness checks

6.1 Alternative control group

In our first robustness test, we want to address the concern that trade with RCM products could still contain fraud, since e.g. B2C transactions are not fully covered by this mechanism. We are convinced by the empirical evidence and our exercise from Table 8 (Appendix) that the RCM removes fraud to a significant extent. This can be underpinned by the nature of cross-border VAT fraud, which is based on high-value transactions taking place at the B2B rather than the B2C level. Nevertheless, we want to address this concern. Therefore, we additionally make use of an alternative control group to check the robustness of our initial results. We modify our empirical setting and replace the initial control group (RCM products in Italy) with non-RCM products in other importing countries that did not adopt e-invoicing. Considering the 2018 VAT gap study by Poniatowski et al. (2020), Greece, Lithuania, Romania and Slovakia show similar levels of VAT gaps for the year 2018 and are therefore used as an alternative control group.

According to the above-mentioned strategy, we modify Eq. (2) as follows. The dummy variable TREAT takes on the value of one if the importing country is Italy and zero if the importing country is Greece, Lithuania, Romania or Slovakia.Footnote 26 Note that the respective equation gets an additional subscript \(i\) since the variation now also stems from the fact that we observe different importing countries and country pairs. The correlation matrix (Table 13, Panel B) and the descriptive statistics (Table 14) regarding the alternative control group are displayed in Appendix.

Referring to the descriptive statistics, we can observe that the mean \({\text{ln TDG}}\) of the control countries is relatively low and increases slightly from 0.0167 to 0.0302 after the introduction of mandatory e-invoicing in Italy. Despite the high VAT gaps, the low TDG indicates less cross-border VAT fraud activity in these countries before the Italian reform. Figure 2, Graphic A displays the differences in the mean \({\text{ln TDG}}\) for treatment (Italy) and control countries (Greece, Lithuania, Romania and Slovakia) regarding non-RCM products. The level of the Italian \({\text{ln TDG}}\) is visibly higher pre-reform and significantly closer to the control units afterward. Graphic B of Fig. 2 presents the event-study coefficients that show a sharp decline from period 0, staying constantly on a negative level.

Fig. 2
figure 2

Development of ln TDG—alternative control group. Notes Graphic A shows the mean value of \({\text{ln TDG}}\) as defined in Eq. (1) for treatment (red) and alternative control (blue) group (non-RCM products in Greece, Lithuania, Romania and Slovakia) by each month of the 24 months observation window. Graphic B shows the event-study coefficients from Eq. (3) using the alternative control group. Grey lines indicate the 90% confidence interval (Color figure online)

Table 3 displays the regression table for our alternative control group specification. We can observe a negative and statistically significant interaction throughout the specifications. The coefficient of interest is − 0.201 (see Table 3, Column 4) in the main specification. Thus, non-RCM products in Italy show a considerably lower TDG after the introduction of e-invoicing compared to the control country units. Noteworthy, the magnitude is higher compared to the baseline results, reflecting a more severe decrease of the treatment group compared to the control group. This might be due to a spillover effect of the Italian reform on the control countries. In this regard, we observe a positive coefficient for the main effect POST when it is included. This indicates that within our alternative sample, and in the absence of the reform, the TDG would have developed slightly upward. This is contrary to our baseline results (we observed a downward trend in the absence of the reform) and may be indicative for the spillover hypothesis. Finally, the coefficient on TREAT is positive when included. This confirms that treatment products suffered higher fraud activities prior to the reform.

Table 3 Alternative control group

6.2 Alternative dependent variables

In this section, we check if our baseline results hold when we change the dependent variable. First, we use \({\text{ln TDG}}\) calculated analogous to Eq. (1) using quantities instead of values. Second and third, we winsorize and trim the value-based \({\text{ln TDG}}\) at the bottom and top 1% by each exporting country, respectively. Therefore, we try to control for outliers in the data. Fourth and fifth, we examine the effect on the natural logarithm of export and import, respectively. In this case, we include the opposite trade flow (ln Import or ln Export, respectively) into the model as control variables. Therefore, we test our estimation assumptions used in the following section, according to which we expect to observe falling exports and/or rising imports.

Table 4 presents the results for all described alternative dependent variables. The coefficient of − 0.063 for \({\text{ln TDG}}\) in quantities (see Table 4, Column 1) is statistically significant and comparable to our initial result (− 0.073 in Table 2, Column 5). This strongly confirms our baseline result and indicates that fraudsters underreport values and quantities, which strengthens the assumption that missing traders fail to report imports at all. Winsorizing and trimming \({\text{ln TDG}}\) and therefore excluding outliers hardly affect the quantity of the estimator (see coefficient in Table 4, Columns 2 and 3). That gives us additional confidence regarding our baseline model.

Table 4 Alternative dependent variables

Interestingly, the coefficient on the interaction regarding ln Export is insignificant (see Table 4, Column 4), suggesting that export values did not change after e-invoicing. On the other side, we find a positive and statistically significant coefficient regarding ln Import (see Table 4, Column 5). This result could be indicative that honest traders took over trade from fraudsters that left the market after the reform. Unlike the fraudsters, we expect compliant traders to declare the imports due to business expense deduction, which results in a positive coefficient. To this extent, it seems reasonable that export reporting did not change. However, our model cannot pick up the reason why we observe or not observe certain reactions in specific export and import behavior. Using trade data gaps is more sophisticated in detecting changes in fraudulent trade. Therefore, we leave the interpretation to the reader and assume in the following section that changes in \({\text{ln TDG}}\) can occur due to both decreases in exports and increases in imports.Footnote 27

6.3 Restricted treatment group

In this section, we restrict the group of treated products to those that fall under the same 2-digit and 4-digit HS code as the RCM products, respectively.Footnote 28 This procedure modifies the treatment group with the aim to make it more comparable to the control group as the baseline treatment group covers many different products. Note that throughout RCM products form the control group and do not change compared to the baseline approach. Table 5 presents the results. The statistically significant coefficient of the interaction within the 2-digit HS code sample is very close to our main result (− 0.074 in Table 5, Column 1 vs. − 0.073 in Table 2, Column 5). The corresponding coefficient from the regression based on the 4-digit HS code is with − 0.215 almost three times larger (Table 5, Column 2). In this case, the sample size is significantly smaller due to the reduction of the treatment group.Footnote 29 However, we find significant effects also by decreasing the number of treatment products and making them theoretically more similar to the control products, which supports the suggestive evidence gained so far.

Table 5 Alternative treatment groups

7 Quantification of fraud tackled by e-invoicing

The previous results provide suggestive evidence that e-invoicing tackled cross-border VAT fraud by decreasing the TDG. Throughout, the interaction coefficient \(\delta\) from Eq. (2) captures the decrease in the level of the TDG for treatment products due to e-invoicing. Therefore, exports (imports) of treatment products in 2018 were abnormally high (low) due to fraudulent activity. Mechanically, the TDG decreases when exports (imports) decrease (increase). Therefore, we use simple back-of-the-envelope calculations to estimate the amount of VAT revenue loss (\({\text{REVLOSS}}\)) in the year prior to the reform using the following formulas, separately based on exports and imports:

$${\text{REVLOSS}}_{{\tau }}^{{{\text{EXP}}_{{2018}} }} = {\text{EXPORT}}_{{\tau }}^{{2018}} \times \left( {1 - e^{{\delta _{{{\text{weighted}}}} }} } \right) \times {\text{VAT}}_{\tau }$$
(4)
$${\text{REVLOSS}}_{{\tau }}^{{{\text{IMP}}_{{2018}} }} = {\text{IMPORT}}_{{\tau }}^{{2018}} \times \left( {e^{{ - \delta _{{{\text{weighted}}}} }} - 1} \right) \times {\text{VAT}}_{\tau }$$
(5)

In simple terms, Eqs. (4) and (5) calculate the amount of export excess and import deficit resulting from abnormally high exports and abnormally low imports in 2018 backward from the TDG reduction observed with \(\delta\).Footnote 30 There are four different VAT rates (\({{\text{VAT}}}_{\tau }; \tau\)) for which we calculate \({\text{REVLOSS}}\) in Table 15 in Appendix. We use the sum of exports to Italy reported by the 27 exporting countries (EXPORT2018) and the sum of imports reported by Italy from the 27 exporting countries (IMPORT2018).

In general, the interaction coefficient \(\delta\) estimates the reduction of the TDG. Therefore, we recalculate the amount of exports or imports that have led to this increased TDG in 2018 compared to 2019. These exports and imports are the base of fraudulent trade assumed to be carried out domestically by fraudsters. Therefore, we multiply each export excess and import deficit with the respective VAT rate to obtain the amount evaded in the year prior to the reform. Table 15 (Appendix) outlines the detailed values used in the calculation steps. Note that in the case where Italy refused to refund input VAT to a taxable buyer, part of \({\text{REVLOSS}}\) was recovered. However, we could not find any statement of how much input VAT deduction was refused by Italy.

In the baseline regression, we estimate an unweighted average effect of the reform that could bias our estimation exercise if e.g. a product with significantly higher trade volume experiences a stronger or weaker decline after e-invoicing. To estimate \({\text{REVLOSS}}\), we re-run our baseline regression weighting each observation by export or import volume of a product relative to all other products prior to the reform, respectively. Combining this weighted approach with the estimation of export- or import-based values, gives us a range of four alternatives. If previously declared exports to the fraudsters are eliminated through e-invoicing, the TDG reduces (export-based estimation). The other possible outcome is that honest importers become more active after the fraudsters are pushed out of the respective market, which increases the declaration of imports, and therefore decreases the TDG (import-based estimation).

The results of the weighted regressions are displayed in Table 6. The coefficients of the interaction are statistically significant and reduced to − 0.056 using the export share as weight (Table 6, Column 3) and − 0.048 using the import share as weight (Table 6, Column 6) compared to the unweighted baseline result of − 0.073 (Table 2, Column 5). If we insert the two new coefficients into Eqs. (4) and (5), we obtain a range for \({\text{REVLOSS}}\) between € 2.2 billion and € 2.6 billion (see Table 15, Appendix, and in particular Columns 12 to 15).Footnote 31 Hence, e-invoicing tackled cross-border VAT fraud accounting for about 7% of overall uncollected VAT given the total VAT gap of Italy of € 32.415 billion in 2018.Footnote 32

Table 6 Results – weighted by export and import share

The Italian Ministry of Economy and Finance (2020) estimates the effect of e-invoicing for 2019 using macro-level data and calculates an unexplained residual between actual VAT revenue and the VAT revenue theoretically paid based on the economic cycle of € 1.7 billion and € 2.1 billion. Note that this approach captures all compliance increases by e-invoicing regardless of the cross-border nature. Comparability to our estimate is limited since Italy estimates a figure for 2019, while we are estimating the amount of fraud that could have been carried out in 2018. However, Italy stated that they identified “companies involved in intra-Community fraud mechanisms carried out between the last months of 2019 and 2020, based on invoicing flows for non-existent transactions amounting to around EUR 1 billion” (European Commission, 2021). Together with our results, this indicates that e-invoicing initially reduced cross-border VAT fraud but did not eliminate the fraud activities entirely.

By nature, we lack of precise proxies with regard to tax fraud, but the estimation underpins the significant extent of cross-border VAT fraud and helps to assess the effects of the reform as precisely as possible. Considering the comparably low investment cost of about € 3.7 million and running cost of the mandatory e-invoicing system, amounting to € 10 to 20 million a year, our results provide a strong argument in favor of this tool for combating VAT fraud (Italian Revenue Agency, 2021; Italian Ministry of Economy and Finance, 2018a).Footnote 33

The estimation results are subject to some external constraints. Clearly, the observed VAT revenue and fraud-reducing effect is an Italian specific estimate. Since the level of pre-reform cross-border fraud is an important factor regarding the effectiveness of the mandatory e-invoicing, generalizations for other countries from the results need to be made with caution. As we discussed regarding the parallel trends assumption, we cannot rule out some anticipation of increased fraud in 2018.

8 Conclusion

The numerous measures taken against VAT fraud (such as RCM), as well as the ongoing significant revenue losses, make studies on the effectiveness of these countermeasures particularly important. In 2019, Italy introduced a mandatory e-invoicing system for B2B and B2C supplies, taking a pioneering role in the EU in the timely recording and control of transactions. This paper examines the effect of digitalization in form of e-invoicing in Italy on cross-border VAT fraud using discrepancies in double-reported trade data between Italy and the remaining EU Member States on product flows based on the 8-digit product code. As control group, we use products falling under RCM since recent empirical evidence suggests the fraud-eliminating effect of this measure. All other products serve as the treatment group.

We find a significant reduction of cross-border VAT fraud with the introduction of the mandatory e-invoicing system. This result holds for a number of robustness checks. Additionally, we quantify the reform in Italy using a back-of-the-envelope calculation and estimate that cross-border VAT fraud in 2018 led to VAT revenue losses between € 2.2 billion and € 2.6 billion tackled by the reform. Our findings indicate a desirable fraud-reducing effect of a mandatory e-invoicing system easily exceeding the set-up and running costs of such system. Even though the e-invoicing system covered only domestic transactions, it demonstrates a considerable deterrent effect on cross-border VAT fraud activities. The results provide key insights into the benefits of digitalization.