1 Introduction

Rooted in marketization, an increase in competition is often promoted as a solution to problems in many markets (Brunjes, 2020; Dulleck et al., 2011; Gottschalk et al., 2020; Huck et al., 2016; Johansen & Zhu, 2014; Krachler et al., 2022; Rasch & Waibel, 2018). However, although increased competition can result in greater consumer surplus and market efficiency, in the context of credence goods markets, whether consumers benefit from it remains unclear. In credence goods markets, the seller knows more than the consumer about the type or quality of goods or services that best fit the consumer’s needs (Dulleck & Kerschbamer, 2006; Dulleck et al., 2011; Jing, 2011). The markets of repair services (e.g., automobile, home, and office equipment repair), medical treatment, consulting (e.g., financial or management consulting), and street-hail taxi rides are prime examples of credence goods markets (e.g., Balafoutas & Kerschbamer, 2020; Bhattacharya & Dugar, 2022; Harding et al., 2016). In these markets, the seller knows more about the quality that yields the highest surplus from trade than the consumer (Balafoutas et al., 2013), and it is difficult for the consumer to evaluate the quality of the goods or services received. Such asymmetry of information between sellers and consumers opens the door for sellers to engage in fraudulent behavior—to commit actions that involve fraud.

This study explores the specific issue of fraud in credence goods markets by focusing on policies that result in an elevation in competition through an increase in the number of sellers (with fixed prices that are exogenous to the sellers). Our frame for this analysis is the street-hail taxi ride market, a market in which defrauding passengers is common and costly (CBS Lucky, 2016; Mount, 2014; News, 2010); it also presents characteristics that make certain types of fraud more common, which causes certain types of consumers to be more prone to fraud.Footnote 1 Specifically, overtreatment (or overprovision; i.e., providing excessive treatmentFootnote 2) is common in taxi markets. In the context of taxi rides, overtreatment means that a driver takes unnecessary detours and deliberately drives a longer distance rather than taking the optimal route, in order to charge a higher fare. Consumers with limited knowledge about the optimal route (as opposed to local residents or frequent visitors) are more likely to experience overtreatment. Another aspect of the street-hail taxi market that may increase the potential for defrauding passengers is the absence of additional information about the seller’s past behavior (either through the consumer’s own past interactions or through public information). Although overtreatment is prevalent in the taxi market, undertreatment (or underprovision; i.e., providing insufficient treatment considering the amount paid) and overcharging (i.e., charging for a higher quantity or quality of service than what was provided) are less common. This is because of seller liability (as the driver must drop off the passenger at the destination, upon which payment is usually made) and verifiability (through the proper use of a taximeter).Footnote 3

Examining the incentives to defraud consumers in the general credence goods market may offer helpful information for understanding specific issues in the street-hail taxi market. In the credence goods market, the intensity of competition among sellers has been identified as influencing their incentives to defraud consumers differently (Dulleck et al., 2011; Mimra et al., 2016a; Rasch & Waibel, 2018). Intuitively speaking, an increase in competition should decrease sellers’ incentives to defraud consumers in the credence goods market because competition intensity provides consumers with an option to choose from several sellers, decreases search costs, and increases easy access to second opinions (Rasch & Waibel, 2018). However, a lack of (cheap) second opinions, the absence of the possibility of reputation building, and fixed prices that are exogenous to the seller (therefore, the seller has no direct control over demand) add specific considerations to the market for credence goods. These market considerations may lead to the opposite outcome in markets for credence goods, in that the elevation in competition provokes unethical behavior (among sellers) to avoid or compensate for losses.

In this study, we examine the policy of the NYC Taxi and Limousine Commission (TLC) to license a fleet of green taxis to improve access to street-hail transportation in areas outside of the Manhattan Core (i.e., from south of West 110th Street and East 96th Street; NYC TLC, 2013), and how that decision has affected the behavior of existing sellers (i.e., yellow taxi drivers) and new sellers (i.e., green taxi drivers).Footnote 4 The city’s decision to license green taxis splintered the yellow taxi market into routes in which yellow taxis may face more competition for pickups or for both pickups and post-drop-off pickups.Footnote 5

Specifically, we conducted a multistep analysis using a dataset of trips by yellow taxis in NYC along with the launch of new green taxis. First, we empirically investigated whether any evidence indicates a change in fraudulent behavior by yellow taxis following the launch of green taxis.Footnote 6 Second, we investigated the degree of change in fraudulent behavior by yellow taxis (after the launch of green taxis) depending on (i) whether yellow taxis have a monopoly over pickups or compete with green taxis for pickups and (ii) whether yellow taxis have a monopoly over post-drop-off pickups or compete with green taxis for them.

Third, we investigated the degree of change in the fraudulent behavior of yellow taxis depending on passengers’ familiarity with optimal routes; we compared a subset of yellow taxi trips in which the pickup/drop-off locations were office buildings and hotels. In addition, we investigated the degree of change in the fraudulent behavior of yellow taxis across the daily fare schedule and over time.

Our empirical investigation of the NYC taxi market, which comprised more than 17 million matched trips within 14 months, revealed more fraudulent behavior among existing players. We also found that yellow taxis’ degree of change in fraudulent behavior varied with competition intensity; it was highest on routes where yellow taxis compete with green taxis for both pickups and post-drop-off pickups. The change in the fraudulent behavior of yellow taxis was greater during the hours of a change in driving shifts, indicating a desire to meet an income target set as a reference point (Camerer et al., 1997; Kukavica et al., 2022). In addition, we found higher levels of change in fraudulent behavior by yellow taxis for passengers traveling to or from hotels (who tend to be tourists and may be less knowledgeable about optimal routes), whereas the same did not hold true for trips to or from office buildings (where passengers tend to be locals and more familiar with optimal routes).

The remainder of this paper is organized as follows. First, we provide an overview of existing literature and our contributions to current knowledge through this study. We then detail the regulations and mechanisms of the taxi market in NYC, and present our hypotheses. Next, we describe the dataset used in our empirical study, followed by an explanation of the model used and a discussion of the results. The penultimate section details our robustness checks, tests for the validity of the identification strategy, and empirical findings, and provides further insights to support our inferences. Finally, we conclude with the key insights and contributions to the literature.

2 Overview of existing literature and contributions of the current study

Empirical approaches to investigating fraud can be categorized by market, such as repair services (Beck et al., 2014; Kerschbamer et al., 2016, 2019; Mimra et al., 2016a; Schneider, 2012), healthcare (Angerer et al., 2021; Björkman Nyqvist et al., 2022; Domenighetti et al., 1993; Gottschalk et al., 2020; Hennig-Schmidt et al., 2011; Huck et al., 2016; Waibel & Wiesen, 2021), and taxi ride markets (Balafoutas et al., 2013; Rajgopal & White, 2019).

Previous studies have shown that consumers with inadequate information about the quality or value of a service are vulnerable to higher degrees of fraudulent treatment. Through a field experiment involving undercover visits to auto repair shops using a test vehicle, Schneider (2012) found evidence of overtreatment. Beck et al. (2014) confirmed these findings through a lab experiment. In a questionnaire-based study on the healthcare market in Switzerland, Domenighetti et al. (1993) found that physicians (and their family members) had lower rates of surgery than the general population (typically less informed than physicians about the suggested service). Almost three decades later, investigating the underlying reasons for fraudulent behavior in the healthcare market remains a vibrant area of study. In a field experiment involving a test patient in the dental care market, Gottschalk et al. (2020) observed overtreatment recommendations on more than every fourth visit. Through a lab experiment, Huck et al. (2016) found that insurance leads to more overtreatment, whereas more competition helps decrease overtreatment. Hennig-Schmidt et al. (2011) found that patients are overtreated under a fee-for-service system and undertreated under capitation.

Another significant market in which consumer defrauding is common is the street-hail taxi market. Specifically, past studies have found evidence of taxi drivers defrauding passengers who are unfamiliar with local routes and tariffs (Balafoutas et al., 2013), and who explicitly state that their expenses will be reimbursed (Balafoutas et al., 2017). Rajgopal and White (2019) and Liu et al. (2019) also found evidence of defrauding both passengers on trips outside NYC limits (where yellow taxis are not allowed to pick up passengers) and passengers on trips subject to a two-part tariff, respectively. In other studies on the NYC taxi market, Tang (2020) and Liu et al. (2021) found that yellow taxis take passengers on significantly longer detours for metered airport routes.

Past studies related to fraud in credence goods markets have explored the causes of inefficiencies in the markets by investigating sellers’ incentives to defraud consumers, specifically, the effect of institutional and market conditions, such as sellers’ liability, treatment verifiability, availability of a second opinion in the market, breaking up the joint provision of diagnosis and treatment, and sellers’ reputation building (e.g., Bester & Dahm, 2018; Dulleck & Kerschbamer, 2006; Dulleck et al., 2011; Mimra et al., 2016a). However, the literature indicates a mixed impact of competition on the fraudulent behavior of sellers in credence goods markets (Balafoutas & Kerschbamer, 2020). Although some studies have found that more intense competition decreases seller fraud by reducing consumers’ search costs for obtaining a second opinion (Huck et al., 2016; Rasch & Waibel, 2018), or that there is no significant effect of long-term competition on overtreatment (Gottschalk et al., 2020), others have shown contradictory results (e.g., Dulleck et al., 2011; Mimra et al., 2016a).

Our study contributes to the literature on credence goods markets by investigating whether these markets benefit from policies that increase competition. Specifically, our study contributes to the research stream by examining whether credence goods markets benefit from competition-boosting policies. Policymakers often promote competition as a means of marketization (see Krachler et al., 2022) to overcome market issues. Although marketization is exercised in diverse ways in different markets (e.g., see Krachler et al., 2022), the increase in the number of sellers (i.e., the concentration of sellers) is a common proxy that past studies have employed to measure the degree of competition (Johansen & Zhu, 2014; Rasch & Waibel, 2018). Our study contributes to this stream of literature by providing insight into the impact of increased competition on sellers’ incentives to defraud consumers. Furthermore, it extends the analysis from a focus on the healthcare and schooling markets to the market for street-hail taxi rides.

Additionally, our study contributes to the literature on credence goods markets by investigating whether policies promoting competition expose specific groups of consumers to higher levels of fraudulent behavior. For example, we investigated whether consumers with inadequate information about the optimal route (e.g., tourists, compared to locals; see also Balafoutas et al., 2013) are vulnerable to higher levels of fraudulent behavior.

3 Details of the taxi market in NYC

3.1 Launch of green taxis in NYC

In August 2013, the NYC TLC (which is responsible for a range of licensing and regulation matters for the city’s taxis) began licensing green taxis. By the end of February 2014, 4,200 green taxis (approximately 30% of the total number of yellow taxis) were licensed.Footnote 7 The goal of licensing green taxis was to improve access to street-hail transportation outside of the Manhattan Core (NYC TLC, 2013, 2014) by providing "… yellow-caliber taxi service to the boroughs since 94% of yellow taxi pick-ups occur either in Manhattan or at one of the airports" (Bloomberg & Yassky, 2014, pp. 1–2; see also Panel A of Fig. 1).

Fig. 1
figure 1

Examples of Trips by Yellow Taxis in NYC and Areas of NYC Where Green Taxis Are/Are Not Allowed to Pick Up Passengers. Notes: In Panel A, darker colors represent a larger number of pickups. Similar patterns existed for pickups using yellow taxis before 2012. In Panel B, the exclusionary zone is highlighted in yellow. Green taxis can pick up passengers by street-hail outside the Manhattan exclusionary zone and by pre-arranged reservation (only) at airports (highlighted in grey; source: NYC TLC). In Panel C, the blue dots represent the pickup locations of trips in CR EN, and the red dots represent the drop-off locations of the respective pickups outside the exclusionary zone

3.2 Regulation of fare and activity area of the taxi market in NYC

Both yellow and green taxis have the same fare structure: (i) standard-rate fare (i.e., a two-part fare consisting of a fixed fee and a variable fee) and (ii) flat-rate fare (e.g., fares for trips between John F. Kennedy (JFK) International Airport and Manhattan). In the time frame of this study, standard-rate fare passengers would pay a fixed amount of $2.50 after pickup, plus a $0.50 variable fee per "unit." Each unit is calculated as 1/5 of a mile or 60 s, depending on the speed of the taxi. If the taxi drives faster than 12 miles per hour (mph), then the unit of variable cost is defined as 1/5 of a mile. Otherwise, the unit of variable cost is defined as 60 s. However, under the flat-rate fare, the passenger would pay a fixed amount of $52 plus other surcharges (e.g., tolls, or rush-hour and overnight surcharges).

Unlike yellow taxis, green taxis are not authorized to accept street-hail passengers from south of West 110th Street and East 96th Street (i.e., the Manhattan Core), which is considered an "exclusionary zone" where yellow taxis have a monopoly on picking up passengers (see Panel B of Fig. 1). However, green taxis may drop passengers off at any location.

4 Set-up and predictions

4.1 Set-up of competitive routes

This study divides NYC into four Competitive Routes (CR EECR NN; see Table 1), determined by whether yellow taxis compete with green taxis for pickups or post-drop-off pickups. As Table five shows, depending on the competitive route, yellow taxi behavior is affected by a combination of conditions: (i) whether they are competing with green taxis to pick up passengers or whether they have a monopoly over it, and (ii) whether they are competing with green taxis to pick up passengers post-drop-off or whether they have a monopoly over it. The competitive nature of this structure means that the launch of green taxis in NYC, which was primarily intended to improve access to street-hail transportation (Bloomberg & Yassky, 2013, 2014; NYC TLC, 2013, 2014), has the potential to affect the behavior of yellow taxis with regard to both the pickup and drop-off locations of the trip.

Table 1 Summary of Competitive Routes of NYC Street-Hail Taxi Market

4.2 Hypotheses and research question

Before the launch of green taxis in NYC, yellow taxis had a monopoly on street-hail pickups in all five boroughs of NYC, that is, the entire NYC. However, after the introduction of green taxis, their monopoly shrank to the exclusionary zone, leaving them to compete with green taxis for picking up passengers from other areas of NYC. Yellow taxis have continued to enjoy the same level of demand in CR EE (where they have a monopoly over pickups and post-drop-off pickups) before and after the launch of green taxis.

Hence, we make the following hypothesis.

  • H1: After the launch of green taxis, yellow taxis do not engage in higher levels of fraudulent behavior on trips within CR EE.

Even after the launch of green taxis, yellow taxis are the only available street-hail option for exclusionary zone passengers whose destination is the non-exclusionary zone. In the context of yellow taxis, the shrinkage of their monopoly over street-hail pickup areas from the whole of NYC to the exclusionary zone is considered a loss, and may motivate them to defraud passengers (more than they used to).

Yellow taxis already know that for trips from the exclusionary zone to the non-exclusionary zone (i.e., CR EN, where they held a monopoly before the launch of green taxis), they must compete with green taxis to pick up passengers post-drop-off. In credence goods markets, ex-ante, one cannot draw a straightforward conclusion that the benefit that consumers receive from lower prices (caused by competition among sellers) exceeds the loss they may face at the hands of fraudulent sellers who are incentivized to defraud by the lowering of prices (Kerschbamer & Sutter, 2017). However, exogenous prices to the seller (in our case, it means taxis having no control over the fare as NYC TLC regulates it) leave sellers with limited control over increasing their demand (e.g., by lowering prices/fares), and an increase in competition in such a situation may provide sellers with a stronger incentive to defraud passengers.

Additionally, previous studies have observed that NYC cab drivers may set an income target as a reference point and attempt to meet their income targets (Camerer et al., 1997; Kukavica et al., 2022). Since the launch of green taxis, yellow taxis have to search more for post-drop-off pickups on trips within CR EN, which delays their achieving the specific income target. In this regard, taking detours may be a way to compensate for the time needed to reach their income target (and the high opportunity cost of leaving the exclusionary zone) in order to break away from the constraints of a more competitive situation.

Unlike in CR EN, where yellow taxis have a monopoly over street-hail pickups, in CR NN, they must compete with green taxis for both pickups and post-drop-off pickups. In other words, in contrast to CR EN trips, for CR NN trips, yellow taxis must compete at least once more with green taxis post-drop-off (i.e., competing for pickups in addition to competing for post-drop-off pickups, as in trips within CR EN). Such a setting for trips within CR NN may provide yellow taxis with an even stronger motivation to defraud passengers on these trips.

Thus, we make the following hypotheses.

  • H2a: After the launch of green taxis, yellow taxis engage in higher levels of fraudulent behavior on trips within CR EN as well as CR NN.

  • H2b: After the launch of green taxis, yellow taxis engage in higher levels of fraudulent behavior on trips within CR NN than on trips within CR EN.

On trips within CR NE, the yellow taxis return to their exclusionary zone. Upon return, they experience a demand for post-drop-off pickups similar to what they had experienced before the launch of green taxis, yet they have to compete with green taxis to pick up passengers. Moreover, trips within CR NE often follow trips within CR EN/CR NN, where we hypothesize that yellow taxis had engaged in fraudulent behavior. In this regard, a priori, it is difficult to predict whether yellow taxis will engage in more fraudulent behavior on trips within CR NE. Accordingly, instead of formally formulating a hypothesis, we explore the effect of the launch of green taxis on yellow taxis in CR NE by investigating the following research question:

  • RQ: Do yellow taxis engage in higher levels of fraudulent behavior on trips within CR NE since the launch of green taxis?

5 Data

Figure 2 displays the number of monthly street-hail pickups by yellow and green taxis from January 2012 to January 2015 and August 2013 to January 2015. As Fig. 2 illustrates, after the launch of green taxis, the total number of monthly pickups in NYC increased compared to the number of pickups in the same month in previous years. Moreover, Fig. 2 shows that, overall, the number of monthly yellow taxi pickups decreased compared to the number of pickups in the same month in previous years. These observations can be considered an overall increase in the number of pickups outside the exclusionary zone and an overall decrease in the share of the yellow taxi market.

Fig. 2
figure 2

Monthly Number of Street-Hail Trips by Yellow and Green Taxis in NYC. Notes: The number of trips is based on the number of pickups reported by the NYC TLC Taxi Passengers Enhancement Program. The solid vertical lines represent the beginning and end dates of the trips in the dataset. The dashed vertical line represents the launch date of green taxis

We limited the timeframe of our analysis to the period from January 2013 to March 2014 for two reasons. First, an increase in the variable fee (from $0.40 to $0.50) implemented in September 2012 would affect our conclusions; therefore, we considered January 2013 as the beginning date of our analysis (i.e., seven months before the launch of green taxis).

Second, as Fig. 2 illustrates, during the first few months after the launch of green taxis in NYC, the monthly number of pickups by green taxis increased steadily until it remained stable around March, 2014. Therefore, to ensure that our empirical analysis captured the effect of the launch of green taxis on the fraudulent behavior of yellow taxis (and to avoid the launch and popularity of peer-to-peer ridesharing in the later months of 2014 affecting our results), we focused on data of yellow taxi trips seven months after the launch of green taxis (i.e., end of February 2014; see Fig. 2).

As we used data provided by the NYC TLC, it was necessary to carry out several steps to prepare the dataset (see Table 5 of Appendix A for details). We used the NYC shapefile to identify yellow taxi pickup and drop-off neighborhoods (districts within boroughs, as in the example of SoHo in Lower Manhattan). By identifying the pickup and drop-off neighborhoods of a trip, we can determine the CR of the respective trips and focus on trips within the NYC boundaries. For example, Panel C of Fig. 1 shows a screenshot of a sample of trip pickup and drop-off locations in CR EN.

For our dataset, we matched trips before the launch of green taxis with those after the launch of green taxis. We considered trips within an approximate radius of 0.025 miles to have the same pickup or drop-off location (i.e., approximately half the block size in Manhattan (Fréchette et al., 2019)). Precisely, we matched trips (i) on an hourly basis, by breaking down a day into 24 h, and (ii) that have both the same pickup and drop-off locations.

Matching trips (i.e., matching yellow taxi trips before the launch of green taxis with yellow taxi trips after the launch) on an hourly basis according to their pickup and drop-off locations is an important step in our analysis. First, matching trips allowed us to create a balanced sub-sample of yellow taxi trips with the same pickup and drop-off locations. This allowed us to isolate the effect (i.e., the launch of green taxis) by excluding unmatched yellow taxi trips, for example, from new (existing) locations that may (may not) become (un)popular owing to the launch of green taxis. Second, matching allowed us to create a subsample of trips that are comparable to each other and, consequently, to further isolate different variations, such as those due to traffic congestion throughout the day or the pickup or drop-off location of the trip, affecting our analysis.

Table 2 summarizes the number of trips subject to the standard-rate fare and compares the distance, duration, and fare in each CR before and after the green taxi launch. Table 2 shows that the sample of (matched) trips was balanced before and after the launch of green taxis across the four CRs. Table 2 also reveals that the average speed of trips in CR EE before the launch of green taxis is slower than 12 mph (10.21 = 1.48 miles / (8.70 / 60) h), indicating that the duration of the trip is the main driver of the fare. However, for trips within CR EN, CR NE, and CR NN, similar calculations reveal that the average speed of trips is higher than 12 mph, indicating that the distance of the trip is the main driver of the fare.

Table 2 Summary of Matched Yellow Taxi Trips (Subject to Standard-Rate Fare) Considered for Empirical Analysis

Moreover, Table 2 shows that the trip distance, duration, and fare of yellow taxi trips within CR EN, CR NE, and CR NN (at a significance level of 5%) did not decrease after the launch of green taxis. Specifically, except for the distance of trips in CR NE, the trip characteristics of yellow taxis increased within these routes after the launch of green taxis, which seems to provide some model-free evidence for H2a.

6 Description of the statistical model

Using matched trips, we investigated the effect of the launch of green taxis on the natural log-transformation of the distance of trips (by yellow taxis) separately for each competitive route. Specifically, for each competitive route, we investigated whether yellow taxi trip distances changed after the launch of green taxis. For this, we use:

$$\begin{array}{l}\ln\left({\mathrm{Trip}\;\mathrm{Distance}}_{\mathrm{jmd}}\right)={\beta}_0+\theta\times{\mathrm{GL}}_{\mathrm m}\\\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;+\left[\gamma\times{\mathrm{GLT}}_{\mathrm m}+\alpha\times{\mathrm T}_{\mathrm m}+\lambda\times\mathrm T_{\mathrm m}^2\right]\\\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;+{\beta}_1\times{\mathrm{No}.\;\mathrm{of}\;\mathrm{Yellow}\;\mathrm{Pickups}}_{\mathrm m}+{\beta}_2\times{\mathrm{No}.\;\mathrm{of}\;\mathrm{Green}\;\mathrm{Pickups}}_{\mathrm m}\\\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;+{\beta}_3\times{\mathrm{Toll}}_{\mathrm j}+\left[{\beta}_4\times{\mathrm{Overnight}}_{\mathrm j}+{\beta}_5\times{\mathrm{Ruch}\;\mathrm{Hour}}_{\mathrm j}\right]\\\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;+\left[{\beta}_6\times{\mathrm{Credit}\;\mathrm{Card}}_{\mathrm j}+{\beta}_7\times{\mathrm{Unknown}}_{\mathrm j}\right]\\\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;+\lbrack{\beta}_8\times{\mathrm{Two}\;\mathrm{Passengers}}_{\mathrm j}+{\beta}_9\times{\mathrm{Three}\;\mathrm{Passengers}}_{\mathrm j}+{\beta}_{10}\times{\mathrm{Four}\;\mathrm{Passengers}}_{\mathrm j}\\\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;+{\beta}_{11}\times{\mathrm{Five}\;\mathrm{Passengers}}_{\mathrm j}+{\beta}_{12}\times{\mathrm{Six}\;\mathrm{Passengers}}_{\mathrm j}\rbrack\\\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;+\left[{\beta}_{13}\times{\mathrm{Rain}}_{\mathrm d}+{\beta}_{14}\times{\mathrm{Snow}}_{\mathrm d}+{\beta}_{15}\times{\mathrm{Max}\;\mathrm{Temp}}_{\mathrm d}+{\beta}_{16}\times{\mathrm{Min}\;\mathrm{Temp}}_{\mathrm d}\right]\\\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;\;+\beta_{17}\times{\mathrm{MFIN}}_d+\varepsilon_{jmd},\end{array}$$
(1)

where the dependent variable ln (Trip Distancejmd) is the natural log-transformed distance of trip j (in miles) in month m (= {1, 2, …, 14}) and on day d (= {(January 1, 2013), …, (February 28, 2014)}) in the respective competitive route. Therefore, the coefficients on the right-hand side of Eq. (1) represent the percentage change in our dependent variable, due to a "one unit change" in the independent variable.

We denote August 2013 (i.e., m = 8) as the month of the launch of green taxis. The variable GLm, which is a step dummy variable that takes a value of 1 when m ≥ 8 and zero otherwise, is used to allow for a level shift at the time of the launch of green taxis. Consequently, coefficient θ in Eq. (1) captures the percentage change in the distance of trips after the launch of green taxis.

To avoid unobservable time-varying effects on the dependent variable (i.e., trip distance) in Eq. (1), and to control for the nonlinear effect of time, we include a deterministic (monthly) trend variable Tm and a quadratic term for the deterministic trend variable Tm2. Essentially, variable Tm takes on values from 1 to 14 from January 2013 to February 2014. The variable GLTm takes the value of m − 8 + 1 when m ≥ 8 and zero otherwise. As such, this allows for the growth rate of the trend curve at the time of the launch of green taxis.

No. of Yellow Pickupsm is a variable representing the number of monthly pickups by yellow taxis (in millions) in month m. Moreover, to control for the increasing activity of green taxis, we control for the number of monthly pickups by green taxis, No. of Green Pickupsm (in millions). Controlling for the number of monthly pickups also allows us to capture the seasonality effect.

Tollj is a dummy variable equal to one if trip j includes a toll. Overnightj and Rush Hourj are dummy variables that control for variations in traffic congestion, and are equal to one for overnight and rush-hour trips. The standard-rate fare structure also sets a fixed rush-hour surcharge of $1.00 for trips between 4:00 p.m. and 8:00 p.m. on weekdays. An additional fixed charge of $0.50, an overnight surcharge, applies to trips between 8:00 p.m. and 6:00 a.m.

In Eq. (1), Credit Cardj and Unknownj are dummy variables equal to one if trip j is paid by credit card or has an unknown payment, respectively. Moreover, we control for the number of passengers on trip j by introducing dummy variables. Raind, Snowd, Max Tempd, and Min Tempd control for the level of rain or snow (in inches) and the maximum and minimum temperatures (in Fahrenheit) in NYC on day d, respectively (Table 12 of Appendix B summarizes the sources of the data that we utilize for our analysis).

MFINd is the daily stock price of the Medallion Financial Corp (NASDAQ: MFIN), a publicly traded company with leading positions in markets that include taxi medallion lending, and is a good proxy for taxi market performance in NYC. Finally, we used robust standard errors to estimate Eq. (1).

Under the standard-rate fare structure, the trip fare is a function of the combination of trip distance, taxi speed, and the time taken for the respective trip. Therefore, to obtain a complete understanding of the effect of the launch of green taxis, we use both the natural log-transformed trip duration and trip fare as separate dependent variables and run a regression on the same independent variables as in Eq. (1).

7 Results

Table 3 presents the effect of the launch of green taxis based on our regression results for all four CRs (for the robustness of the results in Table 3, see Panel A of Fig. 3). Table 3 shows that after the launch of green taxis in NYC, the distance of yellow taxi trips in CR EE increased by 2%. A 2% increase in the trip distance in CR EE translates to less than 0.03 miles (< 0.05 miles, which is the approximate block size in Manhattan; see Table 2 for mean distance, duration, and fare of trips in CRs before the launch of the green taxis). However, we found that the trip duration and trip fare in CR EE decreased by 4% and 2%, respectively (i.e., < 0.5 min and $0.16, respectively).

Table 3 Effect of the Green Taxi Launch on Distance, Duration, and Fare of Yellow Taxi Trips Subject to the Standard-Rate Fare

Trips in CR EE constitute short routes (see Table 2) and occur in an area with higher demand (i.e., the exclusionary zone), making (further) defrauding current passengers by increasing the variable fee less attractive than the income generated by the fixed fee of picking up a new passenger. These results, in line with the findings of past literature (Dulleck & Kerschbamer, 2006; Liu et al., 2021), indicate that yellow taxis are advantaged by taking as many trips as possible to exploit the proportionally large fixed fee of the fare (i.e., $2.50, which equals a detour of 5 min for trips moving slower than 12 mph) at the expense of a one-unit decrease in the variable fee.Footnote 8 As in the case of H1, these findings do not provide evidence of the (higher) fraudulent behavior of yellow taxis on trips within CR EE.

Table 3 shows that in CR EN, the distance of yellow taxi trips significantly increased by 5% after the launch of green taxis (i.e., approximately 0.33 miles). The trip duration and trip fare in CR EN also increased by 5% and 4%, respectively (i.e., approximately 1 min and $0.86, respectively). These findings are in line with H2a, indicating that the shrinkage in the monopolistic area of yellow taxis from the whole of NYC to the exclusionary zone, as a consequence of the launch of green taxis in NYC along with the elevation in competition post-drop-off, provides yellow taxi drivers with conditions that provoke the further defrauding of passengers in CR EN.

Our results in Table 3 indicate that trips within the non-exclusionary zone face the highest percentage increase—a 17% increase (i.e., approximately 0.80 miles in CR NN). Moreover, the trip duration and trip fare in CR NN also increased by 4% (approximately 0.5 min) and 10% (approximately $1.50), respectively. These findings are consistent with H2a and H2b.

For trips within CR NE, we found that the distance traveled increased by 5%. However, the trip durations decreased by 6%, and the trip fares did not change significantly (Table 3). Together, these findings do not provide evidence for a change in the fraudulent behavior of yellow taxis. However, they reveal that yellow taxis drive longer distances faster for trips from the non-exclusionary zone to the exclusionary zone (with a greater opportunity for pickups). These results provide an answer to the RQ. The underlying reason for this observation may be that yellow taxis drive to their "base" (i.e., monopoly area) for trips in CR NE, where they enjoy high demand and do not need to compete with green taxis for post-drop-off pickups. Consequently, a $2.50 fixed fee from picking up a new passenger may outperform detouring the current passenger.

8 Validity and robustness of the results

8.1 Robustness checks and flat-rate fare trips

We tested the robustness of our results through a battery of sensitivity analyses; the findings are presented in Table 3 (also see Panel A of Fig. 3). Using a specification curve (SPECC), we visualized how the coefficient magnitudes and precision varied across different specifications (see, e.g., Gathergood et al., 2021). Overall, we find that our central findings detailed in the Results section remain robust across various specifications, thus supporting our main conclusions. Moreover, we checked the robustness of our results using trip distance, trip duration, and trip fare as dependent variables (instead of their log-transformed values); our main conclusions in the Results section remain the same (see Table 6 of Appendix A).

To further increase our confidence in the results, we investigated whether the launch of green taxis in NYC affects the trip distance and trip duration of yellow taxi trips subject to a flat-rate fare. We considered 173,063 trips between Manhattan and JFK International Airport, subject to a flat-rate fare (i.e., $52 plus other surcharges).

Intuitively, after the launch of green taxis in NYC, yellow taxis should have no incentive to increase the distance of trips subject to a flat-rate fare. This conclusion is confirmed by the results illustrated in Panel B of Fig. 3, which show that the distances of yellow taxi trips subject to a flat-rate fare did not increase significantly after the launch of green taxis in NYC. Moreover, the results in Panel B of Fig. 3 do not show an increase in the trip durations of flat-rate fare trips, providing evidence that after the launch of green taxis, yellow taxis did not drive slower in these routes (see Table 7 of Appendix A for results from the model with all independent variables and Table 8 for respective results without log-transformation) Fig. 3.

Fig. 3
figure 3

Specification Curves Illustrating Effect of the Green Taxi Launch on Distance and Duration (and Fare) of Yellow Taxi Trips. variables and competitive routes. The black-filled (grey-filled) dots represent the presence (absence) of the respective control variables in the regression. The blue dots represent the estimations from the models with all control variables. Trips in CR EE have no tolls. For full regression results, see Appendix G (for Panel A) and Appendix H (for Panel B). Reading Examples: (i) the left (grey) highlighted column in Panel A represents the coefficient for the variable GLm (i.e., θ) in CR EE estimated from Eq. (1) (for full results, see results under column CR EE and ln (Trip Dist.) of Table 3); (ii) the right (grey) highlighted column in Panel A represents the coefficient for the variable GLm (i.e., θ) in CR NN estimated from the following modified version of Eq. (1): \(\ln\left(\mathrm{Trip}\;{\mathrm{Distance}}_{\mathrm{jm}}\right)={\beta}_0+\theta\times{\mathrm{GL}}_{\mathrm m}+\left[\gamma\times{\mathrm{GLT}}_{\mathrm m}+\alpha\times{\mathrm T}_{\mathrm m}+\lambda\times\mathrm T_{\mathrm m}^2\right]+{\beta}_1\times\mathrm{No}.\;\mathrm{of}\;\mathrm{Yellow}\;{\mathrm{Pickups}}_{\mathrm m}+{\beta}_2\times\mathrm{No}.\;\mathrm{of}\;\mathrm{Green}\;{\mathrm{Pickups}}_{\mathrm m}+{\beta}_3\times{\mathrm{Toll}}_{\mathrm j}+\left[{\beta}_4\times{\mathrm{Overnight}}_{\mathrm j}+{\beta}_5\times\mathrm{Rush}\;{\mathrm{Hour}}_{\mathrm j}\right]+\left[{\beta}_6\times\mathrm{Credit}\;{\mathrm{Card}}_{\mathrm j}+{\beta}_7\times{{Unknown}}_{\mathrm i}\right]+{\varepsilon}_{\mathrm{jm}}\)  

8.2 Decomposition of trips by borough

To further check the validity of our main findings (i.e., H2a), we decomposed trips in CR EN and CR NN (where we found strong evidence for the fraudulent behavior of yellow taxis) by their drop-off and pickup boroughs, respectively. We then ran a regression on the same independent variables as in Eq. (1) (see Fig. 5 in Appendix A). Overall, the results from the decomposition of trips by the pickup/drop-off borough do not reveal inconsistent patterns (compared to our main results, presented in Table 3) and are in line with our main conclusions. Specifically, we found that after the launch of green taxis, the trip distance, trip time, and trip fare of yellow taxi trips in CR EN and CR NN did not (significantly) decrease.Footnote 9

8.3 Variation in treatment of yellow taxis

We conducted different analyses to verify the validity of our results and generate additional insights. We mainly focused on matched yellow taxi trips (subject to standard tariff fares) in CR NN (the competitive route with the highest level of fraudulent yellow taxi behavior).

8.3.1 Consumer information

In the context of credence goods, past studies have highlighted the role of consumer information in improving their position (e.g., Balafoutas & Kerschbamer, 2020). Therefore, we checked the concurrent validity of our results by investigating whether the heterogeneity in passenger information about the optimal route affected the degree to which they were defrauded. We used matched yellow taxi trips (subject to standard tariff fares) in CR NN and conducted two different analyses.

We differentiated between (i) trips for which the pickup/drop-off is near office buildings (as a proxy for passengers’ familiarity with the optimal route, which provides them with some degree of diagnosis ability) and (ii) trips for which the pickup/drop-off is near a hotel (as a proxy for passengers’ unfamiliarity with the optimal route). We then investigated whether after the launch of green taxis, passengers on trips in (i) are treated differently than those on trips in (ii).

Specifically, we considered trips for which there was an office building or hotel within a 0.025 miles radius of a passenger’s pickup/drop-off. To obtain location data for office buildings and hotels, we used information collected by The Primary Land Use Tax Lot Output (PLUTO) and provided by the Department of City Planning (DCP; see Appendix B). Next, we used the natural log-transformed trip distance, trip duration, and trip fare as dependent variables and ran a regression on the same independent variables as in Eq. (1), using trips for which the pickup/drop-off was near an office building and a hotel.

We found that when the pickup/drop-off occurred near an office building, the distance, duration, and fare of trips remained unchanged after the launch of green taxis (see Table 4).Footnote 10 However, our results showed that after the launch of green taxis, trips for which the pickup/drop-off was near a hotel experienced 24%, 22%, and 21% increases in distance, duration, and fare, respectively.

Table 4 Robustness Checks: Effect of Green Taxi Launch on Distance, Duration, and Fare of Yellow Taxi Trips to/from Office Buildings and Hotels in CR NN and Subject to Standard-Rate Fare

8.3.2 Daily treatment of yellow taxis

We checked whether the fraudulent behavior of yellow taxis changed throughout the hours of the day on routes within CR NN; we used the natural log-transformed trip distance, trip duration, and trip fare as dependent variables and ran regressions similar to Eq. (1), with the following changes: (i) we controlled for hours of the day and the interaction of the hours of the day with the step dummy (i.e., GLm) and (ii) we removed Overnightj (due to collinearity; see Appendix C and the respective results in Table therteen).

Our results show that trips carried out between 1:00 p.m. and 5:00 p.m. have a relatively greater increase in distance, duration, and fare, indicating a greater change in fraudulent behavior in CR NN during these hours. This finding is important when noting that around 5:00 p.m. is the traditional time for yellow taxis to change their working shifts (de Blasio & Joshi, 2016; Grynbaum, 2011). These observations echo the notion of an income reference point for taxi drivers, indicating that taxi drivers may set an income target as the reference point (Camerer et al., 1997; Kukavica et al., 2022). The increase in the fraudulent behavior of yellow taxis before the change in their working shifts may be a result of attempts to reach their unmet income targets (driven by the decrease in their market share as a consequence of the introduction of thousands of green taxis in a span of a few months).Footnote 11

8.3.3 Treatment of yellow taxis during the months after the launch of green taxis

We also investigated the fraudulent behavior of yellow taxis over time. In particular, we examined the fraudulent behavior of yellow taxis during the months following the launch of green taxis on trips within CR NN. To do so, we ran regressions similar to those in Eq. (1) with the following modification: We introduced six dummy variables, one for each month following the launch of green taxis, in our dataset and considered August 2013 as our base level (see Appendix D).

Our results revealed that from September 2013 to February 2014 (i.e., the months after green taxis were launched), the trip distance, trip time, and trip fare of yellow taxis within CR NN were not higher than those during August 2013 (i.e., the month in which green taxis were launched). Specifically, we found that within the months after green taxis were launched, yellow taxis had a relatively lower distance and fare compared to the month in which green taxis were launched (i.e., August 2013). These findings can be interpreted as an elevation in competition, leading to an (abrupt) increase in the fraudulent behavior of sellers, while its magnitude diminished over time.Footnote 12

8.4 Validation of the identification strategy

In our primary analysis, we matched yellow taxi trips before and after the launch of green taxis, which raises questions regarding the validity of our identification strategy. To support the validity of our identification strategy and our main conclusion on the impact of the NYC TLC’s decision on street-hail transportation outside of the Manhattan Core, we utilized green taxi trips. Specifically, we created two additional datasets, consisting of green taxi trips in CR NE and CR NN which were matched with (i) yellow taxi trips before the launch of green taxis (referred to as the GvYB dataset) and (ii) yellow taxi trips after the launch of green taxis (referred to as the GvYA dataset; see Table 10 of Appendix A).Footnote 13

We used matched trips in the GvYB and GvYA datasets and ran regressions using a modified version of Eq. (1); the visualized magnitudes of the coefficients are depicted in Fig. 4 (for the full regression results, see Table 11 in Appendix A).

Fig. 4
figure 4

Specification Curves Comparing Green and Yellow Taxi Trips for Distance, Duration, and Fare. Notes: Each column shows estimations from a separate regression with a combination of control variables and competitive routes. The black-filled (grey-filled) dots represent the presence (absence) of the respective control variables in the regression. The blue dots represent the models with all control variables (see the regression results in Table 11 of Appendix A)

Figure 4 demonstrates that the sizes of coefficients for green taxi trips in CR NN in Panel A of Fig. 4 are larger than those in Panel B. In other words, within matched trips in CR NN, green taxis had longer trip distances and trip durations and higher trip fares in the GvYB dataset than in the GvYA dataset.Footnote 14 This finding (indirectly) indicates that yellow taxis had relatively longer trip distances and trip durations and higher trip fares in CR NN after the launch of green taxis. This finding supports the validity of our identification strategy, and is consistent with (part of) H2a.

9 Discussion and conclusion

9.1 Discussion

The question of whether markets benefit from increased competition has been controversial (Gilbert, 2006), as more competition may reinforce incentives to reduce service quality (see also, e.g.,Brekke et al., 2011; Gozvrisankaran & Town, 2003). For many markets (especially those in which the consumer has difficulty determining the value of goods or services received), it is unclear whether consumers always benefit from increased competition, and past literature shows mixed results regarding the effects of competition (Balafoutas & Kerschbamer, 2020; Johansen & Zhu, 2014; Krachler et al., 2022).

This study revealed that increased competition does not always help to overcome market issues. We examined the policy implemented by the NYC TLC, which resulted in an increase in competition in the NYC taxi market. Using trip information for taxis in NYC that covers the NYC TLC’s decision to launch green taxis, which led to an increase in the number of street-hail vehicles (and consequently, an elevation in competition in specific areas), we sought answers to several questions. Does an increase in competition improve the NYC taxi market? Or does it provoke (higher levels of) fraudulent behavior among taxi drivers? Does the intensity of competition affect the degree of change in fraudulent behavior? Do taxi drivers’ assumptions regarding consumers’ information about the optimal route affect the degree to which the consumer is defrauded (e.g., trips to/from office buildings versus those to/from hotels)?

In a study of millions of matched standard-rate fare taxi trips in NYC (i.e., facing similar daily traffic congestion and having the same pickup and drop-off locations), we found significant evidence of a change in the fraudulent behavior of yellow taxis after the launch of green taxis, with the highest change in the overtreatment of passengers on routes where yellow taxis compete with green taxis for both pickups and post-drop-off pickups. Our results show that the degree of change in defrauding passengers depends not only on the characteristics of the trip (e.g., whether yellow taxis face competition in pickups/post-drop-off pickups or whether they are allowed pickups post-drop-off) but also on the drivers’ assumptions regarding passengers’ familiarity with the optimal route.

Our results provide insights for policymakers who are required to assess the extent of competition in a market and those tasked with promoting it (Littlechild, 2018; Warner & Hefetz, 2008). When tasked with regulating or overseeing market systems, they must understand the range of implications that their policies would impose on both sellers and consumers. Among such regulatory concerns, policymakers should address seller fraud, which imposes significant monetary costs on consumers and on the larger society. This is especially true in markets for credence goods, where consumers are not positioned to effectively determine the quality or value of a good or service and must rely on policymakers to protect for their interests.

Our results indicate that competition intensity does not always help consumers. This finding is important for regulators, who have often not effectively promoted the need for competition (see, e.g.,Littlechild, 2018; Podnar et al., 2007). The launch of green taxis in NYC is a good example of regulation-based competition that did not have the full effect desired. The launch of green taxis primarily aimed to provide street-hail services in areas traditionally less popular for yellow taxis (Bloomberg & Yassky, 2014; NYC TLC, 2014). However, in practice, the introduction (and the consequent increase in competition in parts of NYC) affected the taxi market in NYC in unanticipated ways, which resulted in certain types of consumers being more subject to taxi fraud.

The effect of regulations is an important issue in a broad range of public policy areas. Regulations apply to a wide spectrum of markets, including healthcare, education, telecommunications, television broadcasts, pharmaceuticals, energy, airport and railway markets, and taxis (e.g.,Bilotkach & Hüschelrath, 2019; Bollinger, 2015; Broman & Eliasson, 2019; Epple et al., 2016; Johansen & Zhu, 2014). However, applying or enforcing regulations may affect the market in different ways and sometimes negatively. Therefore, in line with our findings, we suggest that when public policymakers want to impose changes that increase the consumer surplus, they must carefully assess the market situation before making concrete decisions regarding the transition from a regulated monopoly to deregulation and competitive entry into the markets. For example, the short- and long-term consequences of decisions such as adding doctors to the healthcare system should be carefully considered (Daily Mail, 2021).

From an economic perspective, poorly examined public policy decisions can impose huge costs on consumers. For example, our study on the yellow taxi market in NYC found increased fraud for trips subject to standard-rate fares in boroughs other than Manhattan (CR NN). Although the level of fraud was small (increasing the trip fare by approximately $1.50), it translates to an additional approximately $6.8 million within the seven months following the launch of green taxis.

Our results also have direct implications for the restrictions imposed on peer-to-peer ridesharing platforms (e.g., Uber) for picking up passengers in specific areas. For instance, in some countries, Uber drivers are not allowed to pick up passengers from airline terminals. Fewer restrictions on their presence may help decrease fraud in the taxi market in many ways, for example, by making second opinions more accessible, which is supported by past studies indicating that second opinions help to reduce overtreatment (Mimra et al., 2016b).

9.2 Limitations and future research

Despite its important findings, the current study has some limitations. First, identifying fraudulent behavior among sellers is challenging, and few studies have investigated the existence and magnitude of sellers’ fraudulent behavior in a real-world setting using actual secondary data (Rasch & Waibel, 2018). Generally, testing for fraud (or changes in it) among sellers in practice requires variation in the data (e.g., an abrupt change in policy) or the availability of a counterfactual (i.e., the service that the seller would have provided had the buyer been adequately informed). Although such variation is not available in our dataset, we checked the validity (and robustness) of our results in many ways (see Validity and Robustness of the Results section).

Second, our analysis does not consider the effect of peer-to-peer ridesharing, particularly Uber’s one-year e-hail pilot program in NYC. We believe that over the period of our observation, the launch of Uber resulted in a limited change in the behavior of NYC street-hail taxis for several reasons: (i) in the time period of our analysis, MFIN stock prices were at their highest since 2010 (see Fig. 6 of Appendix E), indicating that the street-hail taxi market in NYC remained healthy during the study period. Similarly, the highest corporate (and private) medallion sales in 2012 − 2014 were recorded in May and June 2013, and the second highest occurred in 2014 (NYC Bloomberg & Yassky, 2014; TLC, 2012, 2013); (ii) peer-to-peer ridesharing applications offer "pre-arranged" rides through smartphones (as opposed to the street-hail service that we considered in our analysis), and during the period of our study, accounted for a smaller market share. Furthermore, given that there were no geographic restrictions, such as an exclusionary zone for Uber’s activity (Uber Blog, 2013), its launch had a uniform effect on yellow taxi trips in NYC.Footnote 15 Thus, we believe that our main conclusions continue to have merits.

Third, our analysis does not directly account for changes in traffic levels and street/roadway construction in NYC. However, we believe that they do not affect our main conclusions for several reasons: (i) the absence of a significant change in the distance of trips subject to flat-rate fares provides some evidence against arguments such as significant changes in traffic and street/roadway construction during the time period of our analysis; and (ii) in fact, NYC experienced a downward citywide traffic trend during the time period of our analysis (see the citywide traffic trend from 2013 to 2014 reported by the NYC Department of Transportation (2016)).

In summary, our study demonstrates that changes in credence markets, such as an increase in competition resulting from an increase in the number of sellers, can significantly affect the fraudulent behavior of sellers. Policymakers should handle these markets with caution and carefully consider the consequences of their policies on sellers’ behavior and the impact it will have on consumers.