1 Introduction

Shared mobility has become increasingly popular in recent years as a way to reduce traffic congestion and improve access to transportation (Machado et al. 2018; Prencipe et al. 2022). The shared mobility market, which includes services such as car sharing, bike sharing, and e-scooter sharing, has the potential to offer a convenient and affordable alternative to traditional forms of transportation (Hensher et al. 2020). The industry is undergoing rapid transformation, with a strong emphasis on digitalization, connectivity, and customer-centricity (Siegfried 2022). To thrive in this dynamic landscape, providers of shared mobility must develop innovative business strategies that align with the ever-evolving customer expectations.

Despite the widespread adoption of shared mobility, there has been a paucity of research specifically examining the quality of these services. The existing predetermined frameworks of quality determinants for products or services (such as those proposed by Garvin 1987; Parasuraman et al. 1988) may not fit the contexts where tangible and intangible elements must be taken into account concurrently (Mastrogiacomo et al. 2021). This is particularly relevant for Product-Service Systems (PSS), such as shared mobility services, which require specialised approaches (Barravecchia et al. 2021).

Within the limited existing research, a few studies analyzed the quality of specific shared mobility modes. However, despite these efforts, a significant gap remains in the literature: the absence of a comprehensive tool capable of analyzing the quality of shared mobility in a holistic manner. It’s therefore essential to deepen our understanding of shared mobility quality, especially for stakeholders who wish to enhance benefits and minimize the negative impacts of these services.

From this perspective, the primary aim of our study is to construct a theoretical foundation that deepens our comprehension of shared mobility quality. To this end, this study introduces the MOBI-Qual framework, a comprehensive collection of shared mobility’s quality determinants. This framework was developed through a consolidation of quality determinants identified in different digital Voice-of-Customer datasets, related to the three most prevalent shared mobility modes: car sharing, bike sharing, and e-scooter sharing.

Although the analytical methods employed in this study, grounded in the Topic Modelling algorithm, are not novel, their concurrent application across various domains of shared mobility is worthy of attention. The proposed approach produced a holistic model that captures the complexity of different shared mobility landscapes.

The remainder of the paper unfolds as follows. Section 2 offers an in-depth introduction to shared mobility, also incorporating references to prior studies on the quality of shared mobility. Section 3 discusses the theoretical background underpinning the digital Voice-of-Customer (VoC) analysis employed in this research. Section 4 outlines the analytical methodology employed in this study. Following this, the proposed MOBI-Qual framework, along with the procedure pursued for its development, is articulated in Sect. 5. The paper concludes with a discussion of the study’s contributions, limitations, and possibile future research in the final section.

2 Shared mobility: modes and quality studies

The sharing economy leverages technology to facilitate peer-to-peer exchange of underutilised goods or services (Schlagwein et al. 2020). This process is facilitated by an intermediary platform, and importantly, does not involve the transfer of ownership, enabling resources to be shared rather than owned outright. This model has been adapted to a variety of sectors, including mobility. Shared mobility allows users to access shared vehicles on a short-term basis as needed and includes a range of services such as car sharing, bike sharing, ride sharing and on-demand ride services (Fikar and Hirsch 2018; Shaheen et al. 2020).

The growth of shared mobility has been driven by advances in digital technology and the implementation of GPS-based systems, which enable providers to monitor the position and status of shared vehicles and allow users to easily locate the nearest available vehicle using a mobile app.

A review of the existing literature revealed a lack of tools and methodologies for understanding and managing the quality of shared mobility services. The few studies that addressed this topic had also some limitations, such as small sample sizes (Ashqar et al. 2022; Beirigo et al. 2022; Kumar et al. 2022) and a lack of consideration for different shared mobility modes (Barravecchia et al. 2020a, b; Mastrogiacomo et al. 2021). This gap in the literature highlights the need for a more comprehensive and systematic approach to assessing the quality of shared mobility.

To gain a comprehensive understanding of the quality of shared mobility services, this study jointly examined the three most popular shared mobility modes: car sharing, bike sharing, and e-scooter sharing (Castellanos et al. 2022).

2.1 Car sharing

Car sharing users can access a fleet of shared cars on an as-needed basis and pay a usage and/or membership-based fee to get the benefits of a private vehicle without the related cost of ownership (e.g., fuel, maintenance, insurance) (Shaheen et al. 2020). Car sharing is developing extremely dynamically, not only in terms of users but also in terms of service schemes, application areas, organisational solutions, and players in the market. Various car sharing schemes have been developed (Ferrero et al. 2018); the most prevalent are:

  • Roundtrip car sharing, also known as station-based car sharing, requires cars to be returned to the same location from where it was picked up.

  • One-way car sharing, also known as point-to-point or free-floating car sharing, allows members to pick up and drop off a car in different locations within a delimited area.

Car sharing has been shown to have a number of positive effects on individuals, communities, and the environment. For individuals, car sharing can provide a convenient and cost-effective transportation option, especially for those who do not own a car or do not use it frequently. By allowing individuals to access a car when needed rather than owning one, car sharing can help to reduce the financial burden of car ownership and maintenance (Qian et al. 2022). It can also reduce the need for personal car storage, freeing up space in urban areas where space is often limited. At the community level, car sharing can help to reduce traffic congestion and air pollution, as fewer cars are needed to meet the transportation demand of a given population. Car sharing can also have positive environmental impacts, as it can help to reduce greenhouse gas emissions by reducing the number of cars on the road. This can contribute to the efforts to combat climate change and improve air quality. In recent years, vehicle electrification is also affecting car sharing by further improving its environmental sustainability performance (Prencipe et al. 2022).

Car sharing has seen a steady rise in popularity in urban areas, with a reported 38 million subscribers in 2018 and a total of 198,000 vehicles in 47 different countries. In the same year, car sharing generated over $9 billion in revenue, and it is expected to reach over $14 billion by 2024 (Statista 2021). These figures demonstrate the significant growth and potential of the car sharing industry in recent years and suggest that it will continue to be a major player in the urban transportation landscape in the coming years.

Mugion et al. (2019) found that the perceived quality of a car sharing service plays a significant role in determining a person’s intention to use the service. Mattia et al. (2019) developed a framework for understanding the psychological factors that influence the intention to re-use free-floating car sharing services. Barravecchia et al. (2020a, b) also proposed a preliminary framework for understanding the determinants of the quality of car sharing services. These studies suggest that the quality of car sharing services is an important feature for both users and providers of these services.

2.2 Bike sharing

Bike sharing is a sustainable and increasingly popular form of transportation that is being adopted by more and more cities around the world. Public administrations are increasingly offering bike-sharing services to their citizens as an alternative to traditional modes of transportation. Bike sharing allows users to rent bikes on an as-needed basis, either from a network of bike-sharing stations or from a designated area in the case of dockless bike sharing (Shaheen et al. 2020).

Bike sharing can have a number of positive effects. Bike sharing can help reduce traffic congestion and pollution in urban areas, as it provides a convenient and sustainable alternative to driving (Wang and Zhou 2017). This can improve air quality and public health, as well as reduce noise levels in cities. Bike sharing can also have economic benefits, as it can provide a low-cost form of transportation for those who may not have access to a car or who choose to use a bike as an alternative to public transit (Qiu and He 2018). Furthermore, bike sharing can encourage physical activity, as biking is a form of exercise that can support a healthy lifestyle (Otero et al. 2018). Overall, bike sharing can be a valuable tool for promoting sustainability, improving public health, and enhancing the quality of life in urban areas.

The bike-sharing industry experienced significant growth in recent years. In 2019, revenues for bike-sharing services reached approximately $5 billion, and by 2020, this figure had grown to $6.9 billion, with a total of 669 million users (Statista 2021). These numbers are expected to continue to rise in the coming years, with forecasts predicting that revenues for bike-sharing services could reach $9.6 billion by 2024, serving a total of 860 million users. (Statista 2021).

Most previous research on bike-sharing focused on forecasting demand and optimising dock locations. However, there have also been a few studies that have examined the quality of bike-sharing services. Bordagaray et al. (2012) developed a methodology for modelling the perceived quality of bike-sharing services in order to identify the variables that influence user perceptions of quality and their relative importance. Their study found that safety and information were the most important factors in determining the perceived quality of bike-sharing. Hsu et al. (2018) proposed a model for improving the quality of bike-sharing services based on 24 criteria derived from the SERVQUAL model. Morton (2018) used the SERVPERF model to develop a set of items for measuring the quality of bike-sharing services and user satisfaction. Similarly, Ma et al. (2019) proposed a list of service quality evaluation criteria based on the SERVPERF model. These studies highlight the importance of considering the quality of bike-sharing services and suggest that there are several factors that can influence user perceptions of quality.

2.3 E-scooter sharing

E-scooter sharing is a form of transportation in which users can rent electric scooters (e-scooters) on an as-needed basis. E-scooters are small, electric vehicles that can be easily rented and dropped off at various locations around a city. To use an e-scooter sharing service, users typically need to download a smartphone app and create an account. They can then locate and unlock an available e-scooter using the app, and pay for their ride using an electronic payment method. E-scooter sharing is becoming increasingly popular in urban areas as a convenient and environmentally friendly way to get around. It can be especially useful for short trips or as a last-mile transportation option to connect with public transit. E-scooter sharing has the potential to reduce traffic congestion and air pollution, and can provide a fun and affordable alternative to driving or using public transit (Wanganoo et al. 2022).

Despite some initial hesitation due to concerns about safety, the growth of e-scooter sharing has been rapid in recent years (Lee et al. 2021). E-scooter sharing is a relatively new form of shared mobility, with the first e-scooter sharing platform having been released in 2017. That year, e-scooter sharing generated only $11 million in revenue. However, the rapid expansion of the industry led to significant growth, with revenues reaching $1.34 billion in 2021 and the number of users surpassing 64 million (Statista 2021). The convenience and environmental benefits of e-scooter sharing, as well as the increasing availability of e-scooters in urban areas, are likely contributing factors to this growth.

There has been relatively little research on the quality of e-scooter sharing services. In a recent study, Hamerska et al. (2022) explored the factors that influence customer satisfaction or dissatisfaction in e-scooter sharing. They identified the features that shape the quality of e-scooter sharing and adapted the SERVQUAL model to define the assessment items for their study. Further research is needed to fully understand the factors that contribute to the quality of e-scooter sharing and how they can be effectively managed to provide optimal service to users.

3 Digital Voice-of-Customer

Digital VoC channels, which encompass platforms such as customer review websites, social media posts, and online discussions in blogs or communities, are frequently used by consumers to express their opinions about various products and services (Hennig-Thurau et al. 2010; Palese & Usai 2018).

The concept of digital VoC can be characterized by three key attributes (dos Santos 2021):

  • It requires a personal contribution or creative effort, indicating the involvement of individual consumers in generating feedback;

  • The generated content should be accessible to the public, or at least to a significant group of people, thus contributing to wider discourse;

  • It is expected to be created outside the excercise of professional obligations, ensuring the independence and authenticity of the feedback.

In the domain of digital VoC, textual customer reviews have gathered significant attention from researchers. Such reviews offer detailed insights into customer experiences, viewpoints, and sentiments related to specific products or services, thus serving as a valuable resource for the analysis and comprehension of customer perceptions (Subhashini et al. 2021). By analyzing the content of these reviews, it is possible to uncover common themes, identify strengths and weaknesses of products or services, and gain a deeper understanding of customer preferences and expectations (Qi et al. 2016). Furthermore, the accessibility and ease of sharing textual customer reviews have made them an influential source of information for prospective customers (Maslowska et al. 2017).

3.1 Digital VoC analysis approaches

The broad availability of textual digital VoC facilitates large-scale data collection. Given the sheer volume of digital VoC, the focus has been on developing and applying methods that can effectively analyse its content, with a particular emphasis on extracting the topics discussed.

Topic modelling has emerged as a widely utilized tool for digital VoC analysis. When applied to digital VoC, topic modelling algorithms proved to be able to extract the latent quality determinants of products or services, i.e., the elements can significantly influence perceived quality (Özdağoğlu et al. 2018; Barravecchia et al. 2020a, b; Mastrogiacomo et al. 2021). It is reasonable to assume that if a topic is discussed, it is important to the customer and thus critical to his/her perception of quality.

In recent years, novel methods for digital Voice-of-Customer (VoC) analysis have emerged, employing advanced deep learning techniques (Ullah et al. 2023). Notably, the application of BERT (Bidirectional Encoder Representations from Transformers) embeddings combined with K-means clustering has gained considerable attention (Devlin et al. 2018; Catelli et al. 2022). BERT, a pre-trained deep learning model, excels in natural language processing tasks by capturing the semantic context of words and phrases in a language (Bharadiya 2023). It transforms digital VoC data into high-dimensional vector representations, or “embeddings”, which serve as numerical proxies of the reviews, encapsulating the context and sentiment of the customer’s voice. K-means clustering is then employed to group these BERT embeddings, effectively segmenting reviews into distinct topics.

For the specific requirements of this study, a traditional topic modeling approach was selected. In subsequent phases of the study, techniques within the BERT chain will be explored. The outcomes will subsequently be compared with those obtained using the topic modelling approach.

3.2 Applications of digital VoC analysis

Digital VoC analysis has found significant application in the field of product/service improvement and development (Trenz and Berger 2013). Researchers utilized digital VoC analysis to identify the quality determinants in various products and services (Jelodar et al. 2019), such as hotels and travel services (Ding et al. 2020; Park et al. 2020; Putranto et al. 2021; Amat-Lefort et al. 2022; Shang et al. 2022), technical services (Papadia et al. 2022), brokerage services (Yang and Fang 2004), mobile apps (Kim et al. 2022) and digital devices (Almars et al. 2019).

Some attempts have been made to apply digital VoC analysis tools in the context of shared mobility. Lock and Pettit (2020) conducted a study that explored the potential of using social media as a means to engage with citizens and customers of public transportation systems. Their research involved analyzing social media data through topic modelling and sentiment analysis algorithms, combined with passively collected big data forms, to gain insights into customer experiences and feedback regarding operational transport performance. Kühl et al. (2020) proposed an automated approach to prioritize and quantify customer needs using social media data specifically in the context of e-mobility. Mastrogiacomo et al. (2021) conducted research focused on identifying the determinants of quality in car sharing services. All these studies highlight the potential of digital VoC analysis to enhance our understanding of shared mobility quality.

4 Methodology for identifying quality determinants

For the purpose of topic modeling in this study, the structural topic modelling (STM) (Roberts et al. 2019a, b) was employed. The primary reasons guiding the choice of structural topic modeling (STM) include:

  • Probabilistic Modeling: STM produces a probabilistic model, capturing the varied content across multiple topics for each digital VoC record analyzed. This allows for a more realistic and comprehensive representation of data.

  • Reliability: STM has gained trust in digital VoC studies due to its consistent and replicable results across varied research settings. Its versatility in handling different types of digital VoC data underscores its reliability in extracting meaningful topics.

  • Input Parameters: unlike other approaches that require a calibration process and a set of parameters to be specified, STM requires only the number of topics to be defined. This ensures easier setup and reduces the risk of over-tuning or bias.

This approach provided insights into the experiences and opinions of actual users of shared mobility services, offering a comprehensive understanding of their quality determinants (Barravecchia et al. 2020a, b, 2021; Mastrogiacomo et al. 2021; Amat-Lefort et al. 2022). The findings of this analysis will be used in subsequent sections to define a common quality framework applicable to all shared mobility modes.

The same analytical methodology was applied separately to each digital VoC dataset (pertaining to bike sharing, e-scooter sharing, and car sharing) to facilitate a direct comparison of the results. The applied methodology can be divided into five steps: (i) dataset extraction, (ii) pre-processing, (iii) topic modelling, (iv) labelling, and (v) validation of results. Figure 1 reports a synthetic scheme of the methodology described in detail in the following Sections.

Fig. 1
figure 1

Schematic representation of the digital VoC analysis methodology

4.1 Dataset extraction

The primary method for extracting digital Voice of Customer (VoC) data is through web scraping, which involves the automated and large-scale extraction of information from websites (Diouf et al. 2019).

To ensure comprehensive data collection, the leading shared mobility providers have been identified based on factors such as market share, geographical reach, and diversity of offerings. A different range of sources was selected to ensure a comprehensive dataset. The variation in sources can be attributed to the distinct online presence and review patterns observed for each type of shared mobility service. For instance, some sharing services may be recognized as more popular and consequently receive more reviews on specific platforms. Such multiplicity of sources guarantees that a broad spectrum of customer feedback is captured.

In detail, the software Octoparse (Octopus Data Inc. 2023) was utilized for the web scraping process. The scraper was programmed to target:

  • Review text: the main content of a review, where customers express their experiences and opinions about the service.

  • Provider: the shared mobility service provider that the review is about.

  • Website: the specific website from which the review was extracted.

  • Type of sharing: the specific type of shared mobility service being reviewed (car sharing, bike sharing, e-scooter sharing).

  • Operating country: the country or region in which the provider operates.

  • Release date: The date when the review was posted.

Table 1 details the specifications of the extracted digital VoC datasets. The authors made the complete dataset publicly available via the Harvard Dataverse platform (Barravecchia 2023). The dataset includes all the analyzed digital VoC records, each paired with its relevant metadata, as detailed in the preceding bullet points.

Table 1 Specifications of the extracted digital VoC datasets concerning bike sharing, e-scooter sharing and car sharing

4.2 Pre-processing

Before implementing topic modelling algorithms, it is necessary to process the text to ensure and enhance the algorithm’s performance. This process is generally referred to as text pre-processing (Hickman et al. 2022). Pre-processing allows working with smaller data sizes, decreasing the time required for topic modelling and improving the quality of the results (Uysal and Gunal 2014). The pre-processing phase included the following operations (Mastrogiacomo et al. 2021):

  • Converting text to lowercase.

  • Removal of punctuation.

  • Removal of stop words, which are all words functional to the language but lacking information, such as articles or prepositions.

  • Stemming, i.e., the operation through which the words are brought back to their lexical root.

  • Removal of rare words, which are often the result of typing errors or concern uncommon subjects.

4.3 Topic modelling

The strctural topic model (STM) algorithm was used to infer the major topics addressed within the analysed datasets of digital VoC. To this end, R Software, and the STM library were used (Roberts et al. 2019a, b). The code in R language containing the main functions used in the analysis is provided in Appendix 1. Recent papers and overviews of the STM algorithm exist for readers unfamiliar with its background (Roberts et al. 2014; Roberts et al. 2019a, b).

Topic modelling algorithms require to indicate the number of topics to be extracted (K). This parameter significantly impacts the quality and interpretability of the results. When K is small, the identified topics tend to be broad and general. However, as the number of topics increases, they become more specific and focused. While a higher value of K may capture finer-grained details, it can also lead to topics that are less coherent or meaningful (Sbalchiero and Eder 2020).

For the evaluation of the number of topics, the held-out likelihood was selected as the main indicator of the effectiveness of the model (Yi and Allan 2008). The held-out likelihood is a metric on the goodness of performance of the topic model calculated using a portion of the textual corpus as the training set to develop the model and the remaining portion of the corpus (held-out documents) as the test set (Wallach et al. 2009).

The number selected for maximising the held-out likelihood was 22 topics for the bike sharing, 24 for e-scooter sharing, and 20 for car sharing. Figure 2 depicts the variation of held-out likelihood as the number of topics ranges from 5 to 50. The subsequent phase of human validation of the results, as detailed in Sect. 4.5, confirmed the appropriateness of the chosen number of topics to be extracted.

Fig. 2
figure 2

Held-out likelihood of the topic model varying the number of topics (K): A bike sharing, B E-scooter sharing, C car sharing

Several factors may contribute to why the three topic models can have different numbers of topics. One factor could be the size and structure of the datasets. For example, a larger dataset with a greater variety of information may result in a greater number of topics being identified. Additionally, the language and style of the digital VoC may also affect the number of topics.

Once the number of topics was determined, a topic modeling algorithm was applied. The algorithm produced several outputs, including the distribution of topics within the document (topical prevalence) and the probability that a word belongs to a specific topic (topical content). In line with previous research, the extracted topics served as quality determinants for the analyzed shared mobility services.

4.4 Labelling

A panel of experts assisted in categorizing and understanding the various factors that contribute to the quality of shared mobility. This panel was composed of six experts, including researchers and practitioners. The experts were selected based on their expertise, combining insights from both academic scholars and industry professionals to provide a comprehensive perspective. Four were academic scholars whose selection was based on their research and contributions to shared mobility and related fields. In contrast, the two industry professionals were chosen for their practical experience in the shared mobility sector. Occupying senior roles within their respective organizations, they participated in the development and execution of shared mobility projects.

The panel of experts’ main task was to provide a representative label for each quality determinant, thereby facilitating their interpretation. The labeling process was informed by considering:

  • The keywords with the highest probability of belonging to each determinant;

  • The digital VoC records for which the model estimated a higher probability of belonging to a given quality determinant.

To ensure reliability in the labeling process, a structured protocol was followed. Each expert independently evaluated and assigned labels to the quality determinants. Afterward, the labels assigned by the experts were compared and discussed in a consensus meeting. In the few instances where discrepancies were identified, an iterative process was implemented to encourage open discussion and consider additional data. This approach facilitated the negotiation of differing viewpoints and fostered the collective development of a consensus. Only when all experts were in agreement, a final label was assigned to a quality determinant. This process, combining independent analysis with collaborative discussion, helped to ensure that each label was robustly scrutinized and agreed upon. As a result, the potential for bias or error was significantly minimized, enhancing the reliability of the findings.

Table 2 reports the labels of the identified determinants. A full description of the determinants and related keywords is reported in Appendix 2 in Tables 6, 7 and 8, respectively, related to bike sharing, e-scooter sharing and car sharing.

Table 2 Quality determinants of bike sharing, e-scooter sharing and car sharing provided by the topic modelling algorithm

Figure 3 provides an overview of the Mean Topical Prevalence assigned to all quality determinants based on the analysis of digital VoC for the three different shared mobility modes: bike sharing, e-scooter sharing, and car sharing.

Fig. 3
figure 3

Mean topical prevalence for the identified quality determinants (see the labels in Table 2). A Bike sharing. B E-scooter sharing. C Car sharing

The functionality and reliability of the mobile application emerged as one of the most discussed quality determinants across all three shared mobility modes. This highlights the critical role that user-friendly and reliable applications play in shaping customer experiences and satisfaction in shared mobility services. In the context of Bike sharing, topics related to “Use rates”, “Payment”, and “Customer service” also surface prominently in discussions. For E-scooter sharing, prevalent determinants are “Charging policy”, “Use rates issues”, and “Ease of use”. With regards to car sharing, the discussion is mainly addressed to quality determinants “Customer service responsiveness” and “Convenience”. Overall, this analysis provides a detailed understanding of how different aspects of each shared mobility service affect user perception and satisfaction, offering valuable insights for service providers aiming to improve their offerings.

4.5 Human validation of results

While advanced algorithms offer powerful tools for topic modelling, human validation remains an important step in ensuring the reliability of the generated outputs (Chang et al. 2009). Due to the complex nature of language, it is through the lens of human understanding, that it is possible to verify the accuracy of the results produced by the topic modelling algorithm. Consequently, human validation was employed in this study as a mechanism to confirm the outputs produced by the topic modelling algorithm.

To this end, the classification carried out by human evaluators on a random sample of one hundred digital VoC records was compared with the classification resulting from the topic modelling algorithm (Barravecchia et al. 2021). The comparison allowed a set of performance validation indicators (detailed in Appendix 3) to be calculated. Table 3 shows these indicators calculated for the three analysed datasets (bike sharing, e-scooter sharing, and car sharing), with an indication of experimental target values (Barravecchia et al. 2021). Overall, these results indicate that the developed topic models describe the contents of the analysed digital VoC appropriately.

Table 3 Performance validation indicators. Target values provided by Barravecchia et al. (2022)

Moreover, following the methodology proposed by Chang et al. (2009), two distinct evaluation measures, word intrusion and topic intrusion, were employed to ascertain the coherence and relevance of the topics identified within the respective models.

For the Word Intrusion assessment, the top 8 representative words were selected for each topic (as detailed in Table 6 in Appendix 2), and an ‘intruder’ word, not inherent to the specific topic, was introduced. Subsequently, ten human evaluators were tasked with identifying this inserted word.

In the Topic Intrusion procedure, a sample of 100 reviews was randomly chosen from each dataset. The most probable topics were associated with each of these reviews. An ‘intruder topic’, not originally part of the top probable topics for the review, was then added. These modified topic sets were then presented to the ten human evaluators, and the detection of the least congruent topic for each review was expected of them.

Following these methodologies, the model’s performance metrics were derived as follows:

$$Word\;intrusion\;success\;rate= \left(\frac{Number\;of\;correct\;word\;intruder\;identifications}{Total\;number\;of\;evaluation}\right)\times 100$$
$$Topic\;intrusion\;success\;rate= \left(\frac{Number\;of\;correct\;topic\;intruder\;identifications}{Total\;number\;of\;evaluation}\right)\times 100$$

where the Number of correct word intruder identifications and Number of correct topic intruder identifications represent the instances where human evaluators accurately chose the word or topic intruder; the Total number of evaluations signifies the number of attempts made by evaluators to identify an intruding word or topic.

The results, as illustrated in Table 4, indicate a high success rate in both the word and topic intrusion tests across all three models.

Table 4 Results of word intrusion and topic intrusion tests (Chang et al. 2009)

Both validation methods employed in this study converge in their assessments, underscoring the adequacy of the developed topic models.

5 MOBI-Qual framework

This section is dedicated to detail the methodology and outcomes of the consolidation process employed to identify the set of quality determinants common to all shared mobility modes. This led to the development of a novel model, henceforth referred to as the MOBI-Qual framework.

5.1 Definition of the MOBI-Qual framework

The quality determinants of different shared mobility modes were compared to identify connections and overlaps. The process of constructing the MOBI-Qual framework is summarised in Fig. 4. In order to cluster the quality determinants of the three considered modes of shared mobility, two steps were taken. First, the similarity of the identified quality determinants was assessed by comparing the descriptive keywords. From this comparison, a preliminary clustering was obtained. Second, the panel of experts in shared mobility reviewed and refined the results of the first step, resulting in the final list of MOBI-Qual macro determinants.

Fig. 4
figure 4

Schematic representation of the process for the definition of the MOBI-Qual framework

Specifically, in the first step of the clustering procedure, the topical content was taken into account. The topic modelling algorithm associates each determinant with a multinomial probability distribution specifying the relative weight of each keyword. It is possible to represent this distribution with the topical content vector:

$${TC}_{d}= \left[{tc}_{1,d},{tc}_{2,d},{tc}_{3,d},\dots ,{tc}_{w,d},\dots ,{tc}_{W,d},\right]$$

where \({tc}_{w,d}\) is the weight associated to the w-th keyword of the vocabulary in the d-th quality determinant (topic); \(w\in \left\{1,\dots ,W\right\}\) are the keywords of the vocabulary related to the digital VoC collection; \(W\) is the total number of keywords contained in the digital VoC vocabulary; \(d\in \left\{1,\dots ,D\right\}\) are the quality determinants identified by the topic modelling algorithm; \(D\) is the total number of identified quality determinants.

In order to compare the topical content vectors from different topic models, it is necessary to establish a common vocabulary that includes all the relevant keywords. In this study, the vocabularies of the topic models for car sharing, bike-sharing, and e-scooter sharing were combined to create a shared vocabulary. The resulting topical content vectors included information about all the keywords in the common vocabulary, regardless of whether they were originally present in the individual topic model vocabularies. If a keyword was not present in the original vocabulary, it was assigned a weight of zero in the topical content vector. This allowed for the comparison of topical content vectors from different topic models.

The degree of similarity between two quality determinants was calculated as the cosine of the angle between the two topical content vectors through the scalar product:

$${\begin{array}{c}Degree \\ of\;similarity\end{array}}_{i,k}= \frac{\langle {TC}_{i}, {TC}_{k}\rangle }{\left|{TC}_{i}\right| \cdot \left|{TC}_{k}\right|} =\frac{{\sum }_{w=1}^{W}\left({tc}_{w,i} \cdot {tc}_{w,k}\right)}{\sqrt{{\sum }_{w=1}^{W}{\left({tc}_{w,i} \right)}^{2}}\cdot \sqrt{{\sum }_{w=1}^{W}{\left({tc}_{w,k} \right)}^{2}}} \in [\mathrm{0,1}]$$

where \({TC}_{i}\) and \({TC}_{k}\) are the topical content vectors for the \(i\) and \(k\) quality determinants, and \(\left|{TC}_{i}\right|\) and \(\left|{TC}_{k}\right|\) are the respective vector modules.

The degree of similarity introduced a quantitative criterion to compare different quality determinants by measuring how much the keywords and their associated weights of two quality determinants are similar.

For the sake of clarity, Fig. 5 provides a fictitious example of how the degree of similarity between quality determinants can be calculated.

Fig. 5
figure 5

Fictitious example of calculation of the degree of similarity between quality determinants

In this example, the degree of similarity between determinants 2 and 3 is greater than that between determinants 1 and 3 or 1 and 2. The degree of similarity is equal to 1 when the two quality determinants are identical, while it is 0 when the two compared topical content vectors are perpendicular. In this study, it was observed that a degree of similarity above 0.40 signalled a significant overlap between the keywords of the two tested determinants. The threshold of 0.4 was determined on the basis of the characteristics of the data and the desired level of granularity in the clustering results. This criterion allowed about half of the determinants for the three shared mobility modes under analysis to be grouped (34 out of 66). Table 9 in Appendix 4 shows the results of this preliminary grouping.

Once the preliminary clusters were identified, the panel of experts evaluated the results and further aggregated the semantically related clusters. This analysis provided an aggregation of the determinants of quality related to different modes of shared mobility, which inherently present different vocabularies. Specifically, for each quality determinant, the panel of experts examined: (i) the meaning of the labels, (ii) the list of keywords, and (iii) the most representative digital VoC records. The quality determinants were grouped using affinity diagrams into conceptual macro-quality determinants useful for defining high-level references valid across different shared mobility modes (see Table 5).

Table 5 MOBI-Qual, a framework of quality determinants of shared mobility

5.2 MOBY-Qual macro quality determinants

The described procedure provided the following eleven macro quality determinants for shared mobility: (i) customer service, (ii) app reliability, (iii) charges and fees, (iv) vehicle conditions, (v) sharing benefits, (vi) physical accessibility, (vii) parking area, (viii) safety, (ix) easy-of-use, (x) digital accessibility, and (xi) payments.

Figure 6 represents the Mean Topical Prevalence (MTP) of each macro quality determinant and the contributions of the three shared mobility modes. The MTP represents the prevalence of each determinant, indicating their relative significance within the context of shared mobility. Furthermore, the figure also shows the contributions of the three shared mobility modes to the MTP of each macro quality determinant. This analysis helps identify which macro quality determinants have a stronger impact on certain shared mobility modes.

Fig. 6
figure 6

Mean topical prevalences and contributions of MOBI-Qual macro quality determinant. The contribution of each shared mobility mode to the MOBI-Qual macro quality determinant is calculated by summing up the MTP values of the quality determinants that constitute the macro quality determinant for that mode (see Fig. 3)

The findings suggest that app reliability, sharing benefits, charges and fees, and customer service emerge as the macro determinants that receive the most discussion across all three shared mobility modes. Based on this observation, it can be concluded that these factors are pivotal in shaping customer perceptions and levels of satisfaction.

App reliability is the most discussed quality determinants of shared mobility services (MTP = 0.17). In today’s digital age, the reliability and functionality of mobile applications are paramount in delivering a seamless user experience. Customers place a significant value on apps that are intuitive, stable, and provide accurate information regarding vehicle availability and reservation processes. Mobile applications are the primary interface through which users interact with shared mobility services. They rely on the app to locate and reserve vehicles, track their usage, and access important information.

Sharing benefits, characterized by the convenience, time-efficiency, and resource-savings that shared mobility provides compared to other transportation options, is a significant macro determinant of quality across all shared mobility modes (MTP = 0.16). This finding underscores the value customers place on the accessibility and flexibility offered by shared mobility services. The ability to conveniently access shared vehicles, whether it be cars, bikes, or e-scooters, reduces the need for personal vehicle ownership and promotes a more efficient allocation of transportation resources.

Charges and fees, with a Mean Topical Prevalence equal to 0.15, highlights the importance of transparent and reasonable pricing structures in establishing trust and ensuring that customers view shared mobility as a cost-effective alternative to traditional transportation options. Transparent pricing is essential for fostering trust and confidence among users. Customers expect clear and upfront information regarding the charges and fees associated with using shared mobility services. Hidden costs or unexpected charges can lead to dissatisfaction and erode trust in the service provider.

Customer service emerges as a significant macro determinant of quality in shared mobility services (MTP = 0.10). This finding emphasizes the critical role that customer service plays in shaping the overall customer experience. It highlights the importance customers place on the quality of interactions with service providers throughout their journey, from initial inquiries to issue resolution. Users value customer service that is responsive, prompt, and supportive.

The ease-of-use macro determinant shows that users value the simplicity and user-friendliness of the shared mobility platforms across all modes. This finding highlights the importance of intuitive interfaces, seamless booking processes, and clear instructions for accessing and operating the shared vehicles.

The aspect of physical accessibility, encompassing the availability, distribution, and proximity of vehicles within the service area, is critical for shared mobility quality. This observation underscores the importance of a well-designed fleet management system that ensures a sufficient number of vehicles are strategically placed throughout the service area, allowing customers to find and access them conveniently.

The macro determinant of payments encompasses the processing, accuracy, and reliability of payments, as well as the overall smoothness and efficiency of the payment process. This determinant highlights the importance placed by customers on the reliability of financial transactions within the shared mobility ecosystem. Users expect the assurance that they are accurately charged for the services they utilize, and they value a seamless and efficient payment experience.

The macro determinant of vehicle conditions refers to several important aspects, including cleanliness, tidiness, usability, residual autonomy (such as battery or fuel levels), and the ease of recharging or refueling. This determinant highlights the significance customers place on the overall state and functionality of the shared vehicles. The insight underscores that users value shared vehicles that are well-maintained, clean, and in good working condition. Additionally, the residual autonomy of the vehicles, whether it is the battery levels for electric vehicles or fuel levels for traditional ones, is an essential consideration.

The macro determinant of digital accessibility focuses on the speed and simplicity of the registration process and accessing the mobile application. Users expect a streamlined and intuitive registration process that allows them to quickly create an account and start using the shared mobility service. Digital accessibility is not limited to the initial registration process; it also encompasses the speed and responsiveness of the app. Users value a fast app that provides real-time information about vehicle availability, location, and reservation status.

The macro determinant of safety reflects the concerns customers have regarding the safety of using shared vehicles, the regulations governing their use, and the management of any potential damage or incidents that may occur during a trip. Customers expect shared vehicles to be well-maintained, meeting necessary safety standards, and undergoing regular inspections. In addition, customers are concerned about how incidents or damages are managed by the shared mobility providers. They expect prompt and efficient handling of any issues that arise during their trip, whether it be reporting damages, accidents, or other safety-related concerns.

The macro determinant of parking areas covers the availability of pickup and drop-off locations for shared vehicles, as well as the coverage area where the vehicles can be used and released. While it has a relatively lower contribution (MTP = 0.04), it is by no means less important than other determinants. Parking areas play a critical role in ensuring the convenience and accessibility of shared mobility services. Users expect a sufficient number of designated parking spots strategically located within the service area. Moreover, the coverage area is a crucial consideration for users. They value the flexibility of being able to pick up and drop off shared vehicles within a reasonably wide area, enabling them to conveniently access transportation options near their desired locations.

6 Conclusions

The main objective of the current study was to establish a unified framework that comprehensively defines the quality determinants of shared mobility. The resulting MOBI-Qual framework encompasses eleven determinants that provide a comprehensive and holistic understanding of shared mobility quality. These determinants include: (i) customer service, (ii) app reliability, (iii) charges and fees, (iv) vehicle conditions, (v) sharing benefits, (vi) physical accessibility, (vii) parking area, (viii) safety, (ix) easy-of-use, (x) digital accessibility, and (xi) payments. It was developed using a bottom-up approach, with the definition of quality determinants based on the analysis of extensive sets of textual digital Voice-of-Customers. This approach allows MOBI-Qual to be grounded in the actual experiences and opinions of shared mobility users, providing a deeper insight of the factors that influence the perceived quality of these services. In addition, MOBI-Qual was created by comparing the quality determinants of the most prevalent shared mobility modes, resulting in a common framework that is applicable to a wide range of shared mobility services.

Overall, the MOBI-Qual framework offers a robust and reliable approach for understanding the quality determinants of shared mobility and is a valuable tool for providers of these services looking to enhance the user experience.

The shared mobility sector is undergoing significant changes due to technological advances and shifts in consumer preferences. As a result, new and innovative personal transportation systems are expected to emerge and play a role in the sector evolution. The MOBI-Qual framework can be a valuable resource to support the design of these new forms of shared mobility. By considering the various quality determinants identified in the framework, operators can develop systems taking into account the customers needs and expectations.

The MOBI-Qual framework offers a useful tool for regulators and policy makers to evaluate the quality of shared mobility services and identify areas for improvement. By recognizing the factors that influence the perceived quality of shared mobility, policy makers may develop targeted policies and regulations to enhance the user experience and support the sustainability of these services.

Overall, this study makes several original contributions: (i) it defines the quality determinants specific to the three prevalent modes of shared mobility (car sharing, bike sharing, and e-scooter sharing); (ii) it is the first framework to define a set of quality determinants that can be applied across different modes of shared mobility; (iii) MOBI-Qual can be used for a range of practical purposes, including monitoring the quality of shared mobility using traditional assessment methods (such as questionnaires and interviews) and redesigning shared mobility services to enhance their quality; (iv) The procedure used to define the MOBI-Qual framework provides a practical approach to consolidate different frameworks for determining quality.

One limitation of the study is that it only examined explicit customer needs. More research is needed to determine whether implicit needs are also included in the identified quality determinants. Additionally, further research is required to fully understand how the released period of UGC can influence the development of the framework.

The model proposed in this study offers potential openings for further validation, especially by analyzing digital VoC datasets using alternative text mining approaches, notably BERT or other Large Language Model (LLM).