Introduction

Unilateral spatial neglect (henceforth spatial neglect) is a disabling neurological disorder. Patients fail to report events (in different sensory modalities: visual, auditory, and somatosensory; when imaging a familiar visual scene from a given vantage point), which occur in the side of physical and imaginal space and the body contralateral to the side of the hemispheric lesion (contralesional). These patients also fail to perform actions in those sides of extra-personal and personal (i.e., the body) spaces (Chokron et al., 2008; Heilman et al., 2000; Vallar & Bolognini, 2014; Vallar & Calzolari, 2018; Vallar & Maravita, 2009). In most patients, neglect impairs the left side of the space, contralateral to a right-hemispheric lesion. Right spatial neglect, contralateral to a left-hemispheric lesion, is less frequent and severe (Bisiach et al., 1984; Bowen et al., 1999; De Renzi et al., 1970; Ogden, 1985). The reported incidence of spatial neglect after stroke varies across studies (ranging from about 10 to 90%). This wide range depends on a number of factors, including the operational definition of neglect, the patients’ selection criteria, and the assessment methods (Bowen et al., 2013). Spatial neglect is more frequent in the acute stage post-stroke (Bowen et al., 1999). After about three months, spatial neglect is still present in over 15% of right-brain-damaged patients (Ringman et al., 2004). In stroke patients the presence of spatial neglect predicts poor outcome in functional recovery (Di Monaco et al., 2011), entailing a longer time of hospitalization, greater functional dependency, increased risk of falls and a long-term disability in Activities of Daily Living (ADL) (Barer, 1990; Batchelor et al., 2012; Bernspång et al., 1987; Bowen et al., 2013; Czernuszenko & Czlonkowska, 2009; Nijboer et al., 2013). This results in an overall increase of costs for the health system (Barrett & Houston, 2019).

Spatial neglect is not due to lower-level sensorimotor deficits. Many patients show associated sensorimotor deficits, such as hemiplegia, hemianesthesia and hemianopia. However, a double dissociation (Teuber, 1955; Vallar, 2000) has been found between spatial neglect and sensorimotor deficits: patients may show sensorimotor deficits without neglect which, in turn, may be present in isolation (Bisiach et al., 1986). However, sensory and motor deficits may have higher-order components, brought about by spatial neglect itself (Vallar et al., 1991; Vallar, Sandroni, Vallar et al., 1991a, b). In line with these behavioral findings, the neural correlates of spatial neglect involve damage to higher-order associative areas, in perisylvian regions, including the inferior parietal lobule, the temporo-parietal junction, the posterior part of the superior temporal gyrus, the premotor cortex, white matter fiber tracts connecting these regions, the thalamus and the basal ganglia (Corbetta & Shulman, 2011; Karnath & Rorden, 2012; Ptak & Schnider, 2011; Vallar & Calzolari, 2018).

Spatial neglect may be conceived as a multifaceted syndrome with many clinical manifestations that frequently co-occur, but may manifest also in isolation (Halligan et al., 2003; Vallar, 1998). A primary distinction may be drawn between two sets of abnormalities: defective and productive. Defective signs, in which patients exhibit the absence of the appropriate behavior, requested by the experimental task or the ADLs, are the most investigated components of the syndrome. They include the impairment at attending to and exploring the left side of space, and of objects in it (Albert, 1973; Bisiach et al., 1981, 1983; Bisiach & Vallar, 2000; Gainotti et al., 1972; Rode et al., 1995; Schenkenberg et al., 1980; Stone et al., 1991; Vallar & Perani, 1986). Patients with spatial neglect may also show productive manifestations, that consist in gratuitous actions and delusional beliefs. These behaviors are inappropriate in the clinical setting, with respect to the requests by the examiner (as for extra-personal space: Kleinman et al., 2013; Ronchi et al., 2012, 2016; Rusconi et al., 2002; as for bodily space: Vallar & Bolognini, 2014; Vallar & Ronchi, 2009). Spatial neglect may occur in different dimensions or frames, with reference to the body and body parts (the mid-sagittal plane of the trunk, the head). In egocentric spatial neglect, the neglected side is coded with reference to the body’s trunk, the head, or the eyes. Allocentric spatial neglect refers instead to one side of an object, encoded with reference to its intrinsic coordinates (Howard, 1982; Klatzky, 1998, about these reference frames; see Vallar, 1998; Vallar & Bolognini, 2014 for these different manifestations of neglect). Furthermore, depending on the affected sector of space, spatial neglect may concern personal (the body), near peri-personal (the space surrounding the body, and within the reach of body effectors, primarily the hand, also the foot), and far extra-personal (not within the reach of such effectors) sectors of space (Vallar & Maravita, 2009). Finally, spatial neglect may also concern internally generated visual images (representational or imaginal spatial neglect, see Bisiach & Luzzatti, 1978; Bisiach et al., 1981; Meador et al., 1987; Grossi et al., 1989; Rode et al., 1995). All these manifestations of spatial neglect may occur in association, but dissociations among virtually all signs of the deficit have been observed (Bisiach et al., 1986; Guariglia et al., 1993).

Quantitative and Standardized Assessment (“paper-and-pencil”) with Available Normative data

Up to the mid 1930’s (Weisenberg & McBride, 1935), the neuropsychological method coupled a qualitative, often insightful, analysis of the patient’s deficits with, whenever possible, a post-mortem examination of the patient’s brain, to determine the localization of the cerebral lesion, and to relate it with the observed symptoms and signs. Starting from the seminal observations by Broca about aphasia (Broca, 1861; Vallar & Caputi, 2021), the assessment of the patients’ behavioral deficits was qualitative, and consisted in the accurate observation of the behavioral deficits, which were typically apparent and, due to some peculiar features, had captured the attention of the examiner. The neuropsychological examination became then more and more based on standardized tests, which provide quantitative measures (i.e., scores) of the patients’ performances. Moreover, the performances of brain-damaged patients started to be compared, with the support of statistical procedures, with those of control participants, comparable to patients for demographic and socio-cultural variables, and differing from them only for the presence of the cerebral lesion and the putative deficit under investigation. The research work by the Italian neurologist Ennio De Renzi provides an example of this approach (Arrigoni & De Renzi, 1964; De Renzi, 1982). As to spatial neglect, the quantitative assessment of its clinical manifestations has traditionally relied on paper-and-pencil tests, which includes the tasks described in the following.

1) Target cancellation: to find out and mark targets printed on a sheet of paper. In the Line (Albert, 1973) and Circle (Bisiach et al., 1979; Vallar & Perani, 1986) tests only targets are displayed. In the Bells (Gauthier et al., 1989), Star (Wilson et al., 1987), and Letter (Diller & Weinberg, 1977) tests, targets are shown intermingled with distractors. Tests requiring target vs. non-target discrimination are more likely to detect spatial neglect than those in which only targets are present (Arduino et al., 2002; Vallar, 2000). Cancellation tasks, in which patients are required to discriminate between complete and incomplete targets (i.e., the incomplete ones with a left-sided or right-sided missing portion), allow to distinguish between egocentric vs. allocentric, stimulus-based, types of spatial neglect. In target cancellation tests, when left spatial neglect occurs in an egocentric reference frame (e.g., the mid-sagittal plane of the body), complete and incomplete targets, left-sided with reference to such a frame, are not crossed out. In allocentric, object-based left spatial neglect, targets with a left-sided missing part are not crossed out across the whole display, with reference to coordinates based on the target itself (Bickerton et al., 2011; Mancuso et al., 2015; Ota et al., 2001, 2003). Quantitative scores include the total number of omissions and the difference between the number of omissions in the two sides (left and right) of the display (a sheet of paper). Moreover, these tests may provide qualitative information also about other aspects of the patients’ performance, including: (i) the location of the first target crossed out; (ii) the directional pattern of exploration; (iii) the speed of execution, and (iv) the types of error. Errors include the defective omission of targets, that are not crossed out, and the productive manifestations: simple perseveration (repeated crossing out marks), and complex gratuitous productions (various types of drawings, e.g., a hen, the patients’ signature) (Gandola et al., 2013; Ronchi et al., 2009; Rusconi et al., 2002). Another measure of spatial neglect in cancellation tasks is the “center of cancellation”, consisting in the scaled mean position in the horizontal (left-right) dimension of cancelled targets (Rorden & Karnath, 2010; Toraldo et al., 2017).

2) Bisection of a horizontal line: to mark the location corresponding to the physical mid-point of a line, which is then divided it into two segments. In the standard version of line bisection patients set the mid-point using the not paretic arm and hand, ipsilateral to the side of the lesion (right-brain-damaged patients use their right hand, left-brain-damaged patients their left). Lines of different lengths may be presented, with their center aligned with the mid-sagittal plane of the participant’s body, or displaced leftward or rightward (Daini et al., 2002; Schenkenberg et al., 1980; Vallar et al., 2000). Typically, patients with spatial neglect set the mark shifted toward the end of the line ipsilateral to the side of the lesion (i.e., right-brain-damaged patients with left spatial neglect exhibit a rightward deviation from the geometrical mid-point of the line). Typically, the longer the line, the larger the bisection error (Schenkenberg et al., 1980; Vallar, 2000). The Landmark test (Milner et al., 1993) is a variant of manual line bisection, which does not require a motor response. In the Landmark test, in different trials participants decide which of the two halves of pre-bisected lines is longer or shorter, and communicate their response to the examiner, verbally or by pointing. In patients with left spatial neglect, deciding that the right half of the line is longer or shorter than the left half, independent of their actual lengths, and of the question posed by the examiner (longer or shorter?) indicates a response or output bias. By contrast, judging, erroneously with respect to the actual stimulus, that the left half of the line is shorter (and that the right half is longer) indicates a perceptual underestimation of the lateral extent of the left-sided portion of the pre-bisected line.

3) Drawing: to copy a meaningful or meaningless figure presented in front of the participant (Gainotti et al., 1972) or to draw a familiar figure from memory to a verbal command (Bisiach & Vallar, 2000; Rode et al., 2006). Patients’ performances are evaluated considering the extent of the omissions (complete or partial) of details in both the left- and the right-hand sides of the figure, or of each object, when the drawing includes multiple stimuli (Gainotti et al., 1972). Typically, drawings by patients with left spatial neglect are inaccurate and incomplete in their left-hand side. In copying figures including multiple objects, patients may omit either left-hand side objects, with reference to the mid-sagittal plane of their body showing, as discussed above, egocentric spatial neglect, or the left-hand side of each object, with reference to its coordinate frames, showing allocentric spatial neglect (Walker, 1995).

4) Reading: single words and non-words, sentences and passages of prose. Patients showing neglect dyslexia may commit errors of omission, substitution (Ellis et al., 1987; Kinsbourne & Warrington, 1962) and (less frequently) addition (Siéroff, 2017; Vallar et al., 2010) of letters and words (in reading sentences and passages of prose). Reading errors on words and orthographically legal non-words may be classified as neglect errors, using the “neglect point measure” (Ellis et al., 1987). Neglect errors in reading are defined as “errors in which target and error words are identical to the right of an identifiable neglect point in each word but share no letters in common to the left of the neglect point”. Left neglect dyslexia may be also detected using sentences (Antonucci et al., 1995). More ecologically-based reading material (i.e., passages of prose, as in articles or a menu) has been shown to be sensitive to detect the deficit (Galletta et al., 2014).

Patients with spatial neglect may also show a directional bias, or some other forms of neglect, in Activities of Daily Living (ADL). Prompted by several classical observations (see, for instance, Brain, 1941; Paterson & Zangwill, 1944), assessments based on the simulation of realistic conditions have been proposed to quantify the extent of spatial neglect in ADL. The Behavioral Inattention Test (Halligan et al., 1991; Wilson et al., 1987) is a comprehensive standardized battery that includes both conventional paper-and-pencil and behavioral subtests, in which patients are asked to perform tasks reproducing more practical and everyday life activities. Several batteries, including an ADL scale to evaluate extra-personal and personal spatial neglect have been proposed (McIntosh et al., 2000; Pizzamiglio et al., 1989; Zoccolotti et al., 1992; Zoccolotti & Judica, 1991). The Catherine Bergego Scale (CBS) assesses both the functional deficits in extra-personal and personal spaces (e.g., “forgets to eat food on the left side of his/her plate”; “forgets to clean the left side of his/her mouth after eating”), and the patients’ awareness of them, by comparing the observation by the examiner (the physician, the nurse, a relative) with the patients’ evaluation of their own behavior (Azouvi, 1996, 2002; Azouvi et al., 2003; Bergego et al., 1995). The CBS has been repeatedly reported to be a more sensitive tool to detect spatial neglect than the conventional paper-and-pencil tasks (Azouvi, 2002; Azouvi et al., 2003, 2006). Other ecologically valid and straightforward tasks used to assess spatial neglect in personal space require patients to reach contralateral body parts with their ipsilateral hand (Bisiach et al., 1986), or to explore their own body, using the ipsilateral unaffected arm and hand and, with eyes closed, to remove several Velcro stickers previously attached on their clothes, on both sides of the body (Cocchini & Beschin, 2020).

To summarize, a comprehensive clinical assessment of spatial neglect should include: (i) neuropsychological tests, evaluating both near extra-personal (target cancellation, line bisection, drawing by copy and from memory and reading tasks) and personal (body perception, body awareness, assessed by tests requiring patients to explore the two sides of the body, searching for body parts or objects on them) manifestations of the deficit (see Halligan et al., 2003; Menon & Korner-Bitensky, 2004; Vallar & Bolognini, 2014, for reviews); (ii) ADL scales or ecologic tasks, also assessing the patients’ awareness for the disorder (see for review Azouvi, 2017).

Early Attempts to use Information Technologies (IT) for the Neuropsychological Assessment of Spatial Neglect

With increasing time from stroke onset, many patients recover, and may do well in neuropsychological tests, but still have difficulties in ADLs. A more complex control of spatial attention and awareness, and a major contribution from executive processes, may be required since, in everyday life, every activity is a multiple and not repetitive task, at variance from the previously discussed laboratory tests (Bonato et al., 2010, 2012; Della Sala et al., 2018).

Starting from the late 1990s, several computer- and touch-screen-based tasks have been devised and implemented, to provide a more sensitive and informative assessment, allowing to detect impairments in performance even in patients who do within the normal range in paper-and-pencil tests (Deouell et al., 2005; Erez et al., 2009; Rabuffetti et al., 2002; Schendel & Robertson, 2002).

Computerized tests have indeed some pros, typically recording more precisely information such as accuracy and response latencies throughout the execution of the task. To implement the architecture of the test devised by the experimenter, stimuli may be presented in different locations, simultaneously or as a sequence, with equal or varying time intervals between stimuli, across trials, sessions, or both, and may be repeated many times (Bonato et al., 2010; Buxbaum et al., 2012; Deouell et al., 2005). To manipulate the load posed to the participant’s resources, different levels of difficulty of the task can be easily implemented, varying for instance the duration and the number of the stimuli, and adding a concurrent task (Bonato et al., 2013; Bonato & Deouell, 2013). Additional behavioral measures may also be recorded, such as eye movements (Van der Stigchel & Nijboer, 2010), and data available from touch-screen recording (Rabuffetti et al., 2002), such as number and duration of touches, along with response latency measures recorded throughout the execution of the task (Kim et al., 2010). Furthermore, chances for a ceiling effect (namely, most patients obtain the maximum score in a test) are reduced. Quantitative, continuous measures, appropriate for statistical analyses in single patients, may be easily recorded (Robey et al., 1999), including sensitive individual monitoring of variations in performance levels through repeated assessments. Because of the possibility of easily producing unpredictable settings (e.g., presenting stimuli in random locations of the working space, with different shapes, and time duration), computerized tests can be made harder to learn, making more difficult the development of compensatory strategies on the patient’s part. Computerized tests are thus more suitable for test-retest designs, for instance readily allowing the assessment of the generalization effects of a given rehabilitation treatment (Geusgens et al., 2007). Finally, and importantly, computer-based tasks do not pose a general heavy burden on stroke patients in the post-acute and chronic phases. No adverse reactions, such as headache, fatigue, or boredom, have been reported. Conversely, digital tests have been evaluated as funnier, more precise and safer than the traditional paper-and-pencil ones (see, for instance, Pallavicini et al., 2015; Quinn et al., 2018; Smit et al., 2013; Ulm et al., 2013).

Also Augmented and Virtual Reality-CNADs should address all the above-mentioned issues. Augmented Reality (AR) can be defined as a set of techniques and tools, which add information to physical reality. AR has been used in many fields, including clinical psychology (Chicchi et al., 2015 for review), while its applications to neuropsychology are still limited. Virtual Reality (VR) is a computer-based multisensory, stimulating, and interactive environment, which occurs in real-time. Participants are engaged in activities that appear similar to real-world objects and events (Rizzo et al., 2004). VR can be subdivided in non-immersive (2-dimensional screen presentations, with interaction devices, such as a joystick), and immersive (requiring the integration of computers with further devices, such as Head-Mounted Displays, HDMs, VR controllers, or body-tracking sensors) applications. The immersive applications allow users to experience the virtual environment, and to interact with it, based on the movements of the head or body. For these reasons, starting from the 2000s, several VR paradigms have been implemented in both clinical and research settings (Neguț et al., 2016; Parsons, 2015; Rizzo et al., 2004). As compared to both paper-and-pencil and computer-based assessments, VR may be a useful tool to adequately evaluate ADLs in patients with spatial neglect, through scenarios that resemble everyday life situations (see for reviews Ogourtsova et al., 2017; Pedroli et al., 2015; Tsirlin et al., 2009). Several issues need however to be considered, although several neuropsychological VR paradigms have been developed in the last few years, facilitated by the decreasing costs of the hardware components and the increasing availability of open-access systems. Open issues include: (i) VR-Induced adverse Symptoms and Effects – VRISE (nausea, dizziness, disorientation, fatigue and instability). These symptoms and effects are mainly related to hardware and software inadequacies, not shared/overcome by more contemporary VR hardware and software (Kourtesis et al., 2019a, b). The virtual reality neuroscience questionnaire (VRNQ) is a tool which may facilitate the quantitative assessment of software attributes and intensity of VRISE (Kourtesis et al., 2019b). (ii) Ergonomic features of VR tools. Ergonomic and naturalistic interactions also minimize the risk of VRISE (Kourtesis et al., 2019a). Realistic interfaces, with direct hand interactions and wands with six degrees of freedom of movement, facilitate naturalistic and ergonomic interactions. Direct hand interactions are easier in terms of familiarization with their controls and efficiency and offer a more pleasant user experience; however, they appear less accurate than wands with six degrees of freedom of movement (Figueiredo et al., 2018; Sportillo et al., 2017). Stroke patients often use a wheelchair for locomotion, due to the presence of motor deficits, such as hemiparesis or hemiplegia. VR systems, that can be controlled with one hand (in patients with a hemispheric stroke, the non-paretic ipsilesional upper limb and hand), substantially improve the user experience of the tool itself (Fordell et al., 2011; Kim et al., 2011; Tsirlin et al., 2009). (iii) Psychometric features of the developed systems. Ecological validity is crucial for the assessment of spatial neglect because, with increasing time from stroke onset, patients may still show difficulties in ADLs, despite doing well in neuropsychological paper-and-pencil tests (Bonato et al., 2010, 2012; Della Sala et al., 2018). Immersive VR technology allows to collect data through the employment of dynamic stimuli and interactions with a high degree of control within an ecologically valid environment (Kourtesis et al., 2020; Rizzo et al., 2004). However, immersion depends on the placement, plausibility, and embodiment illusions (Maister et al., 2015; Slater, 2009, 2018). The placement illusion consists in the deception of users to be in a real and not in a virtual environment. The plausibility illusion is the deception of users that the environment reacts to their actions: they thus consider plausible that they are immersed in a real environment. The embodiment illusion refers to the deception that users own the body of the virtual avatar. (iv) Competences. Software developers and researchers are required to have adequate technological abilities, to opt for the appropriate hardware and software and to achieve their research, clinical aims, or both (Kourtesis et al., 2019a). Since a clinical staff typically has no programming skills, cooperation with software developers is needed, to develop and customize these systems (Fordell et al., 2011; Mainetti et al., 2013; Sedda et al., 2013; Sugarman et al., 2011). (v) User experience. This factor is affected by the used hardware, the quality of the sound and graphics, the level of immersion experienced by the user and the level of the enjoyment of the VR experience (Kourtesis et al., 2019b).

Notwithstanding the available evidence, the American Academy of Clinical Neuropsychology (AACN) and the National Academy of Neuropsychology (NAN), raise eight key issues regarding the development, dissemination and implementation of Computerized Neuropsychological Assessment Devices (CNADs), for both research and clinical aims: (i) safety and effectivity, (ii) identity of the end-user, (iii) technical features of hardware and software, (iv) privacy and data security, (v) psychometrics properties, (vi) examinee’s issues, (vii) use of reporting services, and (viii) reliability of responses and results. CNADs could be a standalone device or software, that can be run on a personal computer, a laptop, a tablet, or a smartphone (Bauer et al., 2012; see Kourtesis & MacPherson, 2021 for VR-related issues).

The Review

Up to present, the manifestations of spatial neglect are mainly detected through neuropsychological paper-and-pencil tests, in both clinical and research settings. As noted above, in the subacute and chronic phases post stroke (Rehme et al., 2012), due to spontaneous recovery and cognitive rehabilitation, patients with spatial neglect may do well in neuropsychological tests, but still have difficulty in ADLs (Bonato et al., 2010, 2012; Della Sala et al., 2018). Information Technology (IT)-based devices (computer-, touch-screen- and VR-based) are useful tools for neuropsychological assessment, recording more information than paper-and-pencil tests and allowing to replicate and alter the surrounding world, to evaluate the patients’ behavior in contexts simulating real life (see for reviews Ogourtsova et al., 2017; Pedroli et al., 2015; Tsirlin et al., 2009). However, as previously mentioned, when developing, disseminating and implementing CNADs, for both research and clinical aims, several issues must be taken into account (Bauer et al., 2012; Kourtesis & MacPherson, 2021 for VR-specific issues).

The aim of this scoping review was to present an overview of studies from 2010 onwards, in which IT-based devices were used as a potential tool for the assessment of spatial neglect. Scoping reviews represent indeed a useful instrument to identify and map all available research on a specific topic, and to report information about how studies have been conducted (Munn et al., 2018). Only studies since 2010 onwards were included in this review for two reasons. Firstly, studies prior to 2010 have already been reviewed (Ogourtsova et al., 2017; Pedroli et al., 2015; Tsirlin et al., 2009); secondly, this review aimed at critically verifying whether the tools developed in the last decade successfully target the AACN & NAN Criteria (Bauer et al., 2012; see Kourtesis & MacPherson, 2021 for VR-related issues).

Methods

The norms previously established by the PRISMA-P Group for scoping reviews (Tricco et al., 2018) were adopted. The protocol for this review was not previously registered.

Search Strategy and Selection Criteria

The study search was initially kept broad to capture all relevant articles concerning the use of IT devices [computer, tablet and virtual (VR) or augmented (AR) reality], as possible tools for the clinical assessment of manifestations of the syndrome of spatial neglect. The following electronic databases were used for the identification of papers, up to July 2022: MEDLINE, ScienceDirect, Web of Science and Scopus. The search sting was: (“technology” OR “computer” OR “tablet” OR “virtual reality” OR “augmented reality”) AND [“neglect” OR (“spatial neglect” OR “visual neglect” OR “visuospatial neglect” OR “hemispatial neglect” OR “unilateral spatial neglect”)]. The reference lists of included articles and relevant reviews were checked to identify additional studies. The selection process was performed using Rayyan – a web and mobile app for systematics reviews (access from: https://www.rayyan.ai) (Ouzzani et al., 2016). Titles, abstracts, and full-text articles were screened independently by four out of the six authors (ST, FA, GM and RE), and evaluated for eligibility, based on the following inclusion and exclusion criteria:

  • Inclusion criteria.

    • Related to, and making use of, any of the technologies employed in the clinical assessment of spatial neglect in brain-damaged patients (studies concerning nanotechnologies, technologies applied to eating behavior, nano-molecules, functional Magnetic Resonance Imaging - fMRI, Transcranial Magnetic Stimulation - TMS, transcranial Direct Current Stimulation - tDCS techniques and drug therapies were not included).

    • Focusing on adult brain-damaged patients with spatial neglect (i.e., at least one patient entered the study) and using the abovementioned technologies to assess neglect in patients with a clinical diagnostic aim.

    • Published since 2010.

    • Published in peer-reviewed journals.

    • Published in English.

  • Exclusion Criteria.

    • Technical papers with no clinical application in patients with spatial neglect.

    • Literature reviews.

    • Unpublished data, conference proceedings or articles not reporting quantitative data.

These criteria are in line with those of other published reviews (Bucur & Papagno, 2021, 2022). A preliminary selection based on title, abstract and keywords was run, excluding those articles not matching selection criteria. A further selection was then made by inspecting full text manuscripts and applying inclusion and exclusion criteria. Unresolved papers were discussed by all authors, to reach a consensus. Duplicates were identified and removed through hand search.

In line with several review articles on neuropsychological topics (Bucur & Papagno, 2021, 2022; Ogourtsova et al., 2017; Zhang et al., 2017) we chose: (i) to include only papers published in English, (ii) not to include articles from gray literature (e.g., unpublished Ph.D. thesis) for two main reasons: (a) often, Ph.D. students embargo their dissertations, and (b) good level theses may be subsequently published in journals as research articles (see, for instance, Bucur & Papagno, 2019, 2022). With these inclusion and exclusion criteria, it is likely that only papers, which have passed the peer review process at an international level, are included in the present review.

Results

The literature search retrieved 9764 articles. After duplicate removal, 5344 were screened for further evaluation because they were potentially relevant research articles, that explored the use of technologies (computer, tablet, VR or AR) as possible tools for the clinical assessment of spatial neglect. After Title and Abstract screening, 52 articles were assessed for eligibility, and, finally, 42 articles met all inclusion criteria and were further analyzed. Figure 1 shows the entire search and selection process.

Fig. 1
figure 1

Flow diagram of study selection and inclusion criteria

Articles meeting the inclusion criteria were then categorized into four groups, depending on the digital instruments used to administering the digital tasks: computer (pc and laptop) (13), graphics tablet and touch-screen tablet (8), virtual reality (16), and other (5). The main features of the included studies are summarized in Table 1. In all studies, the presence/absence of spatial neglect was defined according to the neurological exam, a neuropsychological paper-and-pencil assessment, or both. Ten out of 12 studies describing computer-based tools included only patients suffering from a cerebrovascular attack (CVA). Two out these 12 studies (Bonato et al., 2013; Rabuffetti et al., 2012) included patients with different aetiologies (CVA, brain tumor, and traumatic brain injury). One study included also a control group of patients with Mild Cognitive Impairment (Blini et al., 2016). All 13 studies included right-brain-damaged patients; left-brain-damaged patients were included in six out of 13 studies (Blini et al., 2016; Kocanaogullari et al., 2020; Rabuffetti et al., 2012; Villarreal et al., 2020, 2021, 2022). All eight studies, describing graphics tablet- and touch-screen tablet-based assessment tools, included right-brain-damaged patients suffering from CVA; left-brain-damaged patients were included in two out of eight studies (Chung et al., 2016; Pierce et al., 2021). All 16 studies but one (Mesa-Gresa et al., 2011, which included also patients with a brain tumour) using VR-based tools included patients suffering from CVA. Left-brain-damaged patients were included in three out of the 16 studies (Aravind et al., 2015; Aravind & Lamontagne, 2014; Fordell et al., 2011). Immersive VR paradigms were used in all but two studies (Buxbaum et al., 2012; Grattan & Woodbury, 2017). Finally, among the other group, all the five studies included patients suffering from CVA and only two (Peru et al., 2017; Spreij et al., 2020) left-brain-damaged patients.

Analysis of the Quality of the Studies

The quality of the included studies was analyzed using the ad-hoc quality criteria of Corti et al. (2021). Four of the authors (ST, FA, GM and RE) independently evaluated the quality of the studies; no discrepancies in ratings among the evaluators were found. The evaluation criteria, based on parameters considered as relevant for performing experimental and clinical neuropsychological studies, considered the presence (1 = yes, 0 = no) of:

  1. 1.

    Sample size calculation, based on power analysis, to avoid underpowered studies with unclear results, and the occurrence of publication bias. Recent literature points out the need to calculate the required sample size by using power analysis (Brysbaert, 2019; Kühberger et al., 2014). Specifically, for studies focused on tools for clinical neuropsychological assessment, power analysis can be performed by estimating the sample size needed to detect a significant difference in performance, with a specific power, with respect to a control group of participants not presenting impairments in the target construct, namely: healthy controls or patients without the impairment of interest (spatial neglect in the present review, see criterion below).

  2. 2.

    Control group of healthy participants, of patients not presenting an impairment in the target construct, or of both. The comparison of the patients’ performances with those of a control group of participants, matched for age, sex, and educational level, is a standard neuropsychological practice, to evaluate the performance of a patients’ sample with a specific diagnosis (in this review, spatial neglect), with respect to participants who do not show that specific impairment, namely: healthy control participants, patients without spatial neglect or both (Kan et al., 2019; Vallar, 2000). Statistical corrections, such as using co-variates in the analysis of variance (Capitani & Laiacona, 1997), can be performed to control for the possible effects of concomitant variables (age, sex, educational level, duration of disease in stroke patients, severity of the overall neurological impairment, etc.) on the patients’ performances in the tests assessing the target construct of interest (in this review, neglect).

  3. 3.

    Sensitivity or specificity reporting. Sensitivity is the ability of a test to correctly identify patients with a disease; specificity is the ability of a test to correctly identify people without the disease (Swift et al., 2020). Sensitivity and specificity are two psychometric properties that are used to assess target construct validity (i.e., the extent to which an instrument provides a measure of a theoretical construct).

  4. 4.

    Measures of construct validity (the degree to which a test or instrument can measure a concept, trait, or other theoretical entity): convergent and divergent or discriminant validity reporting (Strauss & Smith, 2009). Convergent or congruent validity is the extent to which responses to a test or to an instrument exhibit a strong relationship with responses to similar tests or instruments, namely the extent to which a measure correlates with other measures of the same or of a similar construct. Divergent or discriminant validity is the degree to which a test or measure diverges from (i.e., does not correlate with) another measure, whose underlying construct is conceptually unrelated to it.

  5. 5.

    Ecological validity reporting. Ecological validity indicates if the findings of a study can be generalized to realistic situations in everyday life. It is a subtype of external validity, that can be defined as the extent to which the measures of outcome correlate with, or predict, performance in ADLs (see also Holleman et al., 2020).

  6. 6.

    User experience reporting. This criterion was added to those proposed by Corti et al. (2021), to evaluate the presence of negative (i.e., heavy burden or adverse reactions, as headache, fatigue or boredom) or positive (i.e., feelings of fun and safety) reporting about the experience of using these digital tools, as compared to the traditional paper-and-pencil ones. This criterion was considered as met if user experience reporting was obtained from healthy participants, brain-damaged patients, clinicians, or from all of them.

The analysis of the quality of each study was then based on these six ad-hoc established criteria. With respect to sample size calculation based on power analysis, only two studies (Aravind & Lamontagne, 2014; Quinn et al., 2018) applied this procedure (4.76%). Twenty-six studies included a control group of healthy participants (61.90%), 19 had a control group of patients without spatial neglect (45.23%), and 12 included both (28.57%). As for psychometric properties, 23 studies reported sensitivity or specificity measures (45.23%), 20 convergent or discriminant validity (46.61%) and nine ecological validity (14.28%). Furthermore, among the studies reporting specificity or sensitivity, only two (Buxbaum et al., 2012; Chung et al., 2016) also reported the cut-off for impairment. This means that only for two out of 42 studies data to classify the patients’ performance as impaired or non-impaired (4.76%) were available. Finally, with respect to user experience reporting, only nine studies showed these data (21.42%). Table 2 shows the analysis of the quality of each study, based on the six criteria reported above.

Table 1 Characteristics of studies included. Summary of abbreviations and references:
Table 2 Evaluation of the quality of the studies

Discussion

At present, the main manifestations of spatial neglect are mainly detected through clinical observation and neuropsychological paper-and-pencil tests, that allow to explore its features (e.g., egocentric vs. allocentric, personal vs. extra-personal) in a standardized way, and to quantify its severity (Heilman et al., 2003; Lezak et al., 1995; Vallar & Bolognini, 2014). However, some pieces of evidence point out that, the longer the duration of the disease after stroke onset, the better the patients’ performances in neuropsychological tests, although difficulties are still found with ADLs (Bonato et al., 2010, 2012; Della Sala et al., 2018). This improvement of spatial neglect in stroke patients is due to both spontaneous recovery of neurological and neuropsychological deficits (Corbetta et al., 2005; Murphy & Corbett, 2009; Stefaniak et al., 2020; Stinear, 2017), and the effects of rehabilitation (Azouvi et al., 2017; Bowen et al., 2013; Kerkhoff & Schenk, 2012). Emerging literature suggests that IT-based devices could be an important tool for neuropsychological assessment. These technologies allow to record much more information with respect to paper-and-pencil tests, and to replicate and alter the surrounding world, permitting to evaluate the patients’ behavior in contexts resembling real life conditions (Ogourtsova et al., 2017; Pedroli et al., 2015; Tsirlin et al., 2009). Despite the potential of these technologies, several issues need to be taken into account when developing, disseminating and implementing CNADs (Bauer et al., 2012; see Kourtesis & MacPherson, 2021 for VR-related issues).

This scoping review aimed at presenting an overview of the studies in which IT-based devices were used as tools to assess manifestations of spatial neglect. The inclusion criteria were met by 42 articles, which were categorized into five groups, depending on their technological approaches: computer (13), graphics tablet or tablet (8), virtual reality-based assessment (16), and other (5).

Computer (pc/laptop)

Computer tools have proved to be effective for the evaluation of manifestations of spatial neglect. These systems appear to be more sensitive in recording difficulties along the left-right spatial dimension, with respect to the gold standard and time-honoured paper-and-pencil tests. Many advantages of this kind of technology are acknowledged by the AACN and the NAN, as well as by this review. Relevant issues include the following: (i) testing large numbers of participants quickly (e.g., parallel administration); (ii) tests available at any time; (iii) enhanced accuracy and precision (e.g., reaction times, or exploration time of the two sides of space, Machner et al., 2018; Mizuno et al., 2016; Rabuffetti et al., 2012; Rossa et al., 2019; Villarreal et al., 2020, 2021); (iv) shorter administration times and reduced costs for test administration and scoring; (v) when appropriate, test versions for different languages; (vi) automatically exporting recorded data to databases (Chiba et al., 2010; Jee et al., 2015, for a digital version of the line bisection task); (vii) increased accessibility, for instance from remote; (viii) integration of algorithms for making decisions on issues such as the identification of an impairment or of a statistically reliable change in performance. These technologies can be easily integrated with other systems, thus providing a wider range of measures (e.g., Electroencephalography, Kocanaogullari et al., 2020). Finally, computerized tests adequately suit the purpose of being administered via telematics from remote: the currently ongoing SARS-CoV-2 world pandemic has emphasized this need.

On the other hand, the present qualitative evaluation of the studies points out several critical issues that must be addressed. With respect to sample size calculation, none of the studies applied power analyses. Ten studies included a control group of healthy participants, and four studies a control group of patients without spatial neglect. Only three out of 13 studies included both. This means that the ability of about 75% of the studies to distinguish different levels of performance among different groups of participants (healthy, affected or not affected by the deficit under investigation, as indicated by other tasks) was not completely assessed, making uncertain the possible clinical validity of most of the proposed IT-based tasks for distinguishing a defective from a preserved performance. As for psychometric properties, eight studies reported sensitivity or specificity measures; two studies report convergent or discriminant validity and only one ecological validity (only Machner et al., (2018) reported both). None of the studies reporting specificity or sensitivity, provided cut-offs for detecting performance impairments. Finally, with respect to user experience reporting, only Ulm et al. (2013) showed these data. The parameters of ecological validity and user experience reporting are most relevant, considering that ADL scales are most sensitive to detect spatial neglect (Azouvi, 2017) and the used IT-based devices are novel tools for both clinicians and patients.

Graphic Tablet and Touch-Screen Tablet

The same list of pros discussed in the previous section readily applies to graphics and touch-screen tablets. The present qualitative evaluation raises several critical issues also for the studies included in this group. In the first place, sample size calculation based on power analysis was applied by only one study (Quinn et al., 2018). A control group of healthy participants was included in four studies and five studies included a control group of patients without spatial neglect; only two out of eight studies had both. This means, as for the computer group, that in about 75% of the studies, the possible clinical validity of most of the proposed IT-based tasks for distinguishing a defective from a preserved performance is uncertain. As for psychometric properties, sensitivity, or specificity measures and convergent or discriminant validity were more frequently reported than in the computer group: five studies reported sensitivity or specificity measures (62.5%) however, among them, only Chung et al. (2016) reported cut-offs for detecting impaired performances. Four studies (50%) reported convergent or discriminant validity. Ecological validity was underreported also in this group: only the article by Pallavicini et al. (2015) provided these measures. Finally, only two out of eight studies mentioned user experience reporting (Pallavicini et al., 2015; Quinn et al., 2018), showing high ratings of acceptance and usability of the NeglectApp and of the StrokeVision app.

Virtual Reality

Among the 16 studies included in this group, non-immersive paradigms were used only in two studies (Buxbaum et al., 2012; Grattan & Woodbury, 2017). The Virtual Reality Lateralized Attention Test (VRLAT) consists indeed of a 2-dimensional screen presentation of a virtual, nonbranching path, in which participants interact with the environment using a joystick. The VRLAT detects manifestation of spatial neglect better than the traditional paper-and-pencil tasks; moreover, scores obtained at the VRLAT correlate with those obtained in the CBS and in the Naturalistic Action Test (Schwartz et al., 2002). However, Buxbaum et al. (2012) report that six out of 70 right brain-damaged patients (about 9%) did not complete the experimental procedure, due to difficulties in operating with the joystick; no other measures of user experience are provided. All the remaining 14 studies included in the VR group employed instead immersive paradigms: in particular, 10 studies integrated computers with head-mounted display, two used shutter glasses (Dvorkin et al., 2012; Fordell et al., 2011), and two body-tracking sensors (Mesa-Gresa et al., 2011; Sugarman et al., 2011). Regarding the methodological aspects, only Aravind and Lamongagne (2014) applied sample size calculation based on power analysis. Eight studies included a control group of healthy participants and eight a control group of patients without spatial neglect; six studies included both groups. As for psychometric properties, seven studies (43.75%) reported sensitivity or specificity measures, but only Buxbaum et al. (2012), also cut-offs for detecting performance impairments. Twelve studies reported instead convergent or discriminant validity measures (75%).

VR paradigms allow to replicate and alter the surrounding world to evaluate the patients’ behavior in contexts simulating real life (Ogourtsova et al., 2017; Pedroli et al., 2015; Tsirlin et al., 2009). However, only Buxbaum et al. (2012), who employed a non-immersive paradigm, and Kim et al. (2021), who used instead an immersive paradigm, provided ecological validity measures (12.5%). The paucity of measures regarding ecological validity is worth noticing, considering that one possible advantage of VR paradigms is to create scenarios that resemble real life situations, thus giving leeway to the evaluation of the patients’ behaviour in these contexts. In particular, the fact that only one out of 14 studies adopting immersive paradigms reported these measures is even more notable, considering that immersive VR technology allows to collect data through the employment of dynamic stimuli and interactions with a high degree of control within an ecologically valid environment (Kourtesis et al., 2020; Rizzo et al., 2004). Finally, with respect to user experience ratings, data, although encouraging, are few: as for non-immersive VR-paradigms, about 9% of the clinical sample did not complete Buxbaum et al.’s (2012) test, due to difficulties in operating with the joystick. As for the immersive VR-paradigms, Fordell et al. (2011) employed a three-item questionnaire: results show that, during the VR-DiSTRO battery administration, 41% of the clinical sample reported to feel focused, 22% to feel pleased during the administration of the task, and 25% to feel alert. Sugarman et al. (2011) made use of the Short Feedback Questionnaire (Rand et al., 2008): the only patient tested with the VR react task of the SeeMe system reported feelings of enjoy, control and success in the VR environment. In the study by Knobel et al. (2020), the administration of the System Usability Scale (Brooke, 1996) to the clinical sample revealed high ratings of acceptance and usability of their VR cancellation task. Conversely, Ogourtsova et al. (2018a, b) conducted focus groups and administered questionnaires directly involving the clinician: experts showed interest and availability to use VR paradigms in their clinical practice. Finally, only two studies investigated VRISE: no participant reported nausea or other adverse effects during or after the administration of the VR-DiSTRO battery of Fordell et al. (2011). Furthermore, cybersickness connected to the VR cancellation task of Knobel et al.‘s (2020) was rated as “very low” at the Simulator Sickness Questionnaire (Kennedy et al., 1993).

Other

Studies included in this section employ an interactive table (Peru et al., 2017), a planar end-effector robot (REAplan®robot) (Montedoro et al., 2019), a tangible user interface system consisting in an integrated system of concrete objects that participants can manipulate (E-TAN platform) (Cerrato et al., 2020) and a driving simulator (Sotokawa et al., 2015; Spreij et al., 2020). None of the included study applied sample size calculation procedures. Four out of five studies included a control group of healthy participants, but only one (Spreij et al., 2020) also a control group of patients without neglect. Sensitivity or specificity measures were reported in three studies, convergent or discriminant validity measures in two. Only Spreij et al. (2020) reported measures of ecological validity. Finally, with respect to user experience, only Cerrato et al. (2020) provided the reports of some patients, who spontaneously described the task as agreeable and quick to perform, and as a welcome change after so many boring paper-and-pencil tests.

For sure, the simulated driving-based assessment offers an evaluation based on tasks like the situations people face in real life while driving; however, the two studies included in this section share the same limitations previously mentioned for VR paradigms. Furthermore, a requirement of driving simulators and VR devices is that both the experimenter and the participant have to be trained to be able to use them; this can be difficult for patients with cognitive impairment, such as memory deficits (Baddeley et al., 2003; Ferbinteanu, 2019). The same problem concerns all the innovative devices included in this section, that are realized with proprietary hardware and software components, since a specific training for each technological proposal is required.

Taken together, data emerging from the present qualitative synthesis of the studies included in this scoping review, although encouraging, are still few and leave than room for several critical issues. In the first place, from a methodological point of view, the recent literature points out the need to calculate the required sample size by using power analysis (Brysbaert, 2019; Kühberger et al., 2014). Accordingly, underpowered studies which may provide unclear results, and then the occurrence of publication biases, are to be avoided. However, despite the above mentioned methodolological critical issues, only two studies included in our review (about the 5%) used sample size calculation procedures. Furthermore, only 12 studies (less than 30%) included both a control group of healthy participants and patients without neglect: the comparison of the patients’ performances with those of a control group of participants, is indeed standard neuropsychological practice, to evaluate the performance of a patients’ sample with a specific diagnosis (in this review, spatial neglect), with respect to participants who do not show that specific impairment, namely: healthy control participants, patients without spatial neglect or both (Kan et al., 2019; Vallar, 2000). As for psychometric features, sensitivity, sensibility measures and convergent or divergent validity were reported in about 50% of the reviewed studies. Moreover, considering that in the subacute, and particularly in the chronic phase post-stroke, patients may still show difficulties in ADLs despite doing well in neuropsychological paper-and-pencil tests (Bonato et al., 2010, 2012; Della Sala et al., 2018), it is even more notable that ecological validity was addressed only in about 13% of the studies. Ecological validity is a subtype of external validity, that can be defined as the extent to which the measures of outcome correlate with, or predict, performance in ADLs (see also Holleman et al. 2020). Finally, despite user experience represents a further critical issue that should be addressed to benefit from the advantages of CNADs (Bauer et al., 2012; see Kourtesis and MacPherson, 2021 for VR-related issues), only about the 20% of the included studies reported user experience measures.

Conclusion

The current findings make it difficult to establish golden standard tests or assessment procedures other than those (i.e., paper-and-pencil tests), that are extensively used in clinical settings for the evaluation of spatial neglect. Moreover, the process of developing IT-based tests in clinical neuropsychology should take into account the issues raised for the development, implementation and dissemination of CNADs and VR paradigms (Bauer et al., 2012; see Kourtesis & MacPherson, 2021 for VR-related issues). This is a laborious procedure, requiring the cooperation of a multidisciplinary team, including clinicians, researchers, informatics, interaction designers and engineers. While embracing technology is necessary, it is also essential to be aware of every aspect that leads to an efficient and effective assessment (see, for instance, Kourtesis & MacPherson, 2021). Nevertheless, it is our view that, through technical and user experience improvements, studies in brain-damaged patients and control participants, comparing paper-and-pencil and IT-based devices, and normative studies providing the cut-off scores, needed for the diagnostic process (e.g., Mancuso et al., 2015; Vallar et al., 1994), it will be possible to gather increasing evidence about the efficacy of at least some of the tests considered in this review. This, in turn, will provide additional support for changes and improvements in clinical practice, for diagnostic and follow up purposes. This review and the discussion of the available IT-based studies provides a database of information for the future developments of this line of research.