Recovery of vanished URLs: Comparing the efficiency of Internet Archive and Google
Malaysian Journal of Library & Information Science (IF 1.475), Pub Date: 2017-05-31, DOI: 10.22452/mjlis.vol22no2.3
D. Vinay Kumar, B. T. Sampath Kumar

This article examines the vanishing of URLs and the recovery of vanished URLs through the Internet Archive and the Google search engine. To that end, the study investigates the URLs cited in articles of two open access LIS journals published during 2009-2013; a total of 226 articles were selected. Of the 5197 citations in these 226 articles, 21.05 percent (1094) were URLs. The study found that 38.12 percent of the URLs (417 out of 1094) were missing, and the remaining 61.88 percent were active when checked with the W3C Link Checker. The HTTP 404 error ("page not found") was by far the most common message encountered, accounting for 54.2 percent of all HTTP error messages. The Internet Archive and the Google search engine were then used to recover the vanished URLs. The Internet Archive recovered 66.19 percent of the vanished URLs, whereas Google recovered only 30.70 percent. Recovery through the Internet Archive and Google raised the active URL rate from 61.88 percent to 87.11 percent and 73.58 percent, respectively. The study concludes that the Internet Archive is a more efficient tool than the Google search engine for recovering vanished URLs.
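The percentages in the abstract can be checked with a few lines of arithmetic. The sketch below is not from the paper; it takes the 417 missing URLs relative to the 1094 URL citations (417/1094 reproduces the reported 38.12 percent) and assumes that the recovered counts are whole numbers obtained by rounding. All constant and function names are illustrative.

```python
# Figures reported in the abstract.
TOTAL_CITATIONS = 5197
URL_CITATIONS = 1094
MISSING_URLS = 417
IA_RECOVERY_RATE = 0.6619      # share of missing URLs recovered via Internet Archive
GOOGLE_RECOVERY_RATE = 0.3070  # share of missing URLs recovered via Google


def pct(part, whole):
    """Return part/whole as a percentage rounded to two decimals."""
    return round(100 * part / whole, 2)


def active_rate_after_recovery(recovery_rate):
    """Active-URL percentage once recovered URLs are counted as active."""
    active = URL_CITATIONS - MISSING_URLS
    # Assumption: the paper's recovered counts are whole URLs, so round here.
    recovered = round(MISSING_URLS * recovery_rate)
    return pct(active + recovered, URL_CITATIONS)


if __name__ == "__main__":
    print("URL share of citations:", pct(URL_CITATIONS, TOTAL_CITATIONS))
    print("Missing URLs:", pct(MISSING_URLS, URL_CITATIONS))
    print("Active URLs:", pct(URL_CITATIONS - MISSING_URLS, URL_CITATIONS))
    print("Active after Internet Archive:", active_rate_after_recovery(IA_RECOVERY_RATE))
    print("Active after Google:", active_rate_after_recovery(GOOGLE_RECOVERY_RATE))
```

Under these assumptions the recovered counts come to 276 URLs for the Internet Archive and 128 for Google, which is consistent with the active-URL rates of 87.11 and 73.58 percent given in the abstract.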

Updated: 2017-05-31