Menu
Home Explore People Places Arts History Plants & Animals Science Life & Culture Technology
On this page
List of Web archiving initiatives
List article

This article contains a list of Web archiving initiatives worldwide. For easier reading, the information is divided in three tables: web archiving initiatives, archived data, and access methods.

Some of these initiatives may or may not make use of several web archiving file formats and/or their own proprietary file formats.

This Wikipedia page was originally generated from the results obtained for the research paper A survey on web archiving initiatives published by the Arquivo.pt (the Portuguese web-archive) team at the time (Daniel Coelho Gomes, João Miranda and Miguel Costa).

Web archiving initiatives

NameCountryCreation YearTechnologiesNumber of EmployeesComments
Full-timePart-time
End of Term Web ArchiveUnited States2008Heritrix, Wayback6–10The End of Term Web Archive captures and saves U.S. Government federal government websites (.gov, .mil, etc) in the Legislative, Executive, or Judicial branches of the government at the end of presidential administrations. Beginning in 2008, the EOT has thus far preserved websites from administration changes in 2008, 2012 and 2016, and is currently gearing up for the 2020 transition. Project partners include CA Digital Library, Internet Archive, Library of Congress, George Washington University, Stanford University, University of North Texas, and the US Government Publishing Office.
Arkiwera2Worldwide (but based in Sweden)2020Open source solutions and custom programming and scripts34Arkiwera is a Swedish company that maintains digital archives of websites and social media accounts for an annual fee. It supports automatic collection, replay, full-text search and data exports.
EU Web Archive3European Union2013Archive-it service1The EU Web Archive compiles the captures of the websites of the European Union institutions, which are hosted on the europa.eu domain and subdomains. Its aim is to preserve EU web content in the long term and to keep it accessible for the public. The archive was created in 2013 by the Historical Archives of the European Union and in 2018, the Publications Office of the EU took over this task and created the EU Web Archive service. The collection of archived websites is covered by the EU Legal Deposit scheme, which collects all the material produced by EU entities in a comprehensive bibliography.
Alabama State Government and Politics Web Site and Social Media Archives4United States2005Archive-it service
Australia's Web Archive5Australia1996PANDORA Digital Archiving System (PANDAS), Heritrix, Bamboo, NLA Trove, HTTrack, Webrecorder, outbackCDX.4>10The National Library of Australia leads the 'PANDORA' component of the Australian Web Archive which takes a selective approach and is a collaborative program of 10 agencies providing curatorial input. PANDORA uses the PANDAS workflow system (developed by the NLA in the late 1990s) with HTTrack as the default harvester. The National Library of Australia also conducts bulk harvesting of Australian government (the Australian Government Web Archive) websites using the Heritrix harvester and Webrecorder with a backend infrastructure (referred to as 'Bamboo') to organise content and the NLA developed outbackCDX tool to manage indexing access restrictions for content. In addition to these approaches the National Library also conducts annual harvests of the whole .au domain which is done in collaboration with the Internet Archive using Heritrix and Wayback. In 2019, PANDORA, the Australian Government Web Archive and the whole domain harvests were integrated into a new single discovery and delivery portal through the NLA's Trove discovery service.
PROMISE project6Belgium2017Heritrix, PyWB7The PROMISE project was a two-year project (2017–2019) that explored the policy-related, legal, technical and scientific issues related to archiving the Belgian web. The aim of the project was to a) identify best practices in the field of web-archiving b) develop a strategy for preserving the Belgian web c) set up a pilot for preserving and providing access to the archived Belgian web and d) make recommendations for the implementation of a sustainable web-archiving service. The project was launched by the Royal Library of Belgium7 and the State Archives of Belgium8 in collaboration with Ghent University (Research Group for Media, Innovation and Communication9 and Ghent Centre for Digital Humanities),10 Université de Namur (Research Centre in Information, Law and Society)11 and Haute-École Bruxelles-Brabant12 (Unité de Recherche et de Formation en Sciences de l'Information et de la Documentation). In October 2019 the concluding colloquium 'Saving the web: the promise of a Belgian web archive')13 took place at KBR. The main research findings were presented during this colloquium.
KBR web archive14Belgium20201KBR15 or the Belgian Royal Library is developing an operational web archive based on the findings of the PROMISE research project16 (2017–2019). Operational policies and technical infrastructure will be developed based on the strategy outlined in the PROMISE project.
KADOC-KU LeuvenBelgium2022HTTrack, Heritrix, Archiveweb.page, Replayweb.page01Thematic archive with a collection concentrated around the interaction between religion, culture and society in Belgium. In 2023 a research project Best practices for social media archiving in Flanders and Brussels ended.
MT.GOV ConnectUnited States2007Archive-It Service1Montana State Library collection of state agency websites dating from 1996 in partial fulfillment of statutory mandate17 to identify, acquire, describe, and provide permanent public access to state publications. Digitized historic state publications available at https://archive.org/details/MontanaStateLibrary
Stillio18Worldwide2011Puppeteer, V8 engine, Gecko, WebKit, Amazon Web Services34SaaS solution for periodical website & social media archiving. Provides screenshot archiving of both static and dynamic web pages in a fixed duration which can be customized as per requirements. Helps in regulatory compliances, trend tracking, ad banner verification, version changes.
PageFreezer19Worldwide2009PageFreezer's Deep Web Crawler, Hadoop, Cassandra, Elastic Search60SaaS solution for website & social media archiving. Provides automatic collection, replay, full-text search and data export of websites, blogs, social media and enterprise collaboration platforms for eDiscovery and regulatory compliance with FDA, FINRA, FSA, SEC, Federal Rules of Evidence, FOIA and records management laws.
OoCities — GeoCities Archive / GeoCities Mirror20Germany2009
Wikiwix Archive — Linterweb 2122France2008Selenium + MongoDBIn production on French-speaking Wikipedia since 2008, open-source project which optimizes the consumption of inodes and thus fills hard drives. Contains an annotation space for archived documents. Main developer Fabien Coulon doctor du Litis on behalf of Linterweb, hosted by Renater https://gitlab.com/dev_linterweb.
Webarchive Austria23Austria2008NetarchiveSuite, Heritrix, OpenWayback11
Deutsche Nationalbibliothek24Germany2012Tools of oia GmbH6The crawling for the selective web archive is done by the German company oia GmbH. The access is restricted to the reading rooms of the German National Library.
DILIMAG (Digital Literature Magazines)25Austria2007WebCurator2One technician, one for collecting and metadata.
Bibliothèque et Archives nationales du Québec (BAnQ)26Canada2012Heritrix, Wayback.2
Web Archiving Program at Library and Archives Canada27Canada2005Archive-It service43Web archiving in Canada is a legislated activity that is conducted for digital preservation purposes under section 8 (2) of the Library and Archives of Canada Act.28 Four FTEs and three part-time staff work on the program. Web archiving at Library and Archives Canada29 is also utilized to effect Legal Deposit.30
Web Information Collection and Preservation - WICP (Chinese Web Archive)31China2003Heritrix, Wayback and NutchWAX Archived 2015-06-26 at the Wayback Machine.
Croatian Web Archive (Hrvatski arhiv weba - HAW)32Croatia2004Crawl: DAMP software, Heritrix

Access: Wayback, Lucene

22The Croatian Web Archive (HAW) is a collection of content harvested from the Internet. In 2004 the Archive started as a concept of selective capturing of web resources. Whole .hr domain harvests have been conducted annually since 2011. as well as thematic/event harvesting for events of national interest. The content of the Archive is publicly available via HAW website. (2 librarians full time, 1 librarian part time, NUL), 2 IT professionals part time (SRCE - University of Zagreb, University Computing Centre)
Webarchiv (National Library of the Czech Republic)33Czech Republic2000Heritrix, Wayback and Seeder.52Czech web archive (Webarchiv) maintained by National Library of the Czech Republic focuses on archiving the Czech national web. Acquisition policy consists of three lines: selective harvests (collection of resources based on selection criteria), topic collections (focused on significant topics in the area of the Czech web) and comprehensive harvests (automatic harvests of content on the national domain). Staff contains 1 manager, 3.5 curators + 1.5 technical staff.
Netarkivet34/ The Danish web archive (Royal Danish Library)Denmark2005Schedule/crawling: NetarchiveSuite, Heritrix, Browsertrix, Archiveweb.page

Access/search/discovery frontend and playback: SolrWayback. Still installed Wayback for alternative playback, but planning to migrate to PyWb.

15.5 FTESince 2005 the collection and preservation of the Danish part of the internet is included in the Danish Legal Deposit Law. The task is undertaken by the Royal Danish Library.

There is no public access to the Danish web archive .The archive is only accessible to researchers affiliated with a Danish research institution who have requested and been granted special permission to use the collection for specific research purposes.

This website https://www.kb.dk/en/find-materials/collections/netarkivet is designed to inform researchers, website owners, and other interested parties about the Danish web archive.

Estonian Web Archive35Estonia2010Heritrix, Squidwarc, PhantomJS and Puppeteer for screenshots of websites frontpages, Pywb, Custom Curator Tool.31Since 2006 the Legal Deposit Law allows the National Library of Estonia to collect Estonian websites as legal deposit copies. Web harvesting is done and archive is maintained by the National Library of Estonia.
Finnish Web Archive36Finland2006Heritrix, Solr, Pywb, Browsertrix crawler, Webrecorder -addon, OutbackCDX, Twarc2, YT-DPL.3>3Maintained by the National Library of Finland. Annually, all *.fi domains are harvested, as well as web servers located in Finland. Outside these harvests, the library manually selects relevant websites.
BnF - Web Legal Deposit37France2006Heritrix, NetarchiveSuite, BCWeb, OpenWayback, SolrWayback, WARC Indexer/Solr11In France, since 2006, the law on copyright and related rights in the information society (known as DADVSI) extended the scope of legal deposit to "signs, signals, writings, images, sounds or messages of any kind " communicated to the public by electronic means - in other words legal deposit of the web. Archiving the French web is a legal commitment, which continues the heritage mission of the BnF. As it is technically impossible to permanently collect all Web content, the goal of completeness from the legal deposit of printed documents has evolved into a sampling approach to create digital collections that show the production and the behaviour of French internet users.
Ina (Institut National de l'Audiovisuel)38France2009Crawl: PhagoSite, Crocket based on Firefox, Fantomas based on PhantomJS / Access: Vortex / Search: Dowser based on Elasticsearch7
Bibliotheksservice-Zentrum Baden-Württemberg39Germany2003Archive-It service0.5Websites of about 20 cities, municipalities, districts and associated corporations, and state libraries are collected by BSZ in commission within various Archive-It collections. Public access. Data storage: San Francisco (Archive-It) as well as backup with Baden-Wuerttemberg storage infrastructure.
Web archive of the German Bundestag40Germany2005
National Széchényi Library Web ArchiveHungary2017Heritrix, Wayback, PyWb, Brozzler, Webrecorder, WCT32From April 2017 till December 2019 the National Széchényi Library (http://www.oszk.hu) ran a web archiving pilot project as part of its comprehensive IT infrastructure development programme. In 2020 web archiving became a permanent service of the National Széchényi Library. From 2021 on, the legal framework was established and the web archive works according to the modified paragraphs of the cultural law and the corresponding government decree. They run thematic, event-based and domain harvests. They have a small demo collection with metadata and full-text search capabilities. The rest of the archive is not publicly available.
Iceland41Iceland2004Heritrix, OpenWayback
National Library of Ireland Web Archive42Ireland2011Archive-it service10.5 FTEThe National Library of Ireland selectively archives Irish websites of scholarly, cultural and political importance through its NLI Selective Web Archive.
Palestine Web ArchivePalestine2011Heritrix, Web curator tool, Wayback, Rosetta1>3National Library of Palestine collecting '.PS' domains, 1 Project Manager part time, 1 Technical Leader full time, 1 librarian part time, 1 IT Infrastructure part time
National Central Library of FlorenceItaly2018Archive-it ServiceThe aim of the project is to collect and to archive digital documents and websites having "cultural interest" for Italian history and culture, according with the principles of the national legal deposit law. The Archive-it Collection is publicly available.
Web Archiving Project (WARP), The National Diet Library, Japan43Japan2002Heritrix, OpenWayback, Solr41Web Archiving Project (WARP) has been archiving websites since 2002. The National Diet Library Law revised in 2009 and coming into force in April 2010, allows the NDL to archive Japanese official institutions' websites: the government, the Diet, the courts, local governments, independent administrative organizations, and universities. Websites of cultural and international events held in Japan, and those related to online periodicals, are also archived based on the permission of their webmasters.
National Library of Korea - OASIS (Online Archiving & Searching Internet Sources)44Korea2001Own system based on Oracle DBMS and specialized search engine (IRS) that performs data management and search function.311
Bibliothèque nationale du LuxembourgLuxembourg2015Heritrix, Wayback, Browsertrix, Solr2The National library of Luxembourg conducts quarterly broad crawls for the .lu domain as well as selective and event-based crawls.

The websites that are harvested in the Luxembourg Web Archive enrich the patrimonial collections of the National library, which allows for the preservation of digital publications for future generations.

Webarchive.lu is the Luxembourg Web Archive's information and participation platform.

Koninklijke Bibliotheek45Netherlands2007Heritrix 3.3, Web Curator Tool 3.0, Wayback, KB e-Depot system~101 crawl engineer, 1 software developer, and 9 collection specialists, all part-time (equivalent to around 4 full-time). The KB selectively collects Dutch sites of research and cultural value.
National Library of Latvia46Latvia2005Web Curator Tool and Wayback1Currently only storing for preservation, access to public in development (ETA June 2012). The Latvian term for web harvesting is "rasmošana".
New Zealand Web Archive47New Zealand1999Web Curator Tool, Heritrix3, Webrecorder, ArchiveIT, Browsertrix, Pywb, OutbackCDX, Rosetta5>10National domain harvests have been run since 2008, and annually since 2015 in collaboration with the Internet Archive. Selective harvesting is undertaken by the National Library of New Zealand primarily using the Web Curator Tool. Three full time staff harvest websites and a number of rostered staff harvest HTML serials or HTML monographs. Supported by one dedicated web archiving engineer, and wider departmental ITMS. Digital Preservation issues are handled by staff who work with Rosetta.
The National Library of Norway48Norway200149
Arquivo.pt5051Portugal2007In-house development, Heritrix, Wayback, NutchWAX Archived 2015-06-26 at the Wayback Machine, Pywb, Apache Solr, Brozzler, Webrecorder.net tools34Arquivo.pt is a research infrastructure that preserves information gathered from the web since 1996 and provides a public search service over this collection. Arquivo.pt preserves websites in several languages and provides user interfaces in English. The archived data can be automatically processed to perform Big Data research through a distributed processing platform or through Application Programming Interfaces that facilitate the development of added-value applications. The Arquivo.pt team has also contributed with scientific and technical articles related to web archiving published in open-access.
Web archive of Cacak52Serbia2009HTTrack1
Web Archive Singapore53Singapore2006Wayback, Heritrix, Solr3The Web Archive Singapore is managed by the National Library Board, Singapore (NLB). NLB conducts domain and selective archiving of websites with a focus on Singapore content. The collection is viewable at the National Library, Singapore with selected content cleared by copyright owners available online.
Digital Resources (University Library in Bratislava)54Slovak Republic2015Heritrix 3.2.0, OpenWayback 2.2.0, Solr 5.2.1, Invenio, Custom Curator Tool, Archivewebpage.org41The University Library in Bratislava (ULIB) performed the first experiments of webharvesting in 2008–2009. In 2015 ULIB carried into operation a platform for web- and e-Born archiving (during the implementation of the national project "Digital resources", that was supported by the European regional development fund) - www.webdepozit.sk/).
Slovenian Web Archive55Slovenia2007Heritrix, OpenWayback, Web Curator Tool1
Archivo de la Web Española56Spain2009NetarchiveSuite, OpenWayback, Solr3+supervisor2Maintained by the National Library of Spain with the collaboration of regional libraries. Takes a mixed approach of selective and broad harvests. Whole .es domain harvests have been conducted annually since 2009 to 2013 in collaboration with the Internet Archive using Heritrix and Wayback. Since 2014 selective harvests have been made by National Library of Spain, using NetarchiveSuite. National Library = 3 librarians full time, 2 crawl engineers part time. Regional libraries = several librarians part time. Since 26 October 2015 the Legal Deposit Law allows the National Library of Spain and the regional libraries to collect Spanish websites as part of the legal deposit and make them available to the public observing the rules of copyright law.
PADICAT: The Web Archive of Catalonia57Spain2005Heritrix, OpenWayback, OutbackCDX and CAT.2PADICAT is the open access Web Archive of Catalonia, created by the Biblioteca de Catalunya: the public institution responsible for collecting, preserving and distributing the bibliographic heritage of Catalonia, in Spain.
ONDARENET - Basque Digital Heritage Archive58Spain2008Heritrix, Wayback, NutchWAX Archived 2015-06-26 at the Wayback Machine and Web Curator.1
Sweden (Kulturarw3)59Sweden1996NetarchiveSuite, Heritrix. Inhouse system for storage, maintenance and access, but moving to pywb or SolrWayback.1.25The Swedish web harvesting project started in 1996 and the first harvest was performed in 1997. In 2002 daily harvests of certain newspaper web sites were added. There was a pause in operation November 2009 - May 2011, but a harvest for 2010 was made with the help of the Internet Archive. No domain harvests were made in 2016, 2018 and 2019 due to problems with the harvesting platform. The daily harvests of newspaper websites were paused between May 2017 and December 2018, but was the expanded to cover all Swedish newspaper web sites on a daily basis. Since April 2013 the National Library of Sweden also receives online material through the Legal Deposit Act for Electronic Material.
Aleph Archives60Switzerland, United States2010Web archiving platform, capture domain name, high performance search engine, Near real time indexing, Web Monitoring tools>10Enterprise-grade automatic web archiving platform for online capture and preservation. Support eDiscovery with powerful and qualitative technology.

Aimed to corporations, institutions and agencies seeking to capture, preserve and leverage their Web content; dynamic websites, wikis, social media, forums, comments, disclaimers, and ads, for compliance (FDA, FINRA, FSA, SEC, FOIA), marketing or pure preservation purposes.

Expatriate Archive Centre Blog Archive61The Hague, The Netherlands2019Archive-It serviceThe focus of this project is blogs written by any people who have lived abroad. We preserve these blogs and their contents because we recognise their cultural and historical value. Adding a blog archive to our collection will enrich the research opportunities for students and other academics who choose the us as a place of study. The archived blogs will be selected based on very specific criteria and their quality will be checked on a regular basis.
Web Archiving Bucket62Switzerland, United States, Canada2012WARC Software Development Kit, Cobalt, Holon web serverThe "Web Archiving Bucket" is an initiative launched by Aleph Archives, to preserve data and provide libraries and organizations with free-to-use web archiving tools and components.

The Web Archiving Bucket provides set of tools to help archivists and professionals in their daily work.

Web Archive Switzerland63Switzerland2008Heritrix, Wayback, Pywb, Webrecorder, Browsertrix Cloud62 crawl engineers, 3 persons for quality assurance (sharing less than 1 full time), 1 coordinator. The curators, who do the selection, are partner libraries all over Switzerland.
NTU Web Archiving System, NTUWAS64Taiwan2007Lucene3
Web Archive Taiwan65Taiwan2007
UK Web Archive66United Kingdom2004Heritrix, Web Curator Tool, Wayback, Solr for searching.
UK Government Web Archive (UKGWA)67United Kingdom2003MirrorWeb71The UK National Archives' UK Government Web Archive (UKGWA) is a fully open web archive. It includes over 5,000 central government websites and social media taken at regular intervals (1996 to present). The scope of UKGWA is outlined in the OSP27 document. Technical side of web archiving operation is supplied by MirrorWeb.
UK Parliament Web ArchiveUnited Kingdom2009MirrorWeb12The UK Parliament Web Archive captures, preserves, and make accessible UK Parliament information published on the web. The web archive includes websites and social media dating from 2009 to the present. The technical side of web archiving operation is supplied by MirrorWeb.
EU Exit Web Archive68United Kingdom2020MirrorWebThe UK National Archives' EU Exit Web Archive is a fully open web archive. It contains a wide selection of documents taken from EUR-Lex (the European legislation website), including Treaties, legislative documents, the Official Journal of the EU, case law and other supporting materials, and judgements of the European Court of Justice in English, French and German. The collection contains all content published up to the completion of the implementation period, at 11pm GMT on 31 December 2020.69

It provides a comprehensive and official UK reference point for EU law as it stood at the end of the implementation period.70

The technical side of web archiving operation is supplied by MirrorWeb.

MirrorWeb71Worldwide2012Heritrix, PYWB for public archives, custom replay for archives inside the MirrorWeb platform. Custom social media archiving tools.40MirrorWeb provides a website and social media archiving platform for financial services and the public sector entities. They run a range of public archives, two of which include; the UK Government Web Archive and the UK Parliament Web Archive.
Internet Archive (provides Archive-it service)72United States1996Heritrix, Wayback, NutchWAX Archived 2015-06-26 at the Wayback Machine and other tools developed by the Internet Archive150Internet Archive's Wayback Machine is the largest and oldest web archive in the world, dating back to 1996. Internet Archive also provide various web archiving services, including Archive-IT, Save Page Now, and domain level contract crawls. The Wayback Machine is the publicly available access service to Internet Archive and partners' collections.
Stanford University Libraries73United States2007Heritrix, HTTrack, Wayback, CDL Web Archiving Service, Internet Archive Archive-It25Stanford University Libraries has been engaged in web archiving projects since 2007 and started establishing a web archiving program in 2013. Collections that SUL is engaged in include Stanford University Archives, Bay Area Governments, Congressional Research Service (CRS) Reports, Freedom of Information Act (FOIA), Fugitive US Executive Agencies and many more. SUL is also involved in collaborative web archiving projects like the Archive of the California Government Domain, CA.gov with libraries at the University of California and the CA State Library, the End of Term Web Archive, and the Ivy Plus Libraries Confederation.
Columbia University Libraries74United States2009Archive-it service2>1The Columbia University Libraries (CUL) web resources collection program archives selected websites in thematic areas corresponding to existing CUL collection strengths, websites produced by affiliates of Columbia University, and websites from organizations or individuals whose papers or records are held in CUL's physical archives. Began web archiving in 2008.
Cornell University LibraryUnited States2011Archive-it service1>1
North Carolina State Government Web Site Archives75United States2005Archive-it service3
Latin American Web Archiving Project76United States2005Archive-it service
Web Archiving Project for the Pacific Islands77United States2009Archive-it service4
Library of Congress Web Archives78United States2000Heritrix, Wayback, and the DigiBoard, an in-house curatorial/permissions tool680The part time workers spend a few hours per month (on average) selecting content for the collections.
Harvard LibraryUnited States2006Archive-It>10Harvard Library web collections consist of 10 curatorial units' collections,79 with variable staff contributing to both technical and curatorial activities. Harvard is also involved in collaborative web collecting through the Ivy Plus Libraries Confederation.

Harvard Library initiated web archiving activities in 2006 using a self-developed Web Archive Collection Service (WAX) and transitioned to Archive-It in 2017.80  

Web Archiving Service from California Digital Library (WAS service)81United States2005Heritrix, Wayback, NutchWAX Archived 2015-06-26 at the Wayback Machine4>1The number of hours that curators devote to the service is very variable.
Bentley Historical Library (University of Michigan) Web Archives82United States2000HTTrack, Teleport Pro, WAS service (2010-)2
University of Texas at San Antonio Web Archives83United States2009Archive-It3The number of hours varies dependent upon how the crawls are scheduled.
qumram84Switzerland2010qumram Web Archiving / Web Information Governance Software SuiteCommercial web archiving / web information governance software suite. Provides both remote harvesting as well as transactional web archiving. Allows integrations with any possible web application (WCMS, Portal, Sharepoint, eShop, custom applications) as well as repository (database, file system, electronic archive or records management system, cloud-based solution). Allows capturing and reproduction of public information as well as specific user interactions.
SAPERION85Germany2011SAPERION ECM Web Content ArchiveCommercial enterprise content management suite specializes on regulatory compliance. The product provides both harvesting as well as transactional web archiving based on the integration of qumram's86 Chronos Web Archiving Software Suite. Web content is just another channel from which content is reaching SAPERION. Others may be scanner, fax, e-mail, mobile devices, office suites or any other system creating content like ERP systems.
Bibliotheca Alexandrina's Internet ArchiveEgypt2002Heritrix, OpenWayback, WARCrefs3Current crawling interests: Egypt beyond January 25, Arab League ccTLDs

Deduplication: using WARCrefs tool to deduplicate Web archive content in BA clusterOpenWayback: handling big data indexing by using ZipNumCluster to locate a certain URI in compressed CDX files

AUEB Web Archive87Greece2010Heritrix, Wayback and NutchWAX Archived 2015-06-26 at the Wayback Machine.11This project is part of the function of the University Library.88
World Bank Web Archives89United States2007HTTrack crawler, Oracle RDBMS, Google Search Appliance03
Russian National Digital Archive90Russia2010wpull, grab-site, HTTrack crawler, ad-hoc scripts developed for social media archiving. Experimenting: Heritrix, WaybackAbout 5000 government websites collected (May 2018) using wpull and provided as archives for downloading.
Archive TeamWorldwide2009wpull, ad hoc scripts1~100Volunteer group. They partially archived GeoCities, Yahoo! Videos, Google Video and others.
WikiTeamWorldwide2011ad hoc scripts00Volunteers group. Over 20,000 wikis preserved.91
University of North Texas CyberCemetery92United States199793Heritrix, Wayback; formerly HTTrack2The CyberCemetery is an archive of government websites that have ceased operation (usually websites of defunct government agencies and commissions that have issued a final report). This collection features a variety of topics indicative of the broad nature of government information. In particular, this collection features websites that cover topics supporting the university's curriculum and particular program strengths.
archive.today94Worldwide2012Apache Accumulo, HDFS, Chromium,95 ad hoc scripts11Saves external links from community web-sites (wikis, forums, blogs, ...). Can save snapshots of Web 2.0 pages.
Greek Web Archive PortalGreece2022Heritrix, Wayback01

The Greek Web Archive Portal is a service provided by the National Library of Greece (NLG). It allows users to navigate through the historical content of the Greek Web, a separate collection of web content that includes snapshots of all .gr domain sites from 1996 up to the present day, harvested by the Internet Archive. The service was developed in collaboration with the Internet Archive and provides search either by keyword or by URL, covering web pages as well as other types of files: images, audio files, videos and PDFs. .

ΕΣΑΕΙ Web Archive – National Archiving System of Greek WebGreece2017Heritrix, Open Wayback, Solr, Netarchive Suite04The ΕΣΑΕΙ project was the first attempt to harvest all .gr content and get to know its dimensions. It was implemented by the National Library of Greece in collaboration with the Athens University of Economics and Business and it included two bulk and three selective harvests, regarding the collections of "Local Government", "News" and "Education“. NLG Curator Tool was created for the playback of the collection.
Tamiment Library and Robert F. Wagner Labor Archives at New York University96United States2007WAS Service11Archives websites related to New York City and National Labor and Left Movements. Projects include: Alternative Mass Media / News; Anarchism; Animal Rights; Arts and Cultural Left; Civil Rights and Civil Liberties; Communism, Socialism, Trotskyism; Economic and Social Justice (Including Occupy Wall Street); Education and Student Movements; Electoral Politics and Parties / Political Action (U.S. Left); Environmentalism / Green Movement; Feminism and Women's Movements; Guantanamo Bay Detention Camp & War Crimes (U.S.); Housing; Internet/Cyberspace Democracy; Jewish American Progressive & Left Activity; Labor Unions and Organizations (U.S.); Left Academia and Theory, Intellectuals and Other Notables; LGBT Rights; Other Left Activism; Peace Movements; Prisoners Rights and Political Prisoners; Progressive Policy/ Educational Organizations.
Preservica97Worldwide2012Heritrix, Preservica core product, WaybackCloud-based heterogeneous archiving service that allows ingest from multiple sources (including web archiving ingest via Heritrix). Ability to migrate content within WARC files and render in Wayback. Ingest runs as workflow so very little effort needed to run it. Developed, supported and run by Preservica.
Central State Electronic Archives of UkraineUkraine2007HTTrack, Wget2Archives interested in keeping websites and creating the thematic collections of such websites, Is presently in storage the Archives collections of websites which includes the topic of presidential elections in Ukraine from 2010 until today, about the Chornobyl disaster, the local elections, of Euro 2012 in Ukraine, UNESCO World Heritage sites in Ukraine, the 200th anniversary of the birth of Taras Shevchenko.9899
York University Libraries, York University Libraries Wayback Machine100Canada2012Browsertrix, pywb10
New York Art Resources Consortium (NYARC)101102United States2012Archive-It service1~3Collaboration among Frick Art Reference Library, Brooklyn Museum Library & Archives, and Museum of Modern Art (MoMA) Library to archive specialist art historical web resources.
Netherlands Institute for Sound and Vision (Sound and Vision) web archive103Netherlands2011Heritrix, Elasticsearch for full-text index, Drupal for front-end~7Sound and Vision has been involved in web archiving projects since 2008, starting with the EU research project LiWA.104 After a couple of pilots,105 web archiving projects were scaled up in 2014.106
Rhizome (organization)United States1999ArtBase, Webrecorder, Oldweb.Today31Rhizome operates a digital preservation program, led by Dragan Espenschied, which is focused on the creation of free, open source software tools to decentralize web archiving and software preservation practices and ensure access to its collections of born-digital art. Oldweb.Today and Webrecorder are its tools focused on web archiving specifically.107
University of Texas at Austin Libraries, Human Rights Documentation InitiativeUnited States2009Archive-It service11The University of Texas Libraries' Human Rights Documentation Initiative (HRDI) captures the websites of human rights organizations in order to provide secure access to human rights documentation in the event that these often-fragile sites are taken down.108
Kentucky Department for Libraries and ArchivesUnited States2009Archive-it, Wayback>10This collection includes captures of websites for Kentucky state agencies in the Executive, Legislative, and Judicial Branches. Stand-alone websites for boards, councils, committees, quasi-governmental agencies, and agency programs are also archived. Captures for websites dating 2000–2008 are included in this collection via a transfer to our account from the Wayback Machine.109
University of California, San Francisco LibraryUnited States2007Archive-it, Wayback, CDL WAS Service>10This collection documents the web presences of UCSF, as well as the larger health science focuses of AIDS history; anesthesiology; biotechnology and biomedical research; tobacco control and regulation; neuroscience; and computational medicine.110 Staff is one full-time digital archivist with various responsibilities in addition to web-archives.
Ivy Plus Libraries Confederation111United States2013Archive-It, Conifer11The Ivy Plus Libraries Confederation's Web Resources Collection Program is a collaborative collection development effort to build curated, thematic collections of freely available, but at-risk, web content in order to support research at participating Libraries and beyond. Participating Libraries are: Brown, Chicago, Columbia, Cornell, Dartmouth, Duke, Harvard, Johns Hopkins, MIT, Penn, Princeton, Stanford, and Yale.112 Collections are accessible via Archive-It.
Malaysian Government Web Archive (MyGWA)Malaysia2017Wayback, WGET, WPULL>10National Archive of Malaysia started to archive websites of public sector in Malaysia since 2017.
HTTP ArchiveCrawls popular websites for Data analysis113
National Library of Medicine (U.S.)United States2009Archive-It, Conifer~8NLM web collecting is guided by the Collection Development Guidelines of the National Library of Medicine and other strategic collecting efforts. Collections include Global Health Events, the Opioid Epidemic, HIV/AIDS, Health and Medicine Blogs, and NLM's own web presence. 114
Smithsonian Libraries and Archives (U.S.)115United States2000Heritrix, Archive-It, Webrecorder, Conifer, Browsertrix, other5The Smithsonian Libraries and Archives collects websites and social media accounts that document the history of the Institution.

116

Ghost Archive117United States2021118Webrecorder1
Common Crawl119United States2008Apache Nutch, Apache Tika, pywb, in-house tools33

Archived data

NameArchived Contents (millions)Disk Space Occupied (TB)Archive FormatTLD/Broad CrawlsSelective Crawls (Yes/No)Comments
EU Web Archive120WARC.EUY.EU 250 websites in europa.eu domain and subdomains, crawled once per quarter + ad hoc crawls on request of website owners (selective crawls). Status Feb 2019.
Australia's Web Archive12111000600WARC.AUY.AU crawls (1996–2018): 10.15 billion files (530 TB). Selective crawls (1996–2019): 755 million files (44 TB). AGWA (2011–2018): 525 million files (58 TB).
Our digital island, a Tasmanian Web Archive1220.336HTTrackYPreserves online content related to Tasmania. ODI has operated since its inception under the assumption that web sites fall within the definition of 'Book' in the Tasmanian Library Act 1984.123 Thus, no permission to capture from publishers is required.
Webarchive Austria1244095164ARC.AT, .wien, .tirolYA copy of the data is stored in a high security data storage unit.
Deutsche Nationalbibliothek125WARC.DEYOnly one experimental TLD crawl.
DILIMAG (Digital Literature Magazines)1260.030.996ARCProject from 2007-03-01 until 2010-12-23. The project DILIMAG for collecting, describing and archiving of digital German literary magazines.
Bibliothèque et Archives nationales du Québec (BAnQ)12716731ARC/WARCYHarvesting began in 2009. Selective crawls of Quebec websites.
Government of Canada Web Archive (GCWA)128175070ARC/WARC.GC.CAYWeb archiving at Library and Archives Canada (LAC)129 began in 2005 and concentrated on collecting the federal government web presence and capturing the federal elections, the Olympics, and Canadian commemorative events. Thematic web collections of Canadiana research interest have been curated as an ongoing program activity since 2009.
Web Information Collection and Preservation - WICP (Chinese Web Archive)130.GOV.CNYHarvest of the web pages about the events that have great influence on the society, economy and so on, and the sites in 'gov.cn' domain.
Croatian Web Archive (Hrvatski arhiv weba - HAW)13123113Mirror, WARC.HRYSince 2004 selective harvesting over 5000 web resources. Since 2011 annual harvesting of national .hr domain as well as thematic harvesting. All archived content is publicly available via HAW website.
Webarchiv (National Library of the Czech Republic)1329412350ARC/WARC.CZYHarvesting began in 2001.
Netarkivet133/ The Danish web archive (Royal Danish Library)36000634ARC/WARC.DKY+36 billion objects:
  • html : 19077101525
  • image : 5859756918
  • other : 4080719309
  • text : 757030275
  • pdf : 97318057
  • audio : 8166680
  • video : 7085143
  • word : 47510
  • powerpoint : 5660
  • excel : 4721
  • Snapshot harvesting
  • Selective harvesting
  • Event harvesting
  • Special harvesting
Estonian Web Archive13487456ARC/WARC.EEYArchive consists selective, event and topical crawls since 2010. Whole national domain crawls are done yearly since 2015. Besides TLD .ee, Estonia related web content is harvested from other TLD-s like .eu, .org, .com etc.
Finnish Web Archive1354300300ARC/WARC / .json / .mp4.FI, .AXYAlso crawls content hosted on machines physically located in Finland, independently from their domain.
BnF - Web Legal Deposit13648 0001 800ARC/WARC.FR + all sites hosted in FranceYBnF is making copies137 of all sites in the .FR TLD, as well as all sites hosted and produced in France, ignoring both the Robots exclusion standard and the licenses of the documents.
BnL Web-Archive54341WARC.LUYThe BnL conducts 2 domain crawls per year, as well as event-based and selective crawls.
Ina (Institut National de l'Audiovisuel)1381058002359DAFFYAs of 2021-03-08

DAFF handles full content deduplication, so the size on disk takes into account compression and deduplication; the equivalent disk storage in compressed ARC format would be approximately 10 PB

E-diaspora (Télécom ParisTech, FMSH)139103013DAFFYDAFF handles full content deduplication, so the size on disk takes into account compression and deduplication; the equivalent disk storage in compressed ARC format would be approximately 51 TB
Internet Memory Foundation180WARCCan be done by partnersYFormerly European Archive.140 Collaborate with Internet Memory Research, which provides the ArchiveTheNet Service (ATN Service). Selective crawls (140 TB), Domain crawls (40 TB), expect to grow to 1PB in 2012. New datacenter and a new crawler in 2012.
Bibliotheksservice-Zentrum Baden-Württemberg1419WARCYWebsites of about 20 cities, municipalities, districts + their associated corporations, and state libraries are collected by BSZ in commission within various Archive-It collections. Public access. Data storage: San Francisco (Archive-It) as well as backup with Baden-Wuerttemberg storage infrastructure.
Web archive of the German Bundestag142YGerman Federal Parliament. Selective. At regular intervals or at certain events are snapshots (snapshots) of www.bundestag.de and other web presences of the German Bundestag made. These are available in the web archive to date available.
Iceland143
Palestine Web ArchiveARC/WARC.PSY.PS crawls (2006–2011): Pilots Crawls (500 GB). Selective crawls (1996, 2011)
Web Archiving Project (WARP), The National Diet Library, Japan144126701313WARC-Yas of March 202315 TB of selective crawls based on permission (2002–2010). Started the web archiving of official institution sites based on the legislation from April 2010.
National Library of Korea - OASIS (Online Archiving & Searching Internet Resource)14524YRequires consent before archiving. Targets 56,401 Websites. Web archiving is managed under Digital resource management systems. In 2011 web archiving system will be rebuilt.
Koninklijke Bibliotheek14640736WARCYSelective crawls (annually) of ca. 20.400 sites (December 2020)
New Zealand Web Archive1474300260ARC/WARC.NZY.NZ crawls (2008–2023): 4+ billion URLS (260TB). Selective crawls 33,500 websites (ca. 9TB). Legal deposit covers born digital material (including websites).
The National Library of Norway148
Arquivo.pt14915021 1181 455ARC/WARCFocused on .PT but also other domainsY.PT domain crawls and integration of external collections since 2007 and daily crawls of a selection of online publications of since 2010. Selective crawls related to national events such as elections or international content related to science such as websites about Research & Development projects funded by the European Union.
Web archive of Cacak1510.2550.013HTTrackYSelective crawls of 130 sites related to the city of Cacak. Collaboration with the Webarchiv team from the National Library of the Czech Republic.
Web Archive Singapore152WARC.SGYSelective crawls of Singapore-related sites and .SG domain archiving.
Digital Resources (University Library in Bratislava)1531 92189WARC.SK + other TLDs with Slovacical contentYHarvesting of the Slovak web started in 2015. Since then ULB has performed six (2016 - 2021) full-domain harvests (harvesting of the national .SK domain), multiple selective crawls and thematic crawls (topic centered and event devoted campaigns).
Slovenian Web Archive15430WARCSelective crawls since 2007, national domain crawls since 2014.
Archivo de la Web Española1552539117WARC.ESYDomain .ES crawls (2009–2013): 2.421 million files (111 TB) in collaboration with Internet Archive. Selective crawls (2014–2015): 119 mil files (6 TB). About 30 news media sites crawled every day. Not launched publicly yet.
PADICAT: The Web Archive of Catalonia15662032,5ARC/WARC.CATYIn accordance with the general trend, the archive model is a hybrid system consisting: Mass compilation of open-access digital resources published on the Internet (.cat); Systematic archiving of the web site output of Catalan organizations; Fostering of lines of research through themed integration of the digital resources pertaining to specific events in Catalan public life (elections, museums, etc.)
Basque Digital Heritage Archive157210.8ARCY
Sweden (Kulturarw3)1585700360Multipart MIME.se, Swedish .nu and geolocation for other tld'sYBulk crawls approximately twice a year.Selective crawls of about 140 newspapers every day.
Aleph Archives159>10000000>25Native HTML, WARC, WARC2, ARC and HTTrack to WARC migration toolsYEnterprise-grade automatic web archiving platform for online capture and preservation. Support eDiscovery with powerful and qualitative technology.

Aimed to corporations, institutions and agencies seeking to capture, preserve and leverage their Web content; dynamic websites, wikis, social media, forums, comments, disclaimers, and ads, for compliance (FDA, FINRA, FSA, SEC, FOIA), marketing or pure preservation purposes.

Web Archive Switzerland16080ARC, WARCYMainly selected .ch crawls
NTU Web Archiving System, NTUWAS16120014Y
Web Archive Taiwan162
The UK Web Archive16320.6WARCYSelective crawls with previous permission. Now also conducting wholesale UK domain-scale crawls under Non-Print Legal Deposit legislation, enacted April 2013. This content will only be available on premises controlled by one of the six legal deposit libraries. The UKWA is a spin-off from the UK Web Archiving Consortium that ended in 2007.
Hanzo Archives1647WARCYCommercial web archiving services and appliances, for government and corporations whose compliance or legal obligations / needs extend to their websites, intranet, and social media. Many 'dark' archives across Europe and USA.
UK Government Web Archive1651000 +150ARC

WARC post July 2017

Between 2003 - 2005 the Internet Archive undertook the technical side of web archiving on behalf of The UK Government Web Archive. Between 2005 - July 2017 the technical side of the web archiving service was contracted out to the Internet Memory Foundation. From July 2017 MirrorWeb took over the contract and moved the entire archive to the cloud. The UK Government Web Archive was part of the UK Web Archiving Consortium from 2004 - 2009.
Internet Archive (provides Archive-it service)16669000021000WorldwideYProvides the Archive-it service and leads the Archive-access project (Internet Archive ARC access tools). Collection is mirrored at Bibliotheca of Alexandrina in Egypt.
Columbia University Libraries Web Resources Collection Program16772350.4ARC/WARCYSelective crawls with permission or notification. Thematic collections in: Human rights; New York City built environment; New York City religions; Resistance. Also capture Columbia University web domain.
North Carolina State Government Web Site Archives16851.53.8WARCY
Latin American Web Archiving Project169Y
Web Archiving Project for the Pacific Islands1705.5ARC/WARCYIncludes sites of 18 countries.
Library of Congress Web Archives1717741420ARC/WARCYFormerly MINERVA. Selective crawls with notification and permission; primarily event and thematic collections.
Harvard University Library: the Web Archive Collection Service (WAX)172190.661ARCYSelective crawls with no previous authorization.
Web Archiving Service from California Digital Library (WAS service)17321625.2ARC/WARCCan be done by partnersYProvides Web Archiving Service (WAS) to partners worldwide. Was developed at the California Digital Library.
Bentley Historical Library (University of Michigan) Web Archives17434.52.6ARC/WARCYWAS service since 2010.
University of Texas at San Antonio Web Archives175261.135ARC/WARCYUniversity administration, faculty and student sites; as well as selective captures on San Antonio and South Texas subject areas, including San Antonio organizations; San Antonio Online Journals and Blogs; Tejano and Conjunto music; Gay, Lesbian, Bisexual, Transgender and Queer Related Web sites in Texas, San Antonio and the Rio Grande Valley; Immigration/Borderlands; Mexican Cooking Blogs; San Antonio Restaurants; Renewable Energy in Texas; Rio Grande Valley Organizations; and Rio Grande Watershed and Texas Water Issues .
AUEB Web Archive1763WARCaueb.grNThe amount of data crawled from the domain aueb.gr ranges between 10GB and 14.9GB . The data is stored on disk compressed and requires between 8.8GB and 9.7GB, resulting in space savings between 12% and 35%. In the case of new crawl, we can only store on disk the Web pages that change since the previous crawl. Consequently, we crawled 13.1GB from the domain aueb.gr, but we only stored on disk 1.6GB, resulting in space savings of 88%.
World Bank Web Archives1770.143HTTrackno, so farY450 sites with historical or research value have been harvested since 2007, each archived before being taken offline or before a major upgrade.
University of North Texas CyberCemetery1780.887WARC.govY
Bibliotheca Alexandrina's Internet Archive800001000ARC/WARCEgyptian news and politicsY
York University Digital Library1790.435WARCyorku.ca + faculty requestsY
Netherlands Institute for Sound and Vision (Sound and Vision) web archive180ARC/WARCYAmong other av-heritage, Sound and Vision is tasked with archiving programmes broadcast by Dutch Public Broadcasters. Therefore, an important part of the web archive consists of websites of public broadcaster related to these programmes. Furthermore, websites are archived that do not have a direct link to the collection, but that are of interest in a broader, media-historical way.181 Examples are websites of commercial broadcasters.
Kentucky Department for Libraries and Archives30.3007WARCY
University of California, San Francisco Library12.50.587ARC/WARCYWebsites requested by staff and faculty, and growing list attempting to capture all UCSF websites as comprehensively as possible.
Ivy Plus Libraries Confederation34716ARC/WARCYSelective crawls with notification. Thematic collections in politics and political protests, architecture, composers, design, gaming, geology, webcomics, documentary films, art, religion, sexuality, climate change, and more.182
Malaysian Government Web Archive (MyGWA)10WARC.GOV.MYYCrawls only Malaysian public sector websites only. View is by subject, i.e. administration, economy, security, and social.
National Library of Medicine (U.S.)1229.1WARCY
Smithsonian Libraries and Archives (U.S.)10WARCY
Common Crawl250 0008 000ARC/WARCworldwideY

Access methods

NameURL history (Yes/No)Meta-data (catalog/advanced) search (Yes/No)Full-text search (Yes/No)Memento Compliance (No/Native/Proxy)Comments
EU Web Archive183YYYFreely accessible to all via [2]
Australia's Web Archive184YYYNoSelected sites are publicly available through a directory structure. Domain harvests are not. The PANDORA Archive is indexed and searchable through the NLA's single search service Trove.185 The Australian Domain Harvests are full-text indexed but are not currently publicly available. The Australian Government Web Archive is searchable by URL and full-text indexes through its portal.
Our digital island, a Tasmanian Web Archive186YYNNoPresents thumbnails generated through Html To Image supplemented in HTTrack. Information is organized in directory: A-Z Subject listing, A-Z Title listing.
Webarchive Austria187YNYNoPossible to search online for versions either by URL or in (partial) fulltext. The websites are only accessible on special terminals at the Austrian National Library. Has bookmarking feature which allows to save versions online and recall them at the library webarchive terminals.
Deutsche Nationalbibliothek188YYYNoOnly accessible in the reading rooms of the German National Library. The metadata is included in the publicly accessible library catalogue.
DILIMAG (Digital Literature Magazines)189YYNNoMetadata are publicly available, for the archived versions provides free or restricted access depending on the right holders agreement. Full-text search is implemented in the new version (online since February 2015).
Bibliothèque et Archives nationales du Québec (BAnQ)190YNNNoProvides access according to partner policy.191
Government of Canada Web Archive (GCWA)192YYYProxyLibrary and Archives Canada193 makes its federal government web archives (materials under Crown Copyright) publicly accessible. Indices are available for discovering Canadian federal web resources alphabetically by authoring organization and by URL. Full text indexing is based on Lucene.
Web Information Collection and Preservation - WICP (Chinese Web Archive)194YNoArchive content is only available in intranet in National Library of China. Some collections are publicly available, with meta-data search and browsable by collection.
Croatian Web Archive (Hrvatski arhiv weba - HAW)195YYYProxyFull open access.
Webarchiv (National Library of the Czech Republic)196YNNNDue to copyright restrictions, only a limited number of archived websites for which agreements were signed with the publishers is available online. For other resources you can find out whether a given website was archived and the number of harvested versions. Unlimited access to all resources in Webarchiv is available from public terminals in the National Library.
Netarkivet.dk197YNYNoOnline access granted only to researchers through a Citrix login to free text search based on Solr and a proxy solution that accesses an archive through the Wayback. It has established a framework for running batch jobs with the possibility of data mining.
Estonian Web Archive198YYNNoPublic access to archived content is allowed only with a permission of the copyright owner. Full archive is accessible merely to the web archive personnel.
Finnish Web Archive199YN15% of material.NoURL search but on-site access to content. Full-text search is available to 15% of material.
BnF - Web Legal Deposit200YN15% of the collectionNoAccessible to authorized users through the reading rooms of the BnF Research Library located in Paris and Avignon and in partner libraries in regions and overseas territories. Wayback was customized and interface was translated to French. Full Text search only available on specific collections (i.e. news, Covid-19, the early French web). Builds special collection galleries based on a selection from the archive on a given topic.
Ina (Institut National de l'Audiovisuel)201YYYNoFull text indexing is based on Lucene. To accommodate results from frequent crawls (several crawls per hour for some pages) clustering is operated to handle similar versions of pages
E-diaspora (Télécom ParisTech, FMSH)202YNNNo1381 sites are currently crawled to build an archive on migrants usage of the web, social studies researchers have launched a long run project based on this archive Ina is handling crawls and storage
Internet memory FoundationYYYNoProvides access and search services according to partners policy.
Bibliotheksservice-Zentrum Baden-Württemberg203YYYNativeArchived websites accessible via Archive-It; integrated in the SWB union catalog. Full open access for major part of snapshots, some restricted by IP.
Web archive of the German Bundestag204YNNNoWeb archive itself are snapshots of www.bundestag.de and other websites. Navigation is possible by clicking on the years.205
Iceland206Native
Palestine Web ArchiveNYNNoStill in development and pilots
Web Archiving Project (WARP), The National Diet Library, Japan207YYYNativeAll the archived websites are available on the premises. 85% of them is also accessible on the Internet with the permission of webmasters.
National Library of Korea - OASIS (Online Archiving & Searching Internet Resource)208YYYNo100% of the archive is indexed. Enables search by topic classification (e.g. Religion, Science, Arts). Search available.209
Koninklijke Bibliotheek210YNNNoThe web archive is accessible on terminals in the KB reading rooms to full members ('onsite').
New Zealand Web Archive211YYYNativeDomain harvests: available to selected staff using Pywb and limited to URL searches. Selective harvests: each website is described in the catalogue (providing subject, author, title and URL searches) and can be viewed by the public via the Internet by clicking on the link to the archived copy. A small subset of the selective harvests are accessible using full-text search.
The National Library of Norway212NYNoSites are integrated in the Catalog. Left bar enables facet navigation with drill-down.213
Arquivo.pt - the Portuguese web-archive214YYYNativeA full-text and URL search service is freely available. Image search is also supported. Archived data can be mined through an Hadoop platform or publicly available Application Programming Interfaces to develop web applications.
Web archive of Cacak215NNNNoPlans to develop a search engine in the future. One bad characteristic of HTTrack is that it renames files during the archiving, so the original structure of the website is lost, as well file names.
Web Archive Singapore216YYYNoThe collection is viewable at the National Library, Singapore with selected content cleared by copyright owners available online.
Digital Resources (University Library in Bratislava)217YYNNoIt is possible to find out whether a website was archived and how many harvested versions exist. Due to the copyright restrictions only a limited number of archived websites is publicly available (based on agreements with publishers). The access to other archived resources is available locally in the University Library in Bratislava.
Slovenian Web Archive218YNYNoThe archive of selective crawls is publicly accessible. Use is possible by browsing and full-text search. National domain crawls are not accessible yet but will be in the future.
Archivo de la Web Española219Y (Future)Y (Future)Y (Future)NoPlan to provide access on-site in the short-medium term.
PADICAT: The Web Archive of Catalonia220YYYNoFull open access.
Basque Digital Heritage Archive221YYYNo
Sweden (Kulturarw3)222YNNNoPublic access through dedicated machines in the library building.
Aleph Archives223YYYNoEnterprise-grade automatic web archiving platform for online capture and preservation. Support eDiscovery with powerful and qualitative technology.

Aimed to corporations, institutions and agencies seeking to capture, preserve and leverage their Web content; dynamic websites, wikis, social media, forums, comments, disclaimers, and ads, for compliance (FDA, FINRA, FSA, SEC, FOIA), marketing or pure preservation purposes.

Web Archive Switzerland224YYYNoWeb Archive Switzerland is the collection of the Swiss National Library containing websites with a bearing on Switzerland. Web Archive Switzerland has been integrated in e-Helvetica,225 the access system of the Swiss National Library, giving access to the entire digital collection. So you can do full text searching of a part of the Web Archive. But the archived versions of websites can only be viewed in the reading rooms of the Swiss National Library and of our partner libraries who help us build the collection of Swiss websites. But you can view the metadata of the archived versions from anywhere.
NTU Web Archiving System, NTUWAS226YYYNoPresents page thumbnails, archived pages mapped to geographical locations.
Web Archive Taiwan227YYYNo
PageFreezer228YYYNoEnterprise Class On Demand service to archive and replay websites, blogs, Ajax, Flash, video, audio & social media for litigation protection, eDiscovery and regulatory compliance with FDA, FINRA, FSA, SEC, SOX, Federal Rules of Evidence and records management laws. Used by government agencies and public listed corporations in Pharmaceutical, Food, Finance, Healthcare and Retail industry.
The UK Web Archive229YYNNative
Hanzo Archives230YYYNoCommercial web archiving services and appliances. Access includes full-text search, annotations, redaction, URL/History, archive policy and temporal browsing, and configurable metadata schema for advanced e-discovery applications. Used in government and corporations whose compliance or legal obligations / needs extend to their websites, intranet, and social media. Many 'dark' archives across Europe and USA.
UK Government Web Archive (UKGWA)231YYYNativeFull text search is operational on the UK Government Web Archive (UKGWA).232 Users can browse the collection using a full A-Z list of all sites233
EU Exit Web ArchiveYYYNativeFull text search is operational on the EU Exit Web Archive
Internet Archive (provides Archive-it service)234YYYNativeURL history is available for all archived data. Meta-data and full-text search only for selected crawls. Until 2002 had a mining platform for research composed by Alexa Shell Perl Tools

av_tools and p2 platform for parallel processing.235 It was replaced by a simpler access and direct method that enables automatic access to files but no platform for processing.236

Columbia University Libraries Web Resources Collection Program237YYYNoAccessible through Archive-it service.238
North Carolina State Government Web Site Archives239YYYNoAccessible through Archive-it service.240
Latin American Web Archiving Project241YYYNoContent can be accessed via full-text search, or by browsing by country or by specialized sample collection.
Web Archiving Project for the Pacific Islands242YYYNoSupported by Archive-it service.
Library of Congress Web Archives243YYNProxyAccess provided via LCWA. Records in MODS (Metadata Object Descriptive Schema) format.
Harvard University Library: the Web Archive Collection Service (WAX)244YYYNo
Web Archiving Service from California Digital Library (WAS service)245YYYNoAccess for private study, scholarship and research. Most archives built with WAS have not yet been published because it is up to the partners to decide if they want to provide access. There are 16 partners using the service and they have created over 80 web archives, only 30 are publicly accessible. NutchWAX performance did not permit full archive search. Upcoming transition to SOLR will permit both full archive and collection-specific full text search.
Bentley Historical Library (University of Michigan) Web Archives246YYYNoPowered by the WAS from the California Digital Library.247 Access is public but usage is restricted for private study, scholarship and research.
University of Texas at San Antonio Web Archives248YYYNativeAccessible through Archive-it service249 and the Texas Archival Repositories Online database250
AUEB Web Archive251YYYNo
World Bank Web Archives252YYYNoURL history provided via open access to collection via standard web browser. Full text search is only available within each individual site. Search on metadata is available via advanced search within Web Archives collection.
University of North Texas CyberCemetery253NYYNo
Tamiment Library and Robert F. Wagner Labor Archives at New York University254YYYNoAccess is provided through the WAS service255 as well as through finding aids that are searchable through NYU's finding aids portal.256
York University Digital Library257YYY
Netherlands Institute for Sound and Vision (Sound and Vision) web archive258YYNSelected sites for which agreements have been made are publicly available.259 Full text indexing is done with Elasticsearch, the front-end is built in Drupal.
Kentucky Department for Libraries and ArchivesYYYNoFull open access
University of California, San Francisco LibraryYYYNative (through IA)Both capture and access for archived content are provided by the Archive it service, so all capabilities are same as for Archive-It
Ivy Plus LibrariesYYYNoAccessible through Archive-It service.
Malaysian Government Web Archive (MyGWA)YYYNoOpen Access
National Library of Medicine (U.S.)YYYAccess is provided through Archive-It
Smithsonian Libraries and Archives (U.S.)YYYAccess is provided through Archive-It

References

  1. Daniel Coelho Gomes; João Miranda; Miguel Costa (25–29 September 2011). "A survey on web archiving initiatives". International Conference on Theory and Practice of Digital Libraries 2011. Springer. Retrieved 23 October 2012. /w/index.php?title=Daniel_Coelho_Gomes&action=edit&redlink=1

  2. "Arkiwera - Hem - English". 2023-07-09. Retrieved 2024-06-09. https://arkiwera.se/wp/en/

  3. "EU Web Archive - EU Web Archive - Publications Office of the EU". EU Web Archive. Retrieved 2024-06-09. https://op.europa.eu/en/web/euwebarchive

  4. "Alabama Department of Archives and History Digital Collections". digital.archives.alabama.gov. Retrieved 2018-10-28. http://digital.archives.alabama.gov/

  5. "Pandora — Australia's Web Archive". nla.gov.au. May 1999. Retrieved 2013-11-17. http://pandora.nla.gov.au/

  6. "PROMISE project". Retrieved 2020-01-31. https://www.kbr.be/en/projects/promise-project/

  7. "Royal Library of Belgium". www.kbr.be. Retrieved 2020-01-31. https://www.kbr.be/

  8. "State Archives of Belgium". www.arch.be. Retrieved 2020-01-31. http://www.arch.be/index.php?l=en

  9. "Research Group for Media, Innovation and Communication Technologies". www.ugent.be. Retrieved 2020-01-31. https://www.ugent.be/ps/communicatiewetenschappen/mict/en

  10. "Ghent Centre for Digital Humanities". www.ghentcdh.ugent.be. Retrieved 2020-01-31. http://www.ghentcdh.ugent.be/

  11. "Research Centre in Information, Law and Society". www.crids.eu/. Retrieved 2020-01-31. http://www.crids.eu/

  12. "Haute-École Bruxelles-Brabant". he2b.be/. Retrieved 2020-01-31. http://he2b.be/

  13. "Saving the web: the promise of a Belgian web archive". KBR. Retrieved 2020-01-31. https://www.kbr.be/en/agenda/saving-the-web-the-promise-of-a-belgian-web-archive/

  14. "KBR web archive". Retrieved 2020-01-31. https://www.kbr.be/

  15. "KBR". www.kbr.be. Retrieved 2020-01-31. https://www.kbr.be/

  16. "PROMISE project". Retrieved 2020-01-31. https://www.kbr.be/en/projects/promise-project/

  17. "Montana Code Annotated 2019". https://www.leg.mt.gov/bills/mca/title_0220/chapter_0010/part_0020/section_0120/0220-0010-0020-0120.html

  18. "Stillio". Stillio.com. 2019-05-16. Retrieved 2019-05-16. https://www.stillio.com/

  19. "PageFreezer". pagefreezer.com. 2011-01-20. Retrieved 2013-11-17. http://www.pagefreezer.com/

  20. "OoCities - Geocities Archive / Geocities Mirror". www.oocities.org. Retrieved 2019-12-25. https://www.oocities.org/

  21. "Archive Wikiwix". {{cite web}}: Check |archive-url= value (help)CS1 maint: url-status (link) https://archive.wikiwix.com

  22. https://wikiwix.com. {{cite web}}: Missing or empty |title= (help) https://wikiwix.com

  23. "Webarchive Austria". Onb.ac.at. Retrieved 2020-12-11. https://webarchiv.onb.ac.at

  24. "Deutsche Nationalbibliothek". dnb.de. Archived from the original on 2018-05-08. Retrieved 2015-09-18. https://web.archive.org/web/20180508054405/http://www.dnb.de/EN/Netzpublikationen/Webarchiv/webarchiv_node.html

  25. "DILIMAG (Digital Literature Magazines". dilimag.literature.at. Retrieved 2013-11-17.[permanent dead link‍] http://dilimag.literature.at/

  26. "Bibliothèque et Archives nationales du Québec (BAnQ)". banq.qc.ca. Retrieved 2013-11-17. http://www.banq.qc.ca/collections/collections_patrimoniales/archives_web/index.html?language_id=1

  27. "Library and Archives Canada". Library and Archives Canada. 28 May 2020. Retrieved 2023-06-10. https://library-archives.canada.ca/eng

  28. "Library and Archives of Canada Act, S.C. 2004, c.11". Justice Canada. 2004-04-22. Retrieved 2014-12-16. https://laws-lois.justice.gc.ca/eng/acts/L-7.7/FullText.html

  29. "Library and Archives Canada". Library and Archives Canada. 28 May 2020. Retrieved 2023-06-10. https://library-archives.canada.ca/eng

  30. "Legal deposit at Library and Archives Canada". Library and Archives Canada. 15 June 2022. Retrieved 2023-06-10. https://library-archives.canada.ca/eng/services/publishers/legal-deposit/pages/deposit-digital-publications.aspx

  31. "Web Information Collection and Preservation - WICP (Chinese Web Archive)" https://web.archive.org/web/20110810171906/http://210.82.118.162:9090/webarchive/

  32. "Croatian Web Archive (Hrvatski arhiv weba - HAW)". Haw.nsk.hr. 2004-10-01. Archived from the original on 2013-07-13. Retrieved 2013-11-17. https://web.archive.org/web/20130713080922/http://haw.nsk.hr/

  33. "Webarchiv (National Library of the Czech Republic)". webarchiv.cz. Retrieved 2015-10-30. http://webarchiv.cz/en

  34. "Netarkivet". www.kb.dk (in Danish). Retrieved 2024-06-09. https://www.kb.dk/find-materiale/samlinger/netarkivet

  35. "Estonian Web Archive". National Library of Estonia. 2014-01-09. Retrieved 2014-01-09. http://www.nlib.ee/index.php?id=21581

  36. "Finnish Web Archive". kansalliskirjasto.fi. Retrieved 2013-11-17. http://verkkoarkisto.kansalliskirjasto.fi

  37. "Bibliothèque nationale de France - Web Legal Deposit". Bnf.fr. 2010-08-17. Retrieved 2013-11-17. http://www.bnf.fr/en/professionals/digital_legal_deposit.html

  38. "Ina (Institut National de l'Audiovisuel)" (in French). Ina.fr. Retrieved 2013-11-17. http://www.Ina.fr

  39. "Bibliotheksservice-Zentrum Baden-Württemberg". Bsz-bw.de. Retrieved 2013-11-17. http://www.bsz-bw.de/index.html

  40. "Web archive of the German Bundestag". Webarchiv.bundestag.de. Retrieved 2013-11-17. http://webarchiv.bundestag.de/cgi/kurz.php

  41. "Iceland - VEFSAFN". Vefsafn.is. Retrieved 2013-11-17. http://vefsafn.is/index.php?page=english

  42. "Digital Collections". National Library of Ireland Annual Report. 2011.

  43. "Web Archiving Project (WARP), The National Diet Library, Japan". da.ndl.go.jp. Retrieved 2013-11-17. https://warp.da.ndl.go.jp

  44. "National Library of Korea - OASIS (Online Archiving & Searching Internet Resource)". Oasis.go.kr. 2013-08-01. Archived from the original on 2013-10-31. Retrieved 2013-11-17. https://web.archive.org/web/20131031102219/http://www.oasis.go.kr/intro_new/intro_overview_e.jsp

  45. "WebART (Web Archive Retrieval Tools)". http://www.webarchiving.nl/

  46. "Latvijas Nacionālā bibliotēka - Rasmošana". http://webarhivs.lndb.lv/

  47. "New Zealand Web Archive". Natlib.govt.nz. Retrieved 2021-02-26. https://natlib.govt.nz/collections/a-z/new-zealand-web-archive

  48. "Nettarkivet". Nasjonalbiblioteket (in Norwegian Bokmål). Retrieved 2019-12-25. https://www.nb.no/samlingen/nettarkivet/

  49. "The National Library of Norway". IIPC. Retrieved 2019-12-25. http://netpreserve.org/about-us/members/nasjonalbiblioteket-national-library-norway/

  50. "Arquivo.pt - search pages from the past!". arquivo.pt. Retrieved 2024-06-09. https://arquivo.pt/?l=en

  51. "Arquivo.pt - the Portuguese web-archive: search pages from the past". Foundation for National Scientific Computing (FCCN). 13 August 2013. Retrieved 13 August 2013. https://arquivo.pt/?l=en

  52. Web archive of Cacak[permanent dead link‍]. digital.cacak.dis.rs http://digital.cacak-dis.rs/english/web-archive-of-cacak/

  53. "Web Archive Singapore". eresources.nlb.gov.sg/webarchives. Retrieved 2023-02-03. https://eresources.nlb.gov.sg/webarchives

  54. Digital Resources (Digital Resources Archive of the University Library in Bratislava)[1] https://www.webdepozit.sk/?lang=en

  55. "Slovenian Web Archive". National and University Library of Slovenia. Retrieved 2018-02-02. http://arhiv.nuk.uni-lj.si/

  56. Biblioteca Nacional de España. "Archivo de la web española". Archived from the original on 2014-02-23. Retrieved 2014-02-20. https://web.archive.org/web/20140223093113/http://www.bne.es/es/LaBNE/ArchivoWeb/

  57. National Library of Catalonia (16 November 2012). "PADICAT: The Web Archive of Catalonia". National Library of Catalonia. Retrieved 16 November 2012. http://www.padicat.cat/en/

  58. Kai Oswald Seidler. "Basque Digital Heritage Archive (ONDARENET)". euskadi.net. Archived from the original on 2012-12-20. Retrieved 2013-11-17. https://archive.today/20121220171016/http://www.ondarenet.kultura.ejgv.euskadi.net/

  59. "Kulturarw3 - Kungliga biblioteket" (in Swedish). Kb.se. 2020-01-01. Retrieved 2021-05-04. https://www.kb.se/hitta-och-bestall/hitta-i-samlingarna/kulturarw3.html

  60. AAW Designs. "Aleph Archives". aleph-archives.com. Retrieved 2013-11-17. http://aleph-archives.com/

  61. "Expatriate Archive Centre Blog Archive". xpatarchive.com. Retrieved 2020-02-03. https://xpatarchive.com/initiatives/eac-blog-archive/

  62. "Web Archiving Bucket". webarchivingbucket.com. Retrieved 2013-11-17. http://webarchivingbucket.com/

  63. "Web Archive Switzerland". E-helvetica.nb.admin.ch. Retrieved 2013-11-17. https://www.e-helvetica.nb.admin.ch/pages/user/webarchive/webArchiveSearch.jsf?BITfw2Ctx=xUSHZSYm2d19oyAFE&lang=en

  64. "NTU Web Archiving System, NTUWAS". ntu.edu.tw. Retrieved 2013-11-17. http://webarchive.lib.ntu.edu.tw/eng/default.asp

  65. "Web Archive Taiwan". ncl.edu.tw. Retrieved 2013-11-17. http://webarchive.ncl.edu.tw/nclwa98Front/

  66. "UK Web Archive". 2005-07-07. Retrieved 2013-11-17. https://www.webarchive.org.uk/ukwa/

  67. "UK Government Web Archive (UKGWA)". nationalarchives.gov.uk. Retrieved 2015-10-30. http://www.nationalarchives.gov.uk/webarchive/

  68. "EU Exit Web Archive - The National Archives". webarchive.nationalarchives.gov.uk. Retrieved 2024-06-09. https://webarchive.nationalarchives.gov.uk/eu-exit/

  69. "EU Exit Web Archive - The National Archives". webarchive.nationalarchives.gov.uk. Retrieved 20 February 2021. Text was copied from this source, which is available under an Open Government Licence v3.0. © Crown copyright. https://webarchive.nationalarchives.gov.uk/eu-exit/

  70. "EU Exit Web Archive - The National Archives". webarchive.nationalarchives.gov.uk. Retrieved 20 February 2021. Text was copied from this source, which is available under an Open Government Licence v3.0. © Crown copyright. https://webarchive.nationalarchives.gov.uk/eu-exit/

  71. "MirrorWeb: Your unified compliance platform". www.mirrorweb.com. Retrieved 2024-06-09. https://www.mirrorweb.com/

  72. "Internet Archive (provides Archive-it service)". 2001-03-10. Retrieved 2013-11-17. https://archive.org/

  73. "Web Archiving | Stanford University Libraries". Retrieved 2014-03-26. http://library.stanford.edu/projects/web-archiving

  74. "Columbia University Libraries Web Resources Collection Program". columbia.edu. Retrieved 2019-10-01. https://library.columbia.edu/collections/web-archives.html

  75. "North Carolina State Government Web Site Archives". ncdcr.gov. Retrieved 2013-11-17. http://webarchives.ncdcr.gov/

  76. "Latin American Web Archiving Project". utexas.edu. Retrieved 2013-11-17. http://lanic.utexas.edu/project/archives

  77. Dawrs, Stu. "Research Guides: Web Archiving Project of the Pacific Islands: Introduction". guides.library.manoa.hawaii.edu. Retrieved 2019-12-25. https://guides.library.manoa.hawaii.edu/pacificwebarchive

  78. "Library of Congress Web Archives". Loc.gov. Retrieved 2013-11-17. https://www.loc.gov/webarchiving/

  79. "Web Archives Collections". preservation.library.harvard.edu. Retrieved 2021-02-22. https://preservation.library.harvard.edu/web-archives-collections

  80. "Web Archiving". preservation.library.harvard.edu. Retrieved 2021-02-22. https://preservation.library.harvard.edu/web-archiving

  81. "Web Archiving Service from California Digital Library (WAS service)". cdlib.org. 2013-10-16. Retrieved 2013-11-17. http://webarchives.cdlib.org/

  82. "Bentley Historical Library (University of Michigan) Web Archives". umich.edu. Archived from the original on 2013-10-03. Retrieved 2013-11-17. https://web.archive.org/web/20131003045300/http://bentley.umich.edu/dchome/webarchives/index.php

  83. "University of Texas at San Antonio Web Archives". Archive-it.org. Retrieved 2013-11-17. http://www.archive-it.org/public/partner.html?id=318

  84. "Qumram". Qumram.com. 2011-06-30. Retrieved 2019-03-06. http://www.qumram.com

  85. SAPERION AG, Berlin. "Saperion ECM Web Content Archive". saperion.com. Retrieved 2013-11-17. http://www.saperion.com

  86. "Qumram". Qumram.com. 2011-06-30. Retrieved 2019-03-06. http://www.qumram.com

  87. "AUEB Web Archive". aueb.gr. 2011-10-21. Retrieved 2013-11-17. http://archive.aueb.gr/

  88. "Archiving the Web sites of Athens University of Economics and Business" (PDF). aueb.gr. Retrieved 2013-11-17. http://www.db-net.aueb.gr/index.php/corporate/content/download/505/2394/version/1/file/ArchivingAUEB_CameraReady_V6.pdf

  89. "World Bank Web Archives0". worldbank.org. 2012-12-20. Retrieved 2013-11-17. http://go.worldbank.org/67KZ5AH4Y0

  90. "Национальный цифровой архив России". http://ruarxive.org

  91. "Websites/WikiTeam". Retrieved 2016-02-05. https://wikiapiary.com/wiki/Websites/WikiTeam

  92. Government Documents Department, University of North Texas Libraries, State of Texas (2009-02-02). "University of North Texas CyberCemetery". unt.edu. Retrieved 2013-11-17.{{cite web}}: CS1 maint: multiple names: authors list (link) http://govinfo.library.unt.edu/

  93. "CyberCemetery". UNT Digital Library. Retrieved 2019-12-25. "ACIR Research Collection". 1998-02-10. Archived from the original on 1998-02-10. Retrieved 2019-12-25. Site established: July 1997 Proceedings of the ... Annual Federal Depository Library Conference. U.S. Government Printing Office. 1999. p. 45. https://digital.library.unt.edu/explore/collections/GDCC/

  94. "[ウェブサービスレビュー]ZIPや画像のダウンロードにも対応した魚拓サービス「Archive today」 - CNET Japan". CNET Japan. June 2014. Retrieved 2014-09-02. http://japan.cnet.com/news/society/35048691/?tag=as.rss

  95. "Archive.today blog". https://blog.archive.today/post/658637611402919936/on-ktcp6-you-are-redirected-to-an-adblock-as-of

  96. "NYU Libraries | Tamiment Library & Robert F. Wagner Labor Archives". Nyu.edu. Retrieved 2013-08-19. http://www.nyu.edu/library/bobst/research/tam/

  97. "How Preservica Works - Preservica". preservica.com. May 12, 2014. Archived from the original on May 12, 2014. Retrieved May 12, 2014. http://preservica.com/preservica-works/

  98. Central State Electronic Archives of Ukraine (CSEA Ukraine) http://tsdea.archives.gov.ua/en

  99. "Information Booklet CSEA Ukraine" (PDF). Archived from the original (PDF) on 2014-04-13. Retrieved 2014-04-10. https://web.archive.org/web/20140413125920/http://tsdea.archives.gov.ua/img/bookl/bookl_csea_en.pdf

  100. York University Libraries, Toronto, ON (2012-11-01). "York University Libraries Wayback Machine". library.yorku.ca. Retrieved 2023-11-20.{{cite web}}: CS1 maint: multiple names: authors list (link) https://wayback.library.yorku.ca

  101. "Web Archiving - New York Art Resources Consortium". nyarc.org. Retrieved 2014-12-17. http://www.nyarc.org/content/web-archiving/

  102. Karl-Rainer Blumenthal (October 27, 2014). "All together now: NYARC and the National Agenda for Digital Stewardship". Archived from the original on December 17, 2014. Retrieved December 17, 2014. http://ndsr.nycdigital.org/all-together-now-nyarc-and-the-national-agenda-for-digital-stewardship/

  103. "Sound and Vision web archive". beeldengeluid.nl/en. Retrieved 2015-01-21. http://beeldengeluidwebarchief.nl/

  104. "Living Web Archives". Retrieved 2015-01-21. http://liwa-project.eu/

  105. "WEB ARCHIVING AT SOUND AND VISION: OUTCOMES OF OUR NTR PILOT". 2014-08-18. Archived from the original on 2015-01-21. Retrieved 2015-01-21. https://web.archive.org/web/20150121165857/http://www.beeldengeluid.nl/en/blogs/research-amp-development-en/201408/web-archiving-sound-and-vision-outcomes-our-ntr-pilot

  106. "WSAVE THE DATE: STUDIEDAG WEBARCHIVERING". 2014-08-19. Archived from the original on 2015-01-21. Retrieved 2015-01-21. https://web.archive.org/web/20150121165244/http://www.beeldengeluid.nl/blogs/collecties/201408/save-date-studiedag-webarchivering

  107. "A Net Art Pioneer Evolves With the Digital Age: Rhizome Turns 20 | ARTnews". www.artnews.com. September 2016. Retrieved 2016-11-13. http://www.artnews.com/2016/09/01/a-net-art-pioneer-evolves-with-the-digital-age-rhizome-turns-20/

  108. "University of Texas Libraries Human Rights Documentation Initiative homepage | University of Texas Libraries". lib.utexas.edu. Retrieved 2017-04-06. https://www.lib.utexas.edu/hrdi

  109. "Kentucky Department for Libraries and Archives | Archive-It". https://www.archive-it.org/organizations/386

  110. "Archive-It - University of California, San Francisco (UCSF)". archive-it.org. Retrieved 2017-07-12. https://archive-it.org/organizations/986

  111. "Ivy Plus Libraries". https://ivpluslibraries.org

  112. "Ivy Plus Libraries Web Resources Collection Program". https://library.columbia.edu/collections/web-archives/Ivy_Plus_Libraries.html

  113. "HTTP Archive". httparchive.org. Retrieved 2020-12-28. https://httparchive.org/

  114. "NLM Web Collecting and Archiving". www.nlm.nih.gov. Retrieved 2021-02-19. https://www.nlm.nih.gov/webcollecting/index.html

  115. "Smithsonian Libraries and Archives". Retrieved 2021-08-19. https://librariesarchives.si.edu/

  116. "Web and Social Media Archiving". Retrieved 2021-08-19. https://siarchives.si.edu/what-we-do/digital-curation/web-and-social-media-archiving/

  117. "About Ghostarchive, a web archive". Ghost Archive. Retrieved September 10, 2022. https://ghostarchive.org/about.html

  118. "Whois lookup for ghostarchive.org". who.is. Retrieved September 10, 2022. Registered On 2021-08-13 https://who.is/whois/ghostarchive.org

  119. "Common Crawl". Common Crawl. Retrieved 2023-08-27. https://commoncrawl.org/

  120. "EU Web Archive - EU Web Archive - Publications Office of the EU". EU Web Archive. Retrieved 2024-06-09. https://op.europa.eu/en/web/euwebarchive

  121. "Pandora — Australia's Web Archive". nla.gov.au. May 1999. Retrieved 2013-11-17. http://pandora.nla.gov.au/

  122. "Our digital island, a Tasmanian Web Archive". tas.gov.au. Archived from the original on 2013-03-18. Retrieved 2014-05-29. https://web.archive.org/web/20130318092233/http://www.linc.tas.gov.au/tasmaniasheritage/search/tasmanian-websites

  123. "LINC Tasmania Online - Home page". Statelibrary.tas.gov.au. 2012-06-26. Retrieved 2012-07-17. http://www.statelibrary.tas.gov.au/collections/taho/legaldeposit

  124. "Webarchive Austria". Onb.ac.at. Retrieved 2020-12-11. https://webarchiv.onb.ac.at

  125. "Deutsche Nationalbibliothek". dnb.de. Archived from the original on 2018-05-08. Retrieved 2015-09-18. https://web.archive.org/web/20180508054405/http://www.dnb.de/EN/Netzpublikationen/Webarchiv/webarchiv_node.html

  126. "DILIMAG (Digital Literature Magazines". dilimag.literature.at. Retrieved 2013-11-17.[permanent dead link‍] http://dilimag.literature.at/

  127. "Bibliothèque et Archives nationales du Québec (BAnQ)". banq.qc.ca. Retrieved 2013-11-17. http://www.banq.qc.ca/collections/collections_patrimoniales/archives_web/index.html?language_id=1

  128. "Library and Archives Canada". Library and Archives Canada. 28 May 2020. Retrieved 2023-06-10. https://library-archives.canada.ca/eng

  129. "Library and Archives Canada". Library and Archives Canada. 28 May 2020. Retrieved 2023-06-10. https://library-archives.canada.ca/eng

  130. "Web Information Collection and Preservation - WICP (Chinese Web Archive)" https://web.archive.org/web/20110810171906/http://210.82.118.162:9090/webarchive/

  131. "Croatian Web Archive (Hrvatski arhiv weba - HAW)". Haw.nsk.hr. 2004-10-01. Archived from the original on 2013-07-13. Retrieved 2013-11-17. https://web.archive.org/web/20130713080922/http://haw.nsk.hr/

  132. "Webarchiv (National Library of the Czech Republic)". webarchiv.cz. Retrieved 2015-10-30. http://webarchiv.cz/en

  133. "Netarkivet". www.kb.dk (in Danish). Retrieved 2024-06-09. https://www.kb.dk/find-materiale/samlinger/netarkivet

  134. "Estonian Web Archive". National Library of Estonia. 2014-01-09. Retrieved 2014-01-09. http://www.nlib.ee/index.php?id=21581

  135. "Finnish Web Archive". kansalliskirjasto.fi. Retrieved 2013-11-17. http://verkkoarkisto.kansalliskirjasto.fi

  136. "Bibliothèque nationale de France - Web Legal Deposit". Bnf.fr. 2010-08-17. Retrieved 2013-11-17. http://www.bnf.fr/en/professionals/digital_legal_deposit.html

  137. "Bibliothèque nationale de France - Web Legal Deposit". Bnf.fr. 2010-08-17. Retrieved 2013-11-17. http://www.bnf.fr/en/professionals/digital_legal_deposit.html

  138. "Ina (Institut National de l'Audiovisuel)" (in French). Ina.fr. Retrieved 2013-11-17. http://www.Ina.fr

  139. "E-diasporas (Télécom ParisTech, FMSH)". ediasporas.ticmigrations.fr. Archived from the original on 2013-09-27. Retrieved 2013-11-17. https://web.archive.org/web/20130927195054/http://ediasporas.ticmigrations.fr/

  140. "European Archive". Archived from the original on 2007-12-08. Retrieved 2013-11-17. https://web.archive.org/web/20071208130747/http://www.europarchive.org/

  141. "Bibliotheksservice-Zentrum Baden-Württemberg". Bsz-bw.de. Retrieved 2013-11-17. http://www.bsz-bw.de/index.html

  142. "Web archive of the German Bundestag". Webarchiv.bundestag.de. Retrieved 2013-11-17. http://webarchiv.bundestag.de/cgi/kurz.php

  143. "Iceland - VEFSAFN". Vefsafn.is. Retrieved 2013-11-17. http://vefsafn.is/index.php?page=english

  144. "Web Archiving Project (WARP), The National Diet Library, Japan". da.ndl.go.jp. Retrieved 2013-11-17. https://warp.da.ndl.go.jp

  145. "National Library of Korea - OASIS (Online Archiving & Searching Internet Resource)". Oasis.go.kr. 2013-08-01. Archived from the original on 2013-10-31. Retrieved 2013-11-17. https://web.archive.org/web/20131031102219/http://www.oasis.go.kr/intro_new/intro_overview_e.jsp

  146. "WebART (Web Archive Retrieval Tools)". http://www.webarchiving.nl/

  147. "New Zealand Web Archive". Natlib.govt.nz. Retrieved 2021-02-26. https://natlib.govt.nz/collections/a-z/new-zealand-web-archive

  148. "Nettarkivet". Nasjonalbiblioteket (in Norwegian Bokmål). Retrieved 2019-12-25. https://www.nb.no/samlingen/nettarkivet/

  149. "Arquivo.pt - search pages from the past!". arquivo.pt. Retrieved 2024-06-09. https://arquivo.pt/?l=en

  150. Foundation for Science and Technology (FCT) (2 February 2023). "Arquivo.pt in numbers". Foundation for Science and Technology (FCT). Retrieved 2 February 2023. http://sobre.arquivo.pt/en/about/press/the-portuguese-web-archive-in-numbers/

  151. Web archive of Cacak[permanent dead link‍]. digital.cacak.dis.rs http://digital.cacak-dis.rs/english/web-archive-of-cacak/

  152. "Web Archive Singapore". eresources.nlb.gov.sg/webarchives. Retrieved 2023-02-03. https://eresources.nlb.gov.sg/webarchives

  153. "Digital Resources (Webdepozit of the University Library in Bratislava)". Digital Resources. 3 January 2021. https://www.webdepozit.sk/?lang=en

  154. "Slovenian Web Archive". National and University Library of Slovenia. Retrieved 2018-02-02. http://arhiv.nuk.uni-lj.si/

  155. Biblioteca Nacional de España. "Archivo de la web española". Archived from the original on 2014-02-23. Retrieved 2014-02-20. https://web.archive.org/web/20140223093113/http://www.bne.es/es/LaBNE/ArchivoWeb/

  156. National Library of Catalonia (16 November 2012). "PADICAT: The Web Archive of Catalonia". National Library of Catalonia. Retrieved 16 November 2012. http://www.padicat.cat/en/

  157. Kai Oswald Seidler. "Basque Digital Heritage Archive (ONDARENET)". euskadi.net. Archived from the original on 2012-12-20. Retrieved 2013-11-17. https://archive.today/20121220171016/http://www.ondarenet.kultura.ejgv.euskadi.net/

  158. "Kulturarw3 - Kungliga biblioteket" (in Swedish). Kb.se. 2020-01-01. Retrieved 2021-05-04. https://www.kb.se/hitta-och-bestall/hitta-i-samlingarna/kulturarw3.html

  159. AAW Designs. "Aleph Archives". aleph-archives.com. Retrieved 2013-11-17. http://aleph-archives.com/

  160. "Web Archive Switzerland". E-helvetica.nb.admin.ch. Retrieved 2013-11-17. https://www.e-helvetica.nb.admin.ch/pages/user/webarchive/webArchiveSearch.jsf?BITfw2Ctx=xUSHZSYm2d19oyAFE&lang=en

  161. "NTU Web Archiving System, NTUWAS". ntu.edu.tw. Retrieved 2013-11-17. http://webarchive.lib.ntu.edu.tw/eng/default.asp

  162. "Web Archive Taiwan". ncl.edu.tw. Retrieved 2013-11-17. http://webarchive.ncl.edu.tw/nclwa98Front/

  163. "UK Web Archive". 2005-07-07. Retrieved 2013-11-17. https://www.webarchive.org.uk/ukwa/

  164. "Hanzo Archives". hanzoarchives.com. Retrieved 2013-11-17. http://www.hanzoarchives.com/

  165. "UK Government Web Archive". Nationalarchives.gov.uk. Retrieved 2013-11-17. http://www.nationalarchives.gov.uk/webarchive/

  166. "Internet Archive (provides Archive-it service)". 2001-03-10. Retrieved 2013-11-17. https://archive.org/

  167. "Columbia University Libraries Web Resources Collection Program". columbia.edu. Retrieved 2019-10-01. https://library.columbia.edu/collections/web-archives.html

  168. "North Carolina State Government Web Site Archives". ncdcr.gov. Retrieved 2013-11-17. http://webarchives.ncdcr.gov/

  169. "Latin American Web Archiving Project". utexas.edu. Retrieved 2013-11-17. http://lanic.utexas.edu/project/archives

  170. Dawrs, Stu. "Research Guides: Web Archiving Project of the Pacific Islands: Introduction". guides.library.manoa.hawaii.edu. Retrieved 2019-12-25. https://guides.library.manoa.hawaii.edu/pacificwebarchive

  171. "Library of Congress Web Archives". Loc.gov. Retrieved 2013-11-17. https://www.loc.gov/webarchiving/

  172. "Harvard University Library: the Web Archive Collection Service (WAX)". harvard.edu. Retrieved 2013-11-17. http://wax.lib.harvard.edu/collections/home.do

  173. "Web Archiving Service from California Digital Library (WAS service)". cdlib.org. 2013-10-16. Retrieved 2013-11-17. http://webarchives.cdlib.org/

  174. "Bentley Historical Library (University of Michigan) Web Archives". umich.edu. Archived from the original on 2013-10-03. Retrieved 2013-11-17. https://web.archive.org/web/20131003045300/http://bentley.umich.edu/dchome/webarchives/index.php

  175. "University of Texas at San Antonio Web Archives". Archive-it.org. Retrieved 2013-11-17. http://www.archive-it.org/public/partner.html?id=318

  176. "AUEB Web Archive". aueb.gr. 2011-10-21. Retrieved 2013-11-17. http://archive.aueb.gr/

  177. "World Bank Web Archives0". worldbank.org. 2012-12-20. Retrieved 2013-11-17. http://go.worldbank.org/67KZ5AH4Y0

  178. Government Documents Department, University of North Texas Libraries, State of Texas (2009-02-02). "University of North Texas CyberCemetery". unt.edu. Retrieved 2013-11-17.{{cite web}}: CS1 maint: multiple names: authors list (link) http://govinfo.library.unt.edu/

  179. York University Libraries, Toronto, ON (2012-11-01). "York University Libraries Wayback Machine". library.yorku.ca. Retrieved 2023-11-20.{{cite web}}: CS1 maint: multiple names: authors list (link) https://wayback.library.yorku.ca

  180. "Sound and Vision web archive". beeldengeluid.nl/en. Retrieved 2015-01-21. http://beeldengeluidwebarchief.nl/

  181. "WSAVE THE DATE: STUDIEDAG WEBARCHIVERING". 2014-08-19. Archived from the original on 2015-01-21. Retrieved 2015-01-21. https://web.archive.org/web/20150121165244/http://www.beeldengeluid.nl/blogs/collecties/201408/save-date-studiedag-webarchivering

  182. "Archive-It - Ivy Plus Libraries Confederation". archive-it.org. Retrieved 2021-02-19. https://archive-it.org/home/IvyPlus

  183. "EU Web Archive - EU Web Archive - Publications Office of the EU". EU Web Archive. Retrieved 2024-06-09. https://op.europa.eu/en/web/euwebarchive

  184. "Pandora — Australia's Web Archive". nla.gov.au. May 1999. Retrieved 2013-11-17. http://pandora.nla.gov.au/

  185. "Trove (Pandora Archive search)". nla.gov.au. Retrieved 2013-11-17. http://trove.nla.gov.au/website?q=

  186. "Our digital island, a Tasmanian Web Archive". tas.gov.au. Archived from the original on 2013-03-18. Retrieved 2014-05-29. https://web.archive.org/web/20130318092233/http://www.linc.tas.gov.au/tasmaniasheritage/search/tasmanian-websites

  187. "Webarchive Austria". Onb.ac.at. Retrieved 2020-12-11. https://webarchiv.onb.ac.at

  188. "Deutsche Nationalbibliothek". dnb.de. Archived from the original on 2018-05-08. Retrieved 2015-09-18. https://web.archive.org/web/20180508054405/http://www.dnb.de/EN/Netzpublikationen/Webarchiv/webarchiv_node.html

  189. "DILIMAG (Digital Literature Magazines". dilimag.literature.at. Retrieved 2013-11-17.[permanent dead link‍] http://dilimag.literature.at/

  190. "Bibliothèque et Archives nationales du Québec (BAnQ)". banq.qc.ca. Retrieved 2013-11-17. http://www.banq.qc.ca/collections/collections_patrimoniales/archives_web/index.html?language_id=1

  191. "Bibliothèque et Archives nationales du Québec (BAnQ)". banq.qc.ca. http://www.banq.qc.ca/collections/collections_patrimoniales/archives_web/index.html?language_id=1

  192. "Library and Archives Canada". Library and Archives Canada. 28 May 2020. Retrieved 2023-06-10. https://library-archives.canada.ca/eng

  193. "Library and Archives Canada". Library and Archives Canada. 28 May 2020. Retrieved 2023-06-10. https://library-archives.canada.ca/eng

  194. "Web Information Collection and Preservation - WICP (Chinese Web Archive)" https://web.archive.org/web/20110810171906/http://210.82.118.162:9090/webarchive/

  195. "Croatian Web Archive (Hrvatski arhiv weba - HAW)". Haw.nsk.hr. 2004-10-01. Archived from the original on 2013-07-13. Retrieved 2013-11-17. https://web.archive.org/web/20130713080922/http://haw.nsk.hr/

  196. "Webarchiv (National Library of the Czech Republic)". webarchiv.cz. Retrieved 2015-10-30. http://webarchiv.cz/en

  197. "Netarkivet.dk". Netarkivet.dk. 2013-10-17. Retrieved 2013-11-17. http://netarkivet.dk/in-english/

  198. "Estonian Web Archive". National Library of Estonia. 2014-01-09. Retrieved 2014-01-09. http://www.nlib.ee/index.php?id=21581

  199. "Finnish Web Archive". kansalliskirjasto.fi. Retrieved 2013-11-17. http://verkkoarkisto.kansalliskirjasto.fi

  200. "Bibliothèque nationale de France - Web Legal Deposit". Bnf.fr. 2010-08-17. Retrieved 2013-11-17. http://www.bnf.fr/en/professionals/digital_legal_deposit.html

  201. "Ina (Institut National de l'Audiovisuel)" (in French). Ina.fr. Retrieved 2013-11-17. http://www.Ina.fr

  202. "E-diasporas (Télécom ParisTech, FMSH)". ediasporas.ticmigrations.fr. Archived from the original on 2013-09-27. Retrieved 2013-11-17. https://web.archive.org/web/20130927195054/http://ediasporas.ticmigrations.fr/

  203. "Bibliotheksservice-Zentrum Baden-Württemberg". Bsz-bw.de. Retrieved 2013-11-17. http://www.bsz-bw.de/index.html

  204. "Web archive of the German Bundestag". Webarchiv.bundestag.de. Retrieved 2013-11-17. http://webarchiv.bundestag.de/cgi/kurz.php

  205. "Web archive of the German Bundestag". bundestag.de. Retrieved 2013-11-17. http://webarchiv.bundestag.de

  206. "Iceland - VEFSAFN". Vefsafn.is. Retrieved 2013-11-17. http://vefsafn.is/index.php?page=english

  207. "Web Archiving Project (WARP), The National Diet Library, Japan". da.ndl.go.jp. Retrieved 2013-11-17. https://warp.da.ndl.go.jp

  208. "National Library of Korea - OASIS (Online Archiving & Searching Internet Resource)". Oasis.go.kr. 2013-08-01. Archived from the original on 2013-10-31. Retrieved 2013-11-17. https://web.archive.org/web/20131031102219/http://www.oasis.go.kr/intro_new/intro_overview_e.jsp

  209. "National Library of Korea - OASIS". go.kr. 2013-08-01. Archived from the original on 2012-03-20. Retrieved 2013-11-17. https://web.archive.org/web/20120320030141/http://www.oasis.go.kr/ctrlu?cmd=search-dbsite

  210. "WebART (Web Archive Retrieval Tools)". http://www.webarchiving.nl/

  211. "New Zealand Web Archive". Natlib.govt.nz. Retrieved 2021-02-26. https://natlib.govt.nz/collections/a-z/new-zealand-web-archive

  212. "Nettarkivet". Nasjonalbiblioteket (in Norwegian Bokmål). Retrieved 2019-12-25. https://www.nb.no/samlingen/nettarkivet/

  213. "National Library of Norway Search". nb.no. http://www.nb.no/sok/search.jsf

  214. Daniel Gomes (November 2022). "Web archives as research infrastructure for digital societies: the case study of Arquivo.pt" (PDF). Archeion. Retrieved 2 February 2023. https://sobre.arquivo.pt/wp-content/uploads/GomesArquivoPTCaseStudy2022.pdf

  215. Web archive of Cacak[permanent dead link‍]. digital.cacak.dis.rs http://digital.cacak-dis.rs/english/web-archive-of-cacak/

  216. "Web Archive Singapore". eresources.nlb.gov.sg/webarchives. Retrieved 2023-02-03. https://eresources.nlb.gov.sg/webarchives

  217. "Digital Resources Webdepozit of the University Library in Bratislava". Digital Resources. 3 February 2020. https://www.webdepozit.sk/?lang=en

  218. "Slovenian Web Archive". National and University Library of Slovenia. Retrieved 2018-02-02. http://arhiv.nuk.uni-lj.si/

  219. Biblioteca Nacional de España. "Archivo de la web española". Archived from the original on 2014-02-23. Retrieved 2014-02-20. https://web.archive.org/web/20140223093113/http://www.bne.es/es/LaBNE/ArchivoWeb/

  220. National Library of Catalonia (16 November 2012). "PADICAT: The Web Archive of Catalonia". National Library of Catalonia. Retrieved 16 November 2012. http://www.padicat.cat/en/

  221. Kai Oswald Seidler. "Basque Digital Heritage Archive (ONDARENET)". euskadi.net. Archived from the original on 2012-12-20. Retrieved 2013-11-17. https://archive.today/20121220171016/http://www.ondarenet.kultura.ejgv.euskadi.net/

  222. "Kulturarw3 - Kungliga biblioteket" (in Swedish). Kb.se. 2020-01-01. Retrieved 2021-05-04. https://www.kb.se/hitta-och-bestall/hitta-i-samlingarna/kulturarw3.html

  223. AAW Designs. "Aleph Archives". aleph-archives.com. Retrieved 2013-11-17. http://aleph-archives.com/

  224. "Web Archive Switzerland". E-helvetica.nb.admin.ch. Retrieved 2013-11-17. https://www.e-helvetica.nb.admin.ch/pages/user/webarchive/webArchiveSearch.jsf?BITfw2Ctx=xUSHZSYm2d19oyAFE&lang=en

  225. "Web Archive Switzerland - e-Helvetica". nb.admin.ch. Retrieved 2013-11-17. http://www.e-helvetica.nb.admin.ch

  226. "NTU Web Archiving System, NTUWAS". ntu.edu.tw. Retrieved 2013-11-17. http://webarchive.lib.ntu.edu.tw/eng/default.asp

  227. "Web Archive Taiwan". ncl.edu.tw. Retrieved 2013-11-17. http://webarchive.ncl.edu.tw/nclwa98Front/

  228. "PageFreezer". pagefreezer.com. 2011-01-20. Retrieved 2013-11-17. http://www.pagefreezer.com/

  229. "UK Web Archive". 2005-07-07. Retrieved 2013-11-17. https://www.webarchive.org.uk/ukwa/

  230. "Hanzo Archives". hanzoarchives.com. Retrieved 2013-11-17. http://www.hanzoarchives.com/

  231. "UK Government Web Archive". Nationalarchives.gov.uk. Retrieved 2013-11-17. http://www.nationalarchives.gov.uk/webarchive/

  232. "UK Government Web Archive Full Text Search". Retrieved 2018-02-08. http://webarchive.nationalarchives.gov.uk/search/

  233. "UK Government Web Archive A-Z list". nationalarchives.gov.uk. Retrieved 2013-11-17. http://www.nationalarchives.gov.uk/webarchive/atoz/

  234. "Internet Archive (provides Archive-it service)". 2001-03-10. Retrieved 2013-11-17. https://archive.org/

  235. "Researcher - Documentation". archive.org. https://archive.org/web/researcher/tool_documentation.php

  236. "Using Archive.org". archive.org. https://archive.org/about/using.php

  237. "Columbia University Libraries Web Resources Collection Program". columbia.edu. Retrieved 2019-10-01. https://library.columbia.edu/collections/web-archives.html

  238. "Archive-it: Columbia University Libraries". archive-it.org. http://www.archive-it.org/public/partner.html?id=304

  239. "North Carolina State Government Web Site Archives". ncdcr.gov. Retrieved 2013-11-17. http://webarchives.ncdcr.gov/

  240. "Archive-it: Columbia University Libraries". archive-it.org. http://www.archive-it.org/public/partner.html?id=304

  241. "Latin American Web Archiving Project". utexas.edu. Retrieved 2013-11-17. http://lanic.utexas.edu/project/archives

  242. Dawrs, Stu. "Research Guides: Web Archiving Project of the Pacific Islands: Introduction". guides.library.manoa.hawaii.edu. Retrieved 2019-12-25. https://guides.library.manoa.hawaii.edu/pacificwebarchive

  243. "Library of Congress Web Archives". Loc.gov. Retrieved 2013-11-17. https://www.loc.gov/webarchiving/

  244. "Harvard University Library: the Web Archive Collection Service (WAX)". harvard.edu. Retrieved 2013-11-17. http://wax.lib.harvard.edu/collections/home.do

  245. "Web Archiving Service from California Digital Library (WAS service)". cdlib.org. 2013-10-16. Retrieved 2013-11-17. http://webarchives.cdlib.org/

  246. "Bentley Historical Library (University of Michigan) Web Archives". umich.edu. Archived from the original on 2013-10-03. Retrieved 2013-11-17. https://web.archive.org/web/20131003045300/http://bentley.umich.edu/dchome/webarchives/index.php

  247. "California Digital Library Alternative Mass Media". cdlib.org. http://webarchives.cdlib.org/a/AlternativeMassMedia

  248. "University of Texas at San Antonio Web Archives". Archive-it.org. Retrieved 2013-11-17. http://www.archive-it.org/public/partner.html?id=318

  249. "Archive-it Partners". archive-it.org http://www.archive-it.org/public/partner.html?id=318

  250. "Texas Archival Repositories Online". utexas.edu. http://www.lib.utexas.edu/taro/index.html

  251. "AUEB Web Archive". aueb.gr. 2011-10-21. Retrieved 2013-11-17. http://archive.aueb.gr/

  252. "World Bank Web Archives0". worldbank.org. 2012-12-20. Retrieved 2013-11-17. http://go.worldbank.org/67KZ5AH4Y0

  253. Government Documents Department, University of North Texas Libraries, State of Texas (2009-02-02). "University of North Texas CyberCemetery". unt.edu. Retrieved 2013-11-17.{{cite web}}: CS1 maint: multiple names: authors list (link) http://govinfo.library.unt.edu/

  254. "Tamiment Library Web Archiving Project" Archived September 25, 2012, at the Wayback Machine http://www.nyu.edu/library/bobst/research/tam/webarchive.html/

  255. "Institution: New York University Libraries / Tamiment Library (Labor & the Left)". cdlib.org. Retrieved 2013-08-19. http://webarchives.cdlib.org/institutions/NYUL/

  256. "Search Finding Aids Hosted at New York University". nyu.edu. Retrieved 2013-08-19. http://dlib.nyu.edu/findingaids/?collectionId=tamwag/

  257. York University Libraries, Toronto, ON (2012-11-01). "York University Libraries Wayback Machine". library.yorku.ca. Retrieved 2023-11-20.{{cite web}}: CS1 maint: multiple names: authors list (link) https://wayback.library.yorku.ca

  258. "Sound and Vision web archive". beeldengeluid.nl/en. Retrieved 2015-01-21. http://beeldengeluidwebarchief.nl/

  259. "Sound and Vision web archive". beeldengeluid.nl/en. Retrieved 2015-01-21. http://beeldengeluidwebarchief.nl/