Webarchiv preserves Czech web


For basic informations about Webarchiv visit our introduction


FAQ - Frequently Asked Questions

General information

Why National Library of the Czech republic archive web?

The amount of documents published on the Internet is growing dramatically – many of them are often changing and others are even being lost. If the documents that have a research value are not archived a considerable part of the national cultural heritage would disappear forever. The responsibility for archiving online-born documents and their registration in the national bibliography is usually assumed by national libraries and/or other deposit libraries.

Which webpages do you archive?

We perform a complete archivation of the „entire“ Czech web.

What is the archived version of a webpage?

It is a snapshot of normal webpage which is stored in our archive.

How big is Czech web archive?

We store more than 250 TB data at the moment and this number is increasing. First web page was archived in September 2001.

What is Wayback Machine?

Wayback Machine is a web application built for accessing web archives. Wayback Machine was originally developed by Internet Archive and we use it for our archive too.

Can I cite webpage from archive?

Yes, you can! URLs in our archive are permanent, so you can cite the archived webpage as you would do with a normal webpage.

How can I search in Czech web archive?

If you know the website you are looking for, you can use a search box on our website. You can search by URL adress or keyword (name, topic etc.). There is an option to limit your searching by date range. You can also browse archive by subject categories. It is possible to find our bibliographic records in the library catalogue.

Is there any difference in searching on our web or in wayback machine?

You can search by URL adress both ways. On our website you can search by keywords extra.

Why is the website i’m looking for absent?

We are trying to archive the Czech web completely, if you are missing some website, you can nominate it for archiving.

Access to archive

Why are some archived websites incomplete or display strangely?

Web archiving tools used by the Czech web archive have technical limitations and are currently unable to capture every component from the website. This can cause that some websites display strangely or are incomplete.

Why are some archived websites blocked?

According to Czech copyright law online access to archived websites is based on agreement with website owner or on Creative Commons licence. Website without this agreement are blocked from our online archive and they are accessible only from the library terminals.

Harvesting

How can I suggest a website for archiving?

You are welcome to nominate a website for archiving, just fill in the form.

Why is my web page in Czech web archive and how can I remove it from archive?

One of Czech national library goals is to preserve the Czech cultural heritage. We are building the collection of the Czech websites - the cultural heritage of born-online documents. Since 2006 the Czech national library has begun to archive the whole of the CZ web domain. If you want to exclude your website from archive, please email us at webarchiv[@]nkp.cz.

How can I protect my privacy?

Czech web archive only collect and archive pages that are publicly available, the same one that you might find as you surf around the web. We do not collect pages that require a password to access or pages that are only accessible when a person types into and sends a form (for ex. search box).

If you think that there has been a violation of your privacy from our side, please contact us webarchiv[@]nkp.cz

Do you archive emails, chats or any other personal information?

No, we do not collect or archive chat systems or personal email messages. We collect pages that are publicly available, the same ones that you might find as you surf around the web.

Can I get a copy of my web site from archive if my site got hacked or damaged?

Unfortunately we don't cover backups for the general public. However, you may use the archive to locate and access archived versions of a your site, but we can't guarantee that your site has been or will be archived.

Do you ignore robots.txt standard?

Yes, we ignore robots.txt at the moment.