kb.nl Sources - URL Collection Data
Created: November-December 2021
About
This folder contains the source data used to compile the URL lists for archiving kb.nl. The URLs were collected by searching for kb.nl references on four Wikimedia projects: Wikimedia Commons, Wikidata, English Wikipedia and Dutch Wikipedia.
Contents
txt/
Raw text files with URLs found via special searches on each Wikimedia project, run on 26 November 2021 (the date suffix in each filename is in DDMMYYYY format):
| File | Description |
|---|---|
| wiki_kb.nl_specialsearch_commons_26112021.txt | URLs found on Wikimedia Commons (~2.4 MB) |
| wiki_kb.nl_specialsearch_wikidata_26112021.txt | URLs found on Wikidata |
| wiki_kb.nl_specialsearch_wpen_26112021.txt | URLs found on English Wikipedia |
| wiki_kb.nl_specialsearch_wpnl_26112021.txt | URLs found on Dutch Wikipedia |
xlsx/
Excel versions of the source data. The per-site files are dated 30 November 2021 and the combined file is dated 2 December 2021:
| File | Description |
|---|---|
| wiki_kb.nl_specialsearch_commons_30112021.xlsx | Wikimedia Commons URLs |
| wiki_kb.nl_specialsearch_wikidata_30112021.xlsx | Wikidata URLs |
| wiki_kb.nl_specialsearch_wpen_30112021.xlsx | English Wikipedia URLs |
| wiki_kb.nl_specialsearch_wpnl_30112021.xlsx | Dutch Wikipedia URLs |
| wiki_kb.nl_specialsearch_all4wikisites_02122021.xlsx | Combined data from all 4 Wikimedia sites |
Methodology
The URLs in these files were found by using the special search functionality on each Wikimedia project to find all pages that link to or reference the kb.nl domain. The results helped identify which kb.nl pages were referenced most often and were therefore the most important to archive.
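
As an illustration of the "most referenced" idea, the following is a minimal Python sketch that counts how often each kb.nl URL occurs in a block of text. It is not the script used to produce these files. The internal layout of the txt files is not documented here, so the sketch treats its input as free text; the regular expression and the function name `count_kb_urls` are assumptions made for this example.

```python
import re
from collections import Counter

# Assumption: a "kb.nl URL" is any http(s) URL whose host is kb.nl
# or a subdomain of it (e.g. www.kb.nl, resolver.kb.nl).
KB_URL = re.compile(
    r"https?://(?:[\w-]+\.)*kb\.nl(?![\w-]|\.\w)(?:[/?#][^\s<>|\]]*)?",
    re.IGNORECASE,
)


def count_kb_urls(text: str) -> Counter:
    """Count occurrences of each kb.nl URL found in free text."""
    # Strip trailing punctuation that usually belongs to the sentence, not the URL.
    urls = (m.group(0).rstrip(".,;)") for m in KB_URL.finditer(text))
    return Counter(urls)


# Hypothetical sample input, not taken from the actual data files.
sample = (
    "See https://www.kb.nl/en and https://www.kb.nl/en. "
    "Also http://resolver.kb.nl/resolve?urn=abc, https://notkb.nl/x"
)
for url, n in count_kb_urls(sample).most_common():
    print(n, url)
```

To apply this to one of the files in txt/, the file contents could be read with `pathlib.Path(...).read_text(encoding="utf-8")` and passed to `count_kb_urls`; the encoding is also an assumption.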