Data Set 3.4
This pages provides downloads of the DBpedia datasets. The DBpedia datasets are licensed under the terms of the Creative Commons Attribution-ShareAlike License and the GNU Free Documentation License.
The downloads are provided as N-Triples and in CSV format. All files are bz2 packed.
Older Versions: DBpedia 3.3, DBpedia 3.2, DBpedia 3.1, DBpedia 3.0, DBpedia 3.0RC, DBpedia 2.0
Content
1 Wikipedia Input Files
The datasets were extracted from Wikipedia dumps generated in late September 2009. Specific dates and times:
en
de
fr
pl
ja
it
nl
es
pt
ru
sv
zh
Dump start
20 11:12
17 19:06
24 12:00
29 03:22
27 08:16
26 08:46
25 00:23
27 15:37
26 22:12
28 09:59
27 16:18
25 07:02
Dump end
24 07:46
18 08:51
25 01:03
29 08:25
27 14:19
26 15:06
25 04:22
27 22:09
27 02:51
28 16:12
27 18:33
25 10:57
2 Core Datasets
NOTE: You can find DBpedia dumps in 91 languages at our DBpedia download server.
Move the mouse over the download links to obtain additional information.
Dataset
en
de
fr
pl
ja
it
nl
es
pt
ru
sv
zh
DBpedia Ontology ( preview )
owl
--
--
--
--
--
--
--
--
--
--
--
Ontology Types ( preview )
nt csv
--
--
--
--
--
--
--
--
--
--
--
Ontology Infoboxes (strict) ( preview )
nt csv
--
--
--
--
--
--
--
--
--
--
--
Ontology Infoboxes (loose) ( preview )
nt csv
--
--
--
--
--
--
--
--
--
--
--
Titles ( preview )
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
Short Abstracts ( preview )
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
Extended Abstracts ( preview )
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
Images ( preview )
nt csv
--
--
--
--
--
--
--
--
--
--
--
Links to Wikipedia Article ( preview )
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
Articles Categories ( preview )
nt csv
--
--
--
--
--
--
--
--
--
--
--
External Links ( preview )
nt csv
--
--
--
--
--
--
--
--
--
--
--
Infoboxes ( preview )
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
Properties ( preview )
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
Homepages ( preview )
nt csv
nt csv
nt csv
--
--
--
--
--
--
--
--
--
Geographic Coordinates ( preview )
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
Pagelinks ( preview )
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
nt csv
Persondata ( preview )
nt csv
nt csv
--
--
--
--
--
--
--
--
--
--
Redirects ( preview )
nt csv
--
--
--
--
--
--
--
--
--
--
--
Disambiguation Links ( preview )
nt csv
--
--
--
--
--
--
--
--
--
--
--
Categories (Labels) ( preview )
nt csv
--
--
--
--
--
--
--
--
--
--
--
Categories (Skos) ( preview )
nt csv
--
--
--
--
--
--
--
--
--
--
--
3 Extended Datasets
Move the mouse over the download links to obtain additional information.
Dataset
links
Links to RDF Bookmashup ( preview )
nt -
Links to DailyMed ( preview )
nt -
Links to DBLP ( preview )
nt -
Links to Diseasome ( preview )
nt -
Links to DrugBank ( preview )
nt -
Links to Eurostat ( preview )
nt -
Links to CIA Factbook ( preview )
nt -
Links to flickr wrappr ( preview )
nt -
Links to Freebase ( preview )
ttl -
Links to Geonames ( preview )
nt -
Links to Project Gutenberg ( preview )
nt -
Links to MusicBrainz ( preview )
nt -
Links to Cyc ( preview )
nt -
Links to Revyu ( preview )
nt -
Links to SIDER ( preview )
nt -
Links to TCMGeneDIT ( preview )
nt -
Links to US Census ( preview )
nt -
Links to WikiCompany ( preview )
nt -
WordNet Classes ( preview )
nt -
YAGO Classes ( preview )
nt -
YAGO Links ( preview )
nt -
4 Dataset Descriptions
DBpedia Ontology
The DBpedia ontology in OWL. See Ontology and DBpedia JWS article for more details.
Ontology Types
Contains triples of the form $object rdf:type $class from the ontology-based extraction.
Ontology Infoboxes (strict)
Infoboxe data from the strict ontology-based extraction.
Ontology Infoboxes (loose)
Infoboxe data from the loose ontology-based extraction.
Titles
Titles of all Wikipedia Articles in the corresponding language
Short Abstracts
Short Abstracts (max. 500 chars long) of Wikipedia Articles
Extended Abstracts
Additional, extended English abstracts.
Images
Thumbnail Links from Wikipedia Articles
Links to Wikipedia Article
Links to corresponding Articles in Wikipedia
Articles Categories
Links from concepts to categories using the SKOS vocabulary.
External Links
Links to external web pages about a concept.
Infoboxes
The Infobox Dataset is created using our initial, now two year old infobox parsing approach.
Properties
All properties / predicates used in infoboxes.
Homepages
Links to external webpages.
Geographic Coordinates
Geographic coordinates extracted from Wikipedia.
Pagelinks
Dataset containing internal links between DBpedia instances. The dataset was created from the internal pagelinks between Wikipedia articles. The dataset might be useful for structural analysis, data mining or for ranking DBpedia instances using Page Rank or similar algorithms.
Persondata
Information about persons (date and place of birth etc.) extracted from the English and German Wikipedia, represented using the FOAF vocabulary.
Redirects
Dataset containing redirects between Articles in Wikipedia
Disambiugation Links
Extraction from Disambiguation Templates
Categories (Labels)
Labels for Categories.
Categories (Skos)
Information which concept is a category and how categories are related using the SKOS Vocabulary.
Links to RDF Bookmashup
Links between books in DBpedia and data about them provided by the RDF Book Mashup. Provided by Georgi Kobilarov. Update mechanism: unclear/copy over from previous release.
Links to DailyMed
Links between DBpedia and DailyMed. Update mechanism: unclear/copy over from previous release.
Links to DBLP
Links between computer scientists in DBpedia and their publications in the DBLP database. Links were created manually. Update mechanism: Copy over from previous release.
Links to Diseasome
Links between DBpedia and Diseasome. Update mechanism: unclear/copy over from previous release.
Links to DrugBank
Links between DBpedia and DrugBank. Update mechanism: unclear/copy over from previous release.
Links to Eurostat
Links between countries and regions in DBpedia and data about them from Eurostat. Links were created manually. Update mechanism: Copy over from previous release.
Links to CIA Factbook
Links between countries in DBpedia and data about them from CIA Factbook. Links were created manually. Update mechanism: Copy over from previous release.
Links to flickr wrappr
Links between DBpedia concepts and photo collections depicting them generated by the flikr wrappr. Update mechanism: script in SVN.
Links to Freebase
Links between DBpedia and Freebase. Update mechanism: unclear/copy over from previous release.
Links to Geonames
Links between geographic places in DBpedia and data about them in the Geonames database. Provided by the Geonames people. Update mechanism: unclear/copy over from previous release.
Links to Project Gutenberg
Links between writers in DBpedia and data about them from Project Gutenberg. Update mechanism: script in SVN. Since this requires manual changes of files and a D2R installation, it will be copied over from the previous DBpedia version and updated between releases by the maintainers (Piet Hensel and Georgi Kobilarov).
Links to MusicBrainz
Links between artists, albums and songs in DBpedia and data about them from MusicBrainz. Created manually using the result of SPARQL queries. Update mechanism: unclear/copy over from previous release.
Links to Cyc
Links between DBpedia and Cyc concepts. Details. Update mechanism: awk script.
Links to Revyu
Links to Reviews about things in Revyu. Created manually by Tom Heath. Update mechanism: unclear/copy over from previous release.
Links to SIDER
Links between DBpedia and SIDER. Update mechanism: unclear/copy over from previous release.
Links to TCMGeneDIT
Links between DBpedia and TCMGeneDIT. Update mechanism: unclear/copy over from previous release.
Links to US Census
Links between US cities and states in DBpedia and data about them from US Census. Update mechanism: unclear/copy over from previous release.
Links to WikiCompany
Links between companies in DBpedia and companies in Wikicompany. Update mechanism: script in SVN.
WordNet Classes
Classification links to RDF representations of WordNet classes. Update mechanism: unclear/copy over from previous release.
YAGO Classes
Dataset containing rdf:type Statements for all DBpedia instances using YAGO classification algorithm. Includes the RDFS hierarchy of YAGO classes. This data set is created by running the DBpediaExport converter available at the YAGO website. Currently maintained by Fabian Suchanek and Jens Lehmann.
YAGO Links
Dataset containing owl:sameAs and owl:equivalent class mappings from DBpedia to YAGO. This data set is created by running the DBpediaLink converter available at the YAGO website. Currently maintained by Fabian Suchanek and Jens Lehmann.
[Note for Wiki Editors: The wiki code for this page is generated automatically. Please modify the files in http://dbpedia.svn.sourceforge.net/viewvc/dbpedia/related_apps/downloadpagecreator/ to make permanent changes.]