Freebase Data Dumps
Freebase provides full data dumps of all the current facts and assertions in our system. You've always been able to use our developer API to access this information, and we've recently made it easy to export tabular data about a single Freebase type through our website, but now for the first time you can grab everything in one convenient package. Freebase data dumps are complete, general-purpose extracts of the Freebase data in a variety of formats. We will be releasing a fresh data dump every three months.
Data Dumps
Three formats are currently available:
- TSV
  A tab-separated file for each type in Freebase, suitable for loading into spreadsheets or database software. Each line in these files represents an instance of a Freebase type, and the columns represent the available properties for the type. You may download the full set, or browse Freebase domains and types to find specific data sets.
  - The December 2009 full download is approximately 1046 Mbytes compressed in the Bzip2 format.
  - The December 2009 browseable set contains 6663 TSV files in 86 domains.
- Link Export
  A full dump of Freebase assertions in a simple UTF-8 text format. This is a complete "low level" dump of the data, suitable for post-processing into RDF or XML datasets. The link export is a series of lines, one assertion per line. Each line is a tab-separated quadruple: <source>, <property>, <destination>, <value>. An assertion is a statement of fact about the <source> object. In any assertion, at least one of <destination> and <value> is present.
  - A sample of this output is available.
  - The December 2009 Link Export is approximately 2490 Mbytes compressed in the Bzip2 format (updated December 30, 2009).
- Simple Topic Dump
  A tab-separated file containing basic identifying information about every topic in Freebase. The columns are: GUID, English display name, Freebase /en keys (comma-separated), numeric English Wikipedia keys (comma-separated), Freebase types (comma-separated), and a short text description from Wikipedia (when available). Tabs and newlines are backslash-escaped, and null fields are represented by "\N".
  - The December 2009 Simple Topic Dump is approximately 920 Mbytes compressed in the Bzip2 format (updated December 30, 2009).
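As a rough illustration of the Link Export and Simple Topic Dump layouts described above, here is a minimal Python sketch that parses one line of each. It rests on assumptions this page does not confirm: that a missing <destination> or <value> appears as an empty field, that the backslash escapes are `\t`, `\n`, and `\\`, and that the files have no header row. The names `Assertion`, `Topic`, `parse_link_line`, `parse_topic_line`, and `unescape` are hypothetical helpers, not part of any Freebase tooling.

```python
import re
from typing import List, NamedTuple, Optional


class Assertion(NamedTuple):
    source: str
    prop: str
    destination: Optional[str]
    value: Optional[str]


class Topic(NamedTuple):
    guid: Optional[str]
    name: Optional[str]
    en_keys: List[str]
    wikipedia_keys: List[str]
    types: List[str]
    description: Optional[str]


# Assumed escape sequences; the dump only states that tabs and
# newlines are "backslash-escaped".
_ESCAPES = {"t": "\t", "n": "\n", "\\": "\\"}


def unescape(field: str) -> Optional[str]:
    """Decode one Simple Topic Dump field; the literal \\N means null."""
    if field == r"\N":
        return None
    # Unknown escape sequences are left untouched.
    return re.sub(r"\\(.)", lambda m: _ESCAPES.get(m.group(1), m.group(0)), field)


def _split_list(field: str) -> List[str]:
    """Decode a comma-separated column into a list (empty if null or blank)."""
    value = unescape(field)
    return value.split(",") if value else []


def parse_link_line(line: str) -> Assertion:
    """Split one Link Export line into its four tab-separated parts."""
    fields = line.rstrip("\r\n").split("\t")
    if len(fields) != 4:
        raise ValueError(f"expected 4 fields, got {len(fields)}")
    source, prop, destination, value = fields
    # Assumption: an absent destination or value is an empty field.
    return Assertion(source, prop, destination or None, value or None)


def parse_topic_line(line: str) -> Topic:
    """Split one Simple Topic Dump line into its six documented columns."""
    fields = line.rstrip("\r\n").split("\t")
    if len(fields) != 6:
        raise ValueError(f"expected 6 fields, got {len(fields)}")
    guid, name, en_keys, wp_keys, types, description = fields
    return Topic(
        unescape(guid),
        unescape(name),
        _split_list(en_keys),
        _split_list(wp_keys),
        _split_list(types),
        unescape(description),
    )
```

Because the compressed files are large, it is probably best to stream them line by line (for example with `bz2.open(path, "rt", encoding="utf-8")` from the Python standard library) and pass each line to these functions, rather than decompressing everything into memory first.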
License
Freebase Data Dumps are provided free of charge for any purpose with regular updates by Metaweb Technologies. They are distributed, like Freebase itself, under the Creative Commons Attribution (CC-BY) license and use is subject to the Freebase Terms of Service. If you include the data from these data dumps in a website or application, you must attribute us as described in our Licensing Policy.
Contact
Questions and comments about these data dumps should be directed to the Freebase Developer Email List.
Citing
If you'd like to cite these data dumps in a publication, you may use:
Metaweb Technologies, Freebase Data Dumps, http://download.freebase.com/datadumps/, December 30, 2009
Or as BibTeX:
@misc{metaweb:datadumps,
title = "Freebase Data Dumps",
author = "Metaweb Technologies",
howpublished = "\url{http://download.freebase.com/datadumps/}",
edition = "December 30, 2009",
year = "2009"
}