Introduction

Crunchbase is an online platform collecting information about startups and technology companies, including attributes and relations of companies, people, and investments.
By means of linked-crunchbase.org, we bring Crunchbase to the Web of Data so that its data can be used in the machine-readable format RDF by anyone on the Web.

Overall, we provide the following items:

  1. a Linked Data API for Crunchbase;
  2. instructions on how to collect Crunchbase data in RDF via crawling;
  3. mappings from Crunchbase to other knowledge graphs.

Linked Data API

Data Formats

Our Linked Data API supports JSON, JSON-LD, and RDF/N-Triples.
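
As a minimal sketch of how a client might request one of these formats, the snippet below maps each format name to its standard IANA media type and builds an Accept header from it. The helper name `accept_header` is hypothetical, and it is an assumption that the API selects the format through content negotiation on exactly these media types.

```python
# Standard IANA media types for the three formats listed above.
# Assumption: the API honours these values in the Accept header.
MEDIA_TYPES = {
    "json": "application/json",
    "json-ld": "application/ld+json",
    "n-triples": "application/n-triples",
}


def accept_header(fmt: str) -> dict[str, str]:
    """Return an Accept header for a format name (hypothetical helper)."""
    key = fmt.strip().lower()
    if key not in MEDIA_TYPES:
        raise ValueError(f"unsupported format: {fmt!r}")
    return {"Accept": MEDIA_TYPES[key]}
```

For example, `accept_header("JSON-LD")` yields a header asking for `application/ld+json`, while an unknown format name raises a `ValueError` rather than silently falling back to a default.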

Metadata
  1. VOAF: voaf.ttl
  2. VoID: void.ttl
  3. Our ontology with links to schema.org.

Please take note of the Crunchbase Terms of Service concerning the licensing of Crunchbase data.

Authentication

To use our Linked Data API for Crunchbase, you have to authenticate to Crunchbase. To do so, enter your API key here (put the key in the username field and leave the password field empty). If you are not authenticated, only the triples concerning the mapping to DBpedia are returned, owing to the licensing model of Crunchbase.

Example Linked Data API Usage

  1. /api/organizations/facebook#id
  2. /api/people/mark-zuckerberg#id

With curl:

curl -v -H "Accept: text/turtle" -H "Authorization: Basic {Base64-encoded key}" "http://linked-crunchbase.org/api/organizations/facebook#id"
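
The same request can be sketched in Python with the standard library only. Following the authentication note above, the API key is used as the Basic-auth username with an empty password, so the encoded credential is the key followed by a colon. The function names and the `BASE_URL` constant are hypothetical, and the `#id` fragment is left out of the path because URL fragments are never sent to the server.

```python
import base64
import urllib.request

# Host taken from the curl example above.
BASE_URL = "http://linked-crunchbase.org"


def basic_auth_value(api_key: str) -> str:
    """Encode 'key:' (key as username, empty password) for HTTP Basic auth."""
    credentials = f"{api_key}:".encode("utf-8")
    return "Basic " + base64.b64encode(credentials).decode("ascii")


def build_request(path: str, api_key: str,
                  accept: str = "text/turtle") -> urllib.request.Request:
    """Prepare a GET request for an API path such as /api/organizations/facebook."""
    request = urllib.request.Request(BASE_URL + path)
    request.add_header("Accept", accept)
    request.add_header("Authorization", basic_auth_value(api_key))
    return request


def fetch(path: str, api_key: str, accept: str = "text/turtle") -> str:
    """Send the request and return the response body as text."""
    with urllib.request.urlopen(build_request(path, api_key, accept)) as response:
        return response.read().decode("utf-8")
```

Calling `fetch("/api/organizations/facebook", your_key)` with a valid key would perform the request shown in the curl command. Building the request separately from sending it makes it possible to inspect the headers without touching the network.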

Crawling RDF Data with the Linked Data API

A detailed description of how to crawl an RDF data set based on our Linked Data API is given on GitHub.

Due to licensing issues, we cannot provide a current RDF data set. However, a data set from 2015 is provided online at Zenodo. It contains, among other things, information about 1.9 million jobs, 1.3 million websites, 568,000 organizations, 520,000 news items, and 430,000 people, and it is licensed partly under the Creative Commons Attribution-NonCommercial License 4.0 (CC-BY-NC) and partly under the Creative Commons Attribution License 4.0 (CC-BY).

Please take note of the Crunchbase Terms of Service concerning the licensing of Crunchbase data.

Mappings

Mappings to DBpedia

We provide mappings of organizations and mappings of people in Crunchbase to DBpedia.
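
As a rough sketch of how such mappings could be consumed, the snippet below scans N-Triples text and collects every triple whose object is a DBpedia resource. It assumes the mappings are available as N-Triples and that mapped objects live under `http://dbpedia.org/resource/`; the predicate used by the mapping is not assumed, and the function name is hypothetical. The regular expression only handles triples made of three IRIs, which is enough for entity-to-entity links but is not a full N-Triples parser.

```python
import re

# Assumption: mapped DBpedia entities use the standard resource namespace.
DBPEDIA_PREFIX = "http://dbpedia.org/resource/"

# Matches one triple whose subject, predicate, and object are all IRIs.
IRI_TRIPLE = re.compile(r"^\s*<([^>]*)>\s+<([^>]*)>\s+<([^>]*)>\s*\.\s*$")


def dbpedia_links(ntriples: str) -> list[tuple[str, str]]:
    """Return (subject, DBpedia resource) pairs found in N-Triples text."""
    links = []
    for line in ntriples.splitlines():
        match = IRI_TRIPLE.match(line)
        if match and match.group(3).startswith(DBPEDIA_PREFIX):
            links.append((match.group(1), match.group(3)))
    return links
```

Triples with literal objects or with objects outside the DBpedia namespace are skipped, so the result contains only the entity links.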

About Us

For more information about Linked Crunchbase, see our papers:

  1. Michael Färber, Carsten Menne, Andreas Harth. "A Linked Data Wrapper for CrunchBase". In: Semantic Web Journal 9(4). IOS Press, 2018, pp. 505–515.
  2. Michael Färber. "Linked Crunchbase: A Linked Data API and RDF Data Set About Innovative Companies". arXiv preprint, abs/1907.08671, 2019.

The source code of our wrapper is available on GitHub.

A previous, outdated version of the Crunchbase Linked Data API is provided here.

Feel free to reach out to us: michael . faerber [kit] edu

Changelog:
2016-02-06: initial release of the Crunchbase wrapper 1.0
2018-12-10: initial release of the Crunchbase wrapper 2.0
2020-04-23: website relaunch