Canadian Intellectual Property Office (CIPO) Patent PostgreSQL Database

The Canadian Intellectual Property Office (CIPO) Patent PostgreSQL Database is a metadata extract from the XML Databank of CIPO's IP Horizons that contains information on over 2.5 million patent documents filed in Canada. Data are available from 1870 to the present, and include metadata as well as the full text of patent descriptions and claims information. These documents represent both patent applications and patent grants, as well as patents that have expired. Many patents provide references to equivalent patents filed in other countries, via the World Intellectual Property Organization's (WIPO) Patent Cooperation Treaty (PCT).

This XML has been converted into an object-relational database (updated annually) and is available to UofT faculty, staff, and students for querying in a high-performance computing environment offered by SciNet. The database currently contains data up to and including July 23, 2023.

This is an excellent dataset for use in text and data mining research. It can be queried programmatically, either directly with SQL statements or through Python scripts, with no limits on query results.

Restrictions

While CIPO's XML files are openly available via IP Horizons, this PostgreSQL database is intended for academic study, research, teaching and administrative use at the University of Toronto. Access to the database is restricted to University of Toronto faculty, students, researchers and staff.

Access

To access the database, you must first gain access to the high-performance computing environment through a multi-step process that creates the appropriate account. Creating the account for the first time may take a few days.

Access Instructions

Working with the Database

You query the database using SQL statements. You can then either continue to work with the results within the high-performance computing environment or download them as a CSV file.

  • This document describes the various tables and their contents and can help you when constructing your SQL statements.
  • This Entity Relationship Diagram (ERD) provides a visual representation of all of the tables within the database and their relationships.
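As a rough illustration of the query-then-export workflow described above, the following Python sketch runs a parameterized SQL query and writes the results as CSV. It is only a sketch under stated assumptions: the table name `patents` and the columns `patent_number`, `title`, and `filing_year` are hypothetical and are not taken from the actual schema (consult the table documentation and the ERD for the real names). An in-memory SQLite database stands in for the PostgreSQL connection so that the example runs anywhere; the helper `query_to_csv` is likewise an illustrative name, not part of any provided tooling.

```python
import csv
import io
import sqlite3


def query_to_csv(conn, sql, params, out):
    """Run a parameterized query on a DB-API connection and write the rows as CSV.

    Returns the number of data rows written (the header row is not counted).
    """
    cur = conn.cursor()
    cur.execute(sql, params)
    writer = csv.writer(out)
    # cursor.description has one entry per result column; the column name comes first.
    writer.writerow([col[0] for col in cur.description])
    rows_written = 0
    for row in cur:
        writer.writerow(row)
        rows_written += 1
    cur.close()
    return rows_written


# Stand-in data: an in-memory SQLite table with HYPOTHETICAL table and column names.
# Against the real database you would pass a PostgreSQL connection instead, and a
# PostgreSQL driver may use a different parameter placeholder than SQLite's "?".
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE patents (patent_number INTEGER, title TEXT, filing_year INTEGER)"
)
conn.executemany(
    "INSERT INTO patents VALUES (?, ?, ?)",
    [(1, "Steam valve", 1870), (2, "Telephone relay", 1901), (3, "Battery cell", 1999)],
)

buffer = io.StringIO()
count = query_to_csv(
    conn,
    "SELECT patent_number, title FROM patents "
    "WHERE filing_year < ? ORDER BY patent_number",
    (1950,),
    buffer,
)
print(buffer.getvalue())
```

To save the results to disk rather than to an in-memory buffer, pass a file opened with `open("results.csv", "w", newline="")` as the `out` argument. Passing the filter value as a parameter, instead of pasting it into the SQL string, keeps the query safe to reuse with different values.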

Help

If working with object-relational databases, SQL, and/or high-performance computing environments is new to you, check out this tutorial for Windows users or this tutorial for Mac users to help you get started.

If you have any questions, feel free to contact us.