New Data: Revelio Labs Workforce Data
The University of Toronto has recently licensed six separate datasets from Revelio Labs. These datasets provide detailed information on the global workforce, with coverage for most running from 2007/2008 through to the end of 2024. These datasets include:
Workforce Dynamics
This dataset provides an overview of a company's composition. This includes headcounts, inflows, and outflows for every unique position, segmented by job, seniority, geography, salary, education, skills, and gender & ethnicity.
Job Postings
This dataset includes active postings, new postings, removed postings, salaries, and full text of postings for any company, segmented by various employee characteristics (occupation, seniority, geography, keywords, skills, etc). This dataset is pulled from over 350 thousand company websites, all major job boards, and staffing firm job boards.
Sentiment
This datasets includes employee reviews for all companies, with the full text of each review split into positive and negative text. Reviews are mapped to various employee characteristics (occupation, seniority, geography).
Layoff Notices
This dataset includes post data and effective date for all layoffs in every company in the United States.
Individual Level Data
This datasets contains data on the full professional history of individuals, including their roles, education, skills, gender, ethnicity, salary, seniority, and geography.
Company Reference Data
This dataset contains information on companies that are covered by or referenced in the five other datasets listed above.
Accessing the Data
While these data are spreadsheets, they range in size from 1 - 4TB and so are too large to work with using traditional tools such as Excel or Notepad++. To facilitate easy access and querying across products, all datasets have been loaded into SciNet's supercomputing environment which is accessible to current University of Toronto Faculty, Staff and Students, after an application process.
The Map & Data Library has created a detailed tutorial on this application process. This tutorial also contains additional information on the data itself, how to query it, and where to go to get more help!