Data Dictionary

There are 7 data tables and 2 mapping tables. The available metrics, demographics, and job categories are listed at the bottom of the page.


Each row represents a metric / demographic group / job category combination for a single year and indicates whether the company reported that data during the year. A value of 2 means the data was fully reported; a value of 1 means it was partially reported (for example, white/nonwhite rather than specific values for each race/ethnicity, or a US-only gender breakdown without worldwide figures). The table measures disclosure in the year it occurred, so if a company first disclosed a metric in 2020 and included historical data, that disclosure is not reflected in prior years. Every row has source attribution (linked to the sources table).
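As a rough illustration of how the disclosure values might be read, the sketch below models a few rows in plain Python. The field names (`company`, `year`, `metric`, and so on), the sample rows, and the helper functions are all hypothetical and are not taken from the actual table schema; only the meanings of the values 2 and 1 come from the description above.

```python
from dataclasses import dataclass

# Values described in the text above.
FULLY_REPORTED = 2
PARTIALLY_REPORTED = 1


@dataclass(frozen=True)
class DisclosureRow:
    # Hypothetical field names, chosen for illustration only.
    company: str
    year: int
    metric: str
    demographic: str
    job_category: str
    value: int


def disclosure_label(value):
    """Translate a disclosure value into a readable label."""
    labels = {FULLY_REPORTED: "fully reported", PARTIALLY_REPORTED: "partially reported"}
    # Any other value is not described in the text, so it is not interpreted here.
    return labels.get(value, "not described")


def first_disclosed_year(rows, company, metric):
    """Return the earliest year with a full or partial disclosure, or None."""
    years = [
        r.year
        for r in rows
        if r.company == company
        and r.metric == metric
        and r.value in (FULLY_REPORTED, PARTIALLY_REPORTED)
    ]
    return min(years) if years else None


# Made-up rows: the company starts disclosing in 2020, so earlier years have
# no disclosure rows even if the 2020 report contained historical figures.
rows = [
    DisclosureRow("ExampleCo", 2020, "Representation", "Gender", "All employees", 1),
    DisclosureRow("ExampleCo", 2021, "Representation", "Gender", "All employees", 2),
]

print(disclosure_label(rows[0].value))
print(first_disclosed_year(rows, "ExampleCo", "Representation"))
```

Because disclosure is recorded in the year it happened, a lookup like `first_disclosed_year` reflects when a company began reporting, not the earliest year its data covers.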

Company Data

Each row represents a company-reported data value for a geography / metric / demographic type / job category combination for a single year. For comparability purposes, all figures are adjusted to exclude not disclosed/unknown categories from self-reported data. There is source attribution for every row (linked to the sources table).
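The exact adjustment method is not spelled out above. One plausible reading, sketched here purely as an assumption, is that the not disclosed/unknown share is dropped and the remaining categories are rescaled proportionally so they sum to 100%. The category names and figures are invented for the example.

```python
def exclude_undisclosed(shares, undisclosed_keys=("Not disclosed", "Unknown")):
    """Drop undisclosed categories and rescale the rest to sum to 100%.

    Assumption: proportional rescaling. The source only says that undisclosed
    and unknown categories are excluded, not how the adjustment is computed.
    """
    disclosed = {k: v for k, v in shares.items() if k not in undisclosed_keys}
    total = sum(disclosed.values())
    if total == 0:
        raise ValueError("no disclosed categories to rescale")
    return {k: 100.0 * v / total for k, v in disclosed.items()}


# Invented self-reported percentages for one company and year.
reported = {"Women": 38.0, "Men": 57.0, "Not disclosed": 5.0}
adjusted = exclude_undisclosed(reported)

print({k: round(v, 1) for k, v in adjusted.items()})
# prints {'Women': 40.0, 'Men': 60.0}
```

Under this reading, two companies with different non-response rates can be compared on the same basis, since each company's shares describe only the employees whose category is known.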

Board of Directors & Executives

Each row represents summary statistics for a company’s Board of Directors and top executives (CEO, CFO, Chief Legal Officer/General Counsel) at a point in time, going back to 2016, including average age, average tenure, and representation by gender, race/ethnicity, and LGBTQ+.

DiversIQ researches individuals’ characteristics through publicly available corporate documentation including SEC filings, ESG-related publications, and investor relations/corporate governance websites. Additional reference tools include corporate biographies, LinkedIn profiles, public records, trade and mass media publications and professional associations for any self-identified diversity characteristics and contextual information.

EEO-1 Dataset

Each row represents one year of reported data that aligns with Equal Employment Opportunity Commission (EEOC) job categories and demographic categories. Each dataset with disclosure > 1 (which excludes datasets the company has only committed to disclose) has corresponding data points in the EEO-1 Datapoint table and is linked to a source in the Sources table.

EEO-1 Datapoint

Each row represents one value for a gender / race/ethnicity / job category combination for a single year, and aligns with Equal Employment Opportunity Commission (EEOC) reporting.
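The relationship between the EEO-1 Dataset and EEO-1 Datapoint tables can be sketched as a simple one-to-many lookup. The key and column names below (`dataset_id`, `disclosure`, `source_id`, and so on) are hypothetical, as are the sample records; the only rule taken from the descriptions above is that datasets with disclosure > 1 have corresponding data points.

```python
from collections import defaultdict

# Made-up EEO-1 Dataset rows: one per company and year.
datasets = [
    {"dataset_id": 1, "company": "ExampleCo", "year": 2021, "disclosure": 2, "source_id": 10},
    {"dataset_id": 2, "company": "ExampleCo", "year": 2020, "disclosure": 1, "source_id": None},
]

# Made-up EEO-1 Datapoint rows: one per gender / race/ethnicity / job category.
datapoints = [
    {"dataset_id": 1, "gender": "Female", "race_ethnicity": "Asian",
     "job_category": "Professionals", "value": 120},
    {"dataset_id": 1, "gender": "Male", "race_ethnicity": "Asian",
     "job_category": "Professionals", "value": 150},
]


def datapoints_by_dataset(datasets, datapoints):
    """Group data points under each dataset whose disclosure value is above 1."""
    with_data = {d["dataset_id"] for d in datasets if d["disclosure"] > 1}
    grouped = defaultdict(list)
    for point in datapoints:
        if point["dataset_id"] in with_data:
            grouped[point["dataset_id"]].append(point)
    return dict(grouped)


grouped = datapoints_by_dataset(datasets, datapoints)
for dataset_id, points in grouped.items():
    print(dataset_id, len(points))
```

In this sketch the 2020 dataset has no data points because its disclosure value is not above 1, which mirrors the rule stated for the EEO-1 Dataset table.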

External Recognition

Each row shows the inclusion of a company on a list or index for a single year. The lists and indexes covered are the HRC Corporate Equality Index (CEI), Bloomberg Gender Equality Index (GEI), Disability:IN Disability Equality Index (DEI), DiversityInc Top 50 Companies for Diversity, Forbes America’s Best Employers for Diversity, and Fortune Best Workplaces for Diversity.

Glassdoor Ratings

Each row represents an average Glassdoor rating in 1 of 10 categories for a month. Ratings are collected directly from Glassdoor and are calculated by Glassdoor's proprietary algorithm, which places an emphasis on recency. There are 24 months of historical data available in each category, except Diversity & Inclusion (16 months).


Sources

Each row represents one source for human capital and DEI information, which includes SEC filings (10-K, Proxy Statements), company reports (ESG/CSR, Diversity, EEO-1), company policies/statements (Human Rights, Code of Conduct), and company websites (Diversity, Corporate Governance, ESG).


Company identifiers and metadata for mapping and filtering.

Available Metrics
Available Demographics
Available Job Categories