KFF has conducted this annual survey of employer-sponsored health benefits since 1999. Since 2020, KFF has employed Davis Research LLC (Davis) to field the survey. From January to July 2024, Davis interviewed business owners as well as human resource and benefits managers at 2,142 firms.

SURVEY TOPICS

The survey includes questions on the cost of health insurance, health benefit offer rates, coverage, eligibility, plan type enrollment, premium contributions, employee cost sharing, prescription drug benefits, retiree health benefits, and wellness benefits.

Firms that offer health benefits are asked about the plan attributes of their largest health maintenance organization (HMO), preferred provider organization (PPO), point-of-service (POS) plan, and high-deductible health plan with a savings option (HDHP/SO).4 We treat exclusive provider organizations (EPOs) and HMOs as one plan type and conventional (or indemnity) plans as PPOs. The survey defines an HMO as a plan that does not cover nonemergency out-of-network services. POS plans use a primary care gatekeeper to screen for specialist and hospital visits. HDHP/SOs are plans with a deductible of at least $1,000 for single coverage and $2,000 for family coverage and that either offer a health reimbursement arrangement (HRA) or are eligible for a health savings account (HSA). Definitions of the health plan types are available in Section 4, and a detailed explanation of the HDHP/SO plan type is in Section 8. Throughout this report, we use the term “in-network” to refer to services received from a preferred provider.

To reduce survey burden, questions on cost sharing for office visits, hospitalization, outpatient surgery and prescription drugs were only asked about the firm’s largest plan type. Firms sponsoring multiple plan types were asked about premiums, worker contributions and deductibles for their two largest plan types. Within each plan type, respondents are asked about the plan with the most enrollment.

Firms are asked about the attributes of their current plans during the interview. While the survey’s fielding period begins in January, many respondents may have a plan whose 2024 plan year lags behind the calendar year. In some cases, plans may report the attributes of their 2023 plans and some plan attributes (such as HSA deductible limits) may not meet the calendar year regulatory requirements. Decisions concerning plan features and costs may have taken place months before the interview.
.

SAMPLE DESIGN

The sample for the annual KFF Employer Health Benefits Survey includes private firms and nonfederal government employers with three or more employees. The universe is defined by the U.S. Census’ 2020 Statistics of U.S. Businesses (SUSB) for private firms and the 2022 Census of Governments (COG) for non-federal public employers. At the time of the sample design (December 2023), this data represented the most current information on the number of public and private firms nationwide with three or more workers. As in the past, the post-stratification is based on the most up-to-date Census data available (the 2021 SUSB). We determine the sample size based on the number of firms needed to ensure a target number of completes in six size categories.

We attempted to repeat interviews with prior years’ survey respondents (with at least ten employees) who participated in either the 2022 or the 2023 survey, or both. Firms with 3-9 employees are not included in the panel to minimize the potential of panel effects. In total, 278 firms participated in 2022, 464 firms participated in 2023, and 612 firms participated in both 2022 and 2023. Non-panel firms are randomly selected within size and industry groups.

Since 2010, the sample has been drawn from a Dynata list (based on a census assembled by Dun and Bradstreet) of the nation’s private employers and the COG for public employers. To increase precision, we stratified the sample by ten industry categories and six size categories. The federal government and businesses with fewer than three employees are not included. Education is a separate category for the purposes of sampling, and included in ‘Service’ category for weighting. For information on changes to the sampling methods over time, please consult the extended methods at ehbs.kff.org.
.

RESPONSE RATE

Response rates are calculated using a CASRO method, which accounts for firms that are determined to be ineligible in its calculation. The overall response rate is 14% [Figure M.1].5 The response rate for panel firms is higher than the response rate for non-panel firms. Similar to other employer and household surveys, the Employer Health Benefits Survey has seen a general decrease in response rates over time. Since 2017, we have attempted to increase the number of completes by increasing the number of non-panel firms in the sample. While this generally increases the precision of estimates by ensuring a sufficient number of respondents in various sub-groups, it has the effect of reducing the overall response rate.

The vast majority of questions are asked only of firms that offer health benefits. A total of 1,703 of the 2,142 responding firms indicated they offered health benefits. We asked one question of all firms in the study with which we made phone contact even if the firm declined to participate: “Does your company offer a health insurance program as a benefit to any of your employees?”. A total of 4,769 firms responded to this question (including 2,142 who responded to the full survey and 2,627 who responded to this one question). These responses are included in our estimates of the percentage of firms offering health benefits.6 The response rate for this question is 31% [Figure M.1].

Figure M.1: Response Rates for Various Subsets of the Sample, 2024

Figure M.1: Response Rates for Various Subsets of the Sample, 2024

While response rates have decreased, elements of the survey design limit the potential impact of a response bias. Most major statistics are weighted by the percentage of covered workers at a firm. Collectively, 3,200,000 of the 71,600,000 workers covered by their own firm’s health benefits in the United States were employed by firms which completed in the survey. The most important statistic that is weighted by the number of employers is the offer rate; firms that do not complete the full survey are asked whether their firm offers health benefits to any employees. As noted, this question relies on a wider set of respondents than just those completing the full survey. As in years past the majority of firms are very small, so the considerable fluctuation we see across years in the offer rate for these small firms drives the overall offer rate.
.

FIRM SIZES AND KEY DEFINITIONS

Throughout the report, we report data by size of firm, region, and industry. Unless otherwise specified, firm size definitions are as follows: small firms: 3-199 workers; and large firms: 200 or more workers. [Figure M.2] shows selected characteristics of the survey sample. A firm’s primary industry classification is determined from Dynata’s designation on the sampling frame and is based on the U.S. Census Bureau’s North American Industry Classification System (NAICS), [Figure M.3]. A firm’s ownership category and other firm characteristics such as the firm’s wage level and the age of the work force are based on respondents’ answers. While there is considerable overlap in firms in the “State/Local Government” industry category and those in the “public” ownership category, they are not identical. For example, public school districts are included in the ‘Service’ industry even though they are publicly owned. Family coverage is defined as health coverage for a family of four.

Figure M.2: Selected Characteristics of Firms in the Survey Sample, 2024

Figure M.2: Selected Characteristics of Firms in the Survey Sample, 2024

Figure M.3: Industries by NAICS code

Figure M.3: Industries by NAICS code

[Figure M.4] presents the breakdown of states into regions and is based on the U.S Census Bureau’s categorizations. State-level data are not reported both because the sample size is insufficient in many states and we only collect information on a firm’s primary location rather than where all workers may actually be employed. Some mid- and large-size employers have employees in more than one state, so the location of the headquarters may not match the location of the plan for which we collected premium information.

Figure M.4: States by Region, 2024

Figure M.4: States by Region, 2024

[Figure M.5] displays the distribution of the nation’s firms, workers, and covered workers (employees receiving coverage from their employer). Among the three million firms nationally, approximately 60.4% employ 3 to 9 workers; such firms employ 7.5% of workers, and 3.8% of workers covered by health insurance. In contrast, less than one percent of firms employ 5,000 or more workers; these firms employ 37.8% of workers and 43.1% of covered workers. Therefore, the smallest firms dominate any statistics weighted by the number of employers. For this reason, most statistics about firms are broken out by size categories. In contrast, firms with 1,000 or more workers are the most influential employer group in calculating statistics regarding covered workers, since they employ the largest percentage of the nation’s workforce. Statistics among small firms and those weighted by the number of firms tend to have more variability.

Figure M.5: Distribution of Employers, Workers, and Workers Covered by Health Benefits, by Firm Size, 2024

Figure M.5: Distribution of Employers, Workers, and Workers Covered by Health Benefits, by Firm Size, 2024

Although most firms in the United States are small, most workers covered by health benefits are employed at large firms: 73% of the covered worker weight is controlled by firms with 200 or more employees. Conversely, firms with 3–199 employees represent 98% percent of the employer weight.

The survey asks firms what percentage of their employees earn more or less than a specified amount in order to identify the portion of a firm’s workforce that has relatively lower or higher wages. This year, the income threshold is $35,000 or less per year for lower-wage workers and $77,000 or more for higher-wage workers. These thresholds are based on the 25th and 75th percentile of workers’ earnings as reported by the Bureau of Labor Statistics using data from the Occupational Employment Statistics (OES) (2022).7 The cutoffs were inflation-adjusted and rounded to the nearest thousand.

Annual inflation estimates are calculated as an average of the first three months of the year. The 12 month percentage change for this period was 3.2%.8 Data presented is nominal unless indicated specifically otherwise.
.

ROUNDING AND IMPUTATION

Some figures in the report do not sum to totals due to rounding. Although overall totals and totals for size and industry are statistically valid, some breakdowns may not be available due to limited sample sizes or high relative standard errors. Where the unweighted sample size is fewer than 30 observations, figures include the notation “NSD” (Not Sufficient Data). Estimates with high relative standard errors are reviewed and in some cases not published. Many breakouts by subsets may have a large standard error, meaning that even large differences between estimates are not statistically different. Values below 3% are not shown on graphical figures to improve the readability of those graphs. The underlying data for all estimates presented in graphs are available in the Excel documents accompanying each section on ehbs.kff.org.

To control for item nonresponse bias, we impute values that are missing for most variables in the survey. On average, 12% of observations are imputed. All variables, with the exception of some single coverage premiums, are imputed following a hotdeck approach. The hotdeck approach replaces missing information with observed values from a firm similar in size and industry to the firm for which data are missing. For a given firm, if both single and family coverage premiums are missing, the single coverage premium is predicted using other known characteristics of the plan and firm through a random forest algorithm. This method reduces bias and improves subsequently hotdecked values (such as family premiums and worker contributions). In 2024, there were sixty-one variables where the imputation rate exceeded 20%; most of these cases were for individual plan level statistics. When aggregate variables were constructed for all of the plans, the imputation rate is usually much lower. There are a few variables that we have decided not to impute; these are typically variables where “don’t know” is considered a valid response option. Some variables are imputed based on their relationship to each other. For example, if a firm provided a worker contribution for family coverage but no premium information, a ratio between the family premium and family contribution was imputed and then the family premium was calculated. We estimate separate single and family coverage premiums for firms that provide premium amounts as the average cost for all covered workers.

To ensure data accuracy we have several processes to review outliers and illogical responses. Every year several hundred firms are called back to confirm or correct responses. In some cases, answers are edited based on responses to open-ended questions or based on established logic rules.

Figure M.6: Imputation Rates of Premiums, Worker Contributions, and Deductibles, by Plan Type, 2020-2024

Figure M.6: Imputation Rates of Premiums, Worker Contributions, and Deductibles, by Plan Type, 2020-2024

WEIGHTING

Because we select firms randomly, it is possible through the use of weights to extrapolate the results to national (as well as firm size, regional, and industry) averages. These weights allow us to present findings based on the number of workers covered by health plans, the number of total workers, and the number of firms. In general, findings in dollar amounts (such as premiums, worker contributions, and cost sharing) are weighted by covered workers. Other estimates, such as the offer rate, are weighted by firms.

The employer weight was determined by calculating the firm’s probability of selection. This weight was trimmed of overly influential weights and calibrated to U.S. Census Bureau’s 2021 Statistics of U.S. Businesses for firms in the private sector, and the 2022 Census of Governments totals. The worker weight was calculated by multiplying the employer weight by the number of workers at the firm and then following the same weight adjustment process described above. The covered-worker weight and the plan-specific weights were calculated by multiplying the percentage of workers enrolled in each of the plan types by the firm’s worker weight. These weights allow analyses of all workers covered by health benefits and of workers in a particular type of health plan.

The trimming procedure follows the following steps: First, we grouped firms into size and offer categories of observations. Within each strata, we calculated the trimming cut point as the median plus six times the interquartile range (M + [6 * IQR]). Weight values larger than this cut point are trimmed. In all instances, very few weight values were trimmed.

To account for design effects, the statistical computing package R version 4.4.0 (2024-04-24 ucrt) and the library “survey” version 4.4.2 were used to calculate standard errors.
.

STATISTICAL SIGNIFICANCE AND LIMITATIONS

All statistical tests are performed at the .05 confidence level. For figures with multiple years, statistical tests are conducted for each year against the previous year shown, unless otherwise noted. No statistical tests are conducted for years prior to 1999.

Statistical tests for a given subgroup are tested against all other firm sizes not included in that subgroup: For example, Northeast is compared to all firms NOT in the Northeast (an aggregate of firms in the Midwest, South, and West). However, statistical tests for estimates compared across plan types (for example, average premiums in PPOs) are tested against the “All Plans” estimate. In some cases, we also test plan-specific estimates against similar estimates for other plan types (for example, single and family premiums for HDHP/SOs against single and family premiums for HMO, PPO, and POS plans); these are noted specifically in the text. The two types of statistical tests performed are the t-test and the Wald test. The small number of observations for some variables resulted in large variability around the point estimates. These observations sometimes carry large weights, primarily for small firms. The reader should be cautioned that these influential weights may result in large movements in point estimates from year to year; however, these movements are often not statistically significant. Standard Errors for some key statistics are available in a technical supplement available at ehbs.kff.org.

Due to the complexity of many employer health benefits programs, this survey is not able to capture all the components of any particular plan. For example, many employers have complex and varied prescription drug benefits, premium contributions, and incentives for wellness programs. We attempted to complete interviews with the person who is most knowledgeable about the firm’s health benefits. In some cases, the firm may not know details of some elements of their plan and not others. While we collect information on the number of workers enrolled in health benefits, the survey is not able to capture the characteristics of the workers offered or enrolled in any particular plan.
.

DATA COLLECTION AND SURVEY MODE

Starting in 2022, we expanded the use of computer assisted web interview (CAWI), offering most respondents the opportunity to complete the survey using an online questionnaire rather a telephone interview. In 2024, fifty-seven percent of survey responses were completed via telephone interview, and the remainder were completed online.

Survey mode did not impact the survey results in a systematic or obvious manner. The effects of mode and firm size on major firm characteristics such as annual premiums, contributions, and deductibles was tested using standard linear regression. For certain plan types, survey implementation through telephone interview had a negative effect on the reported value. However, the plan types affected were random, so this effect is more likely due to confounding variables. When examining demographic characteristics between the two modes, there were small differences in the distribution of categorical variables such as region and age.
.

PRICSSA

In their Journal of Survey Statistics and Methodology article, Seidenberg, Moser, and West (2023) propose a checklist for survey administrators and sponsoring organizations to help external researchers quickly understand the methods used to create a complex sample dataset.9 The Preferred Reporting Items for Complex Sample Survey Analysis (PRICSSA) recommends a standard format to enumerate data collection and analysis techniques across a variety of different surveys. KFF has adopted this checklist to increase transparency for our readership and also to promote reproducibility among external researchers granted access to our public use files.

  • 1.1 Data collection dates: January 15, 2024–July 12, 2024.
  • 1.2 Data collection mode(s): fifty-seven percent computer-assisted telephone interviewing (CATI), and the remainder completed with computer assisted web interview (CAWI).
  • 1.3 Target population: Private firms as well as state and local government employers with three or more employees in 50 US states and Washington DC.
  • 1.4 Sample design: A sample stratified by ten industry categories and six size categories drawn from a Dynata list (based on a census assembled by Dun and Bradstreet) of the nation’s private employers and the Census of Governments for public employers.
  • 1.5 Full Survey response rate: 14 percent (CASRO method).
  • 2.1 Missingness rates: On average, 12% of observations are imputed.
  • 2.2 Observation deletion: Observations found to be duplicated firms.
  • 2.3 Sample sizes: 2,142 firms completed the entire survey, 4,769 completed at least the offer question, out of 26,426 initially sampled firms, generalizing to a total of three million firms.
  • 2.4 Confidence intervals / standard errors: All statistical tests are performed at the .05 confidence level.
  • 2.5 Weighting: empwt (firms), empwt_a6 (firms, including those answering only the offer question), wkrwt (workers), covwt (policyholders), hmowt, ppowt, poswt, and hdpwt (plan weights)
  • 2.6 Variance estimation: Taylor Series Linearization with newcell used as the stratum variable but no PSU variable.
  • 2.7 Subpopulation analysis: The R survey package toolkit such as svyby and a complex sample design’s subset method allowed for most analysis of subdomains.
  • 2.8 Suppression rules: Where the unweighted sample size is fewer than 30 observations, figures include the notation “NSD” (Not Sufficient Data). Estimates with high relative standard errors are reviewed and in some cases not published.
  • 2.9 Software and code: All design-based analyses were performed using R version 4.4.0 (2024-04-24 ucrt) and survey library version 4.4.2.
2024 SURVEY

The 2024 survey features new questions, on prescription drug policy, mental health and substance use, GLP-1 drug coverage, family-building services, transparency, and affordability issues for lower-wage workers.

As in previous years, modification were made to existing survey questions, both to improve clarity or respond to changes in the marketplace. Starting in 2024, the introductory text for web-based interviews was condensed. In an effort to reduce respondent burden, we removed questions on waiting periods, emergency room cost sharing, wellness program incentives, disease management, sites of care, and prior authorization. We also restricted clarification follow-up calls (or callbacks) to only firms with at least one premium or cost sharing-related item: observations for firms with outliers only in the drug tier section of the questionnaire were imputed based on logic rules.

In some cases respondents report they offer coverage, but do not offer a comprehensive major medical plan. This year we added additional questions to clarify whether plans with low premiums were major medical plans. We dropped fewer than five firms where the respondent told us they did not cover most physician services and hospital admissions other than preventative care from the premium average.

9.8 percent of respondents in 2024 were either unable to provide their firm’s single coverage premium, or provided a response which was logically inconsistent with other values. In these cases, in order to minimize non-response bias, missing responses were imputed. In prior years, if a firm was missing a single premium value, this value was imputed using a ‘hot deck’ approach in which a single premium was selected among firms with similar demographic characteristics. This year we revised this method. In cases in which both the single and family premiums were missing, the single premium was estimated based on other known characteristics of the plan and firm. This method is advantageous because it predicts single premiums based on specific firm characteristics related to the premium, such as cost-sharing and deductibles, as well as demographics. Then, the family premium, as well as single and family worker contributions (if missing), were subsequently hot decked based on their relationship to the single premium. In cases in which a firm provides a response to the family premium, the single premium is imputed based on a ratio of the premiums. Therefore, in total, 8% of responses were affected by this change.

The updated imputation process uses a random forest machine learning model. The model was trained on EHBS data from 2021-2023. Features were selected using step wise regression. The hyper-parameters of the model were tuned using the grid search algorithm. Compared to hot-decking, this method is significantly better at explaining the variation in the observed data (R-squared of 0.2071 versus 0.00018540). This methodological change has a relatively minor impact on the overall premium; the 2024 premium would be 0.8% different if we did not implement the new method. However, this change meaningfully impacts the precision of premium estimates for demographic subgroups.
.

OTHER RESOURCES

Additional information on the 2024 Employer Health Benefit Survey is available at ehbs.kff.org, including an article in the Journal Health Affairs, an interactive graphic and historic reports. Standard errors for some statistics are available in the online technical supplement. Researchers may also request a public use dataset here: https://www.kff.org/contact-us/

The survey design and methods section found on our website (ehbs.kff.org) contains an extended methods document that was not included in the portable document format (PDF) or the printed versions of this book. Readers interested in the extended methodology should consult the online edition of this publication.

Published: October 9, 2024. Last Updated: October 03, 2024.
.

HISTORICAL DATA

Data in this report focus primarily on findings from surveys conducted and authored by KFF since 1999. Between 1999 and 2017, the Health Research & Educational Trust (HRET) co-authored this survey. HRET’s divestiture had no impact on our survey methods, which remain the same as years past. Prior to 1999, the survey was conducted by the Health Insurance Association of America (HIAA) and KPMG using a similar survey instrument, but data are not available for all the intervening years. Following the survey’s introduction in 1987, the HIAA conducted the survey through 1990, but some data are not available for analysis. KPMG conducted the survey from 1991-1998. However, in 1991, 1992, 1994, and 1997, only larger firms were sampled. In 1993, 1995, 1996, and 1998, KPMG interviewed both large and small firms. In 1998, KPMG divested itself of its Compensation and Benefits Practice, and part of that divestiture included donating the annual survey of health benefits to HRET.

This report uses historical data from the 1993, 1996, and 1998 KPMG Surveys of Employer-Sponsored Health Benefits and the 1999-2017 Kaiser/HRET Survey of Employer-Sponsored Health Benefits. For a longer-term perspective, we also use the 1988 survey of the nation’s employers conducted by the HIAA, on which the KPMG and KFF surveys are based. The survey designs for the three surveys are similar.

[Figure M.7] displays the historic sample sizes and weights of firms, workers, and covered workers (employees receiving coverage from their employer).

Figure M.7: Historic Firm Sample Sizes and Weights, 1999-2024

Figure M.7: Historic Firm Sample Sizes and Weights, 1999-2024

[Figure M.8] displays the historic sample frames and weighting universes.

Figure M.8: Sampling and Weighting Targets, 1999-2024

Figure M.8: Sampling and Weighting Targets, 1999-2024

1999

The Kaiser Family Foundation and The Health Research and Educational Trust (Kaiser/HRET) began sponsoring the survey of employer-sponsored health benefits supported for many years by KPMG Peat Marwick LLP, an international consulting and accounting firm. In 1998, KPMG divested itself of its Compensation and Benefits Practice, and donated the annual survey of health benefits to HRET, a non-profit research organization affiliated with the American Hospital Association. From 1999 until 2017, the survey was conducted under a partnership between HRET and The Kaiser Family Foundation, a health care philanthropy and policy research organization that is not affiliated with Kaiser Permanente or Kaiser Industries. Starting in 1999, survey continued a core set of questions from prior KPMG surveys, but was expanded to include small employers and a variety of policy-oriented questions. Some reports include data from the 1993, 1996 and 1998 KPMG Surveys of Employer-Sponsored Health Benefits. For a longer-term perspective, we also use the 1988 survey of the nation’s employers conducted by the Health Insurance Association of America (HIAA), on which the KPMG, Kaiser/HRET, and Kaiser Family Foundation surveys were based. Many of the questions in the HIAA, KPMG, Kaiser/HRET, and Kaiser Family Foundation surveys are identical, as is the sample design. Since Point-of-Service (POS) plans did not exist in 1988, reports do not include statistics for this plan type in that year. Starting in 1999, Kaiser/HRET drew its sample from a Dun & Bradstreet list of the nation’s private and public employers with three or more workers. To increase precision, Kaiser/HRET stratified the sample by industry and the number of workers in the firm. Kaiser/HRET attempted to repeat interviews with many of the 2,759 firms interviewed in 1998 and replaced non-responding firms with another firm from the same industry and firm size. As a result, 1,377 firms in the 1999 total sample of 1,939 firms participated in both the 1998 and 1999 surveys.

For more detail about the 1999 survey, see the Survey Methodology section of that year’s report.

2000

Kaiser/HRET attempted to repeat interviews with many of the 1,939 firms interviewed in 1999 and replaced non-responding firms with other firms of the same industry and firm size. As a result, 982 firms in the 2000 survey’s total sample of 1,887 firms participated in both the 1999 and 2000 surveys. The overall response rate was 45% down from 60% in 1999. Contributing to the declining response rate was the decision not to re-interview any firms with 3-9 workers who participated in the 1999 survey. In 1999, the survey weights had instead been adjusted to control for the fact that firms with 3-9 workers that are in the panel (responded in either 1998 or 1999) are biased in favor of offering a health plan. The response rate in 2000 for firms with 3-9 workers was 30%.

For more detail about the 2000 survey, see the Survey Methodology section of that year’s report.

2001

For more detail about the 2001 survey, see the Survey Methodology section of that year’s report.

2002

The list of imputed variables was greatly expanded in 2002 to also include self-insurance status, level of benefits, prescription drug cost-sharing, copay and coinsurance amounts for prescription drugs, and firm workforce characteristics such as average income, age and part-time status. On average, 2% of these observations are imputed for any given variable. The imputed values are determined based on the distribution of the reported values within stratum defined by firm size and region.

For more detail about the 2002 survey, see the Survey Methodology section of that year’s report.

2003

The calculation of the weights followed a similar approach to previous years, but with several notable changes in 2003. First, as in years past, the basic weight was determined, followed by a nonresponse adjustment added this year to reflect the fact that small firms that do not participate in the full survey are less likely to offer health benefits and, consequently, are unlikely to answer the single offer rate question. To make this adjustment, Kaiser/HRET conducted a follow-up survey of all firms with 3-49 workers that did not participate in the full survey. Each of these 1,744 firms was asked the single question, “Does your company offer or contribute to a health insurance program as a benefit to its employees?” The main difference between this follow-up survey and the original survey is that in the follow-up survey the first person who answered the telephone was asked whether the firm offered health benefits, whereas in the original survey the question was asked of the person who was identified as most knowledgeable about the firm’s health benefits. Conducting the follow-up survey accomplished two objectives. First, statistical techniques (a McNemar analysis which was confirmed by a chi-squared test) demonstrated that the change in method-speaking with the person answering the phone rather than a benefits manager-did not bias the results of the follow-up survey. Analyzing firms who responded to the offer question twice, in both the original and follow-up survey, proved that there was no difference in the likelihood that a firm offers coverage based on which employee answered the question about whether a firm offers health benefits. Second, the follow-up survey demonstrated that very small firms not offering health benefits to their workers are less likely to answer the one survey question about coverage. Kaiser/HRET analyzed the group of firms that only responded to the follow-up survey and performed a t-test between the firms who had responded to the initial survey as well as the follow-up, and those who only responded to the follow-up. Tests confirmed the hypothesis that the firms that did not answer the single offer rate question in the original survey were less likely to offer health benefits. To adjust the offer rate data for this finding an additional non-response adjustment was applied to increase the weight of firms in the sample that do not offer coverage. The second change to the weighting method in 2003 was to trim the weights in order to reduce the influence of weight outliers. On occasion one or two firms will, through the weighting process, represent a highly disproportionate number of firms or covered workers. Rather than excluding these observations from the sample, a set cut point that would minimize the variances of several key variables (such as premium change and offer rate) was determined. The additional weight represented by outliers is then spread among the other firms in the same sampling cell. Finally, a post-stratification adjustment was applied. In the past, Kaiser/HRET was poststratified back to the Dun & Bradstreet frequency counts. Concern over volatility of counts in recent years led to the use of an alternate source for information on firm and industry data. This year the survey uses the recently released Statistics of U.S. Businesses conducted by the U.S. Census as the basis for the post-stratification adjustment. These Census data indicate the percentage of the nation’s firms with 3-9 workers is 59% rather than the higher percentages (e.g., 76% in 2002) derived from Dun & Bradstreet’s national database. This change has little impact on worker-based estimates, since firms with 3-9 workers accounted for less than 10% of the nation’s workforce. The impact on estimates expressed as a percentage of employers (e.g., the percent of firms offering coverage), however, may be significant. Due to these changes, Kaiser/HRET recalculated the weights for survey years 1999-2002 and modified estimates published in the survey where appropriate. The vast majority of these estimates are not statistically different. However, please note that the survey data published starting in 2003 varies slightly from previously published reports.

For more detail about the 2003 survey, see the Survey Methodology section of that year’s report.

2004

For more detail about the 2004 survey, see the Survey Methodology section of that year’s report.

2005

In 2005, the Kaiser/HRET survey added two additional sections to the questionnaire to collect information about high-deductible health plans (HDHP) that are offered along with a health reimbursement account (HRA) or are health savings account (HSA) qualified. Questions in these sections were asked of all firms offering these plan types, regardless of enrollment. Specific weights were also created to analyze the HDHP plans that are offered along with HRAs or are HSA qualified. These weights represent the proportion of employees enrolled in each of these arrangements.

We updated our data to reflect the 2002 Census of Governments. We also removed federal government employee counts from our post-stratification.

For more detail about the 2005 survey, see the Survey Methodology section of that year’s report.

2006

For the first time in 2006, Kaiser/HRET asked questions about the highest enrollment HDHP/SO as a separate plan type, equal to the other plan types. In prior years, data on HDHP/SO plans were collected as part of one of the other types of plans. Therefore, the removal of HDHP/SOs from the other plan types may affect the year to year comparisons for the other plan types. Given the decline in conventional health plan enrollment and the addition of HDHP/SO as a plan type option, Kaiser/HRET eliminated nearly all questions pertaining to conventional coverage from the survey instrument. We continue to ask firms whether or not they offer a conventional health plan and, if so, how much their premium for conventional coverage increased in the last year, but respondents are not asked additional questions about the attributes of the conventional plans they offer. Because we have limited information about conventional health plans, we must make adjustments in calculating all plan averages or distributions. In cases where a firm offers only conventional health plans, no information from that respondent is included in all plan averages. The exception is for the rate of premium growth, for which we have information. If a firm offers a conventional health plan and at least one other plan type, for categorical variables we assign the values from the health plan with the largest enrollment (other than the conventional plan) to the workers in the conventional plan. In the case of continuous variables, covered workers in conventional plans are assigned the weighted average value of the other plan types in the firm.

The survey newly distinguished between plans that have an aggregate deductible amount in which all family members’ out-of-pocket expenses count toward the deductible and plans that have a separate amount for each family member, typically with a limit on the number of family members required to reach that amount.

In 2006, Kaiser/HRET began asking employers if they had a health plan that was an exclusive provider organization (EPO). We treat EPOs and HMOs together as one plan type and report the information under the banner of “HMO”; if an employer sponsors both an HMO and an EPO, they are asked about the attributes of the plan with the larger enrollment.

Kaiser/HRET made a slight change to one of the industry groups: we removed Wholesale from the group that also included Agriculture, Mining and Construction. The nine industry categories now reported are: Agriculture/Mining/Construction, Manufacturing, Transportation/Communications/Utilities, Wholesale, Retail, Finance, Service, State/Local Government, and Health Care.

Starting in 2006, we made an important change to the way we test the subgroups of data within a year. Statistical tests for a given subgroup (firms with 25-49 workers, for instance) are tested against all other firm sizes not included in that subgroup (all firm sizes NOT including firms with 25-49 workers in this example). Tests are done similarly for region and industry: Northeast is compared to all firms NOT in the Northeast (an aggregate of firms in the Midwest, South, and West). Statistical tests for estimates compared across plan types (for example, average premiums in PPOs) are tested against the “All Plans” estimate. In some cases, we also test plan specific estimates against similar estimates for other plan types (for example, single and family premiums for HDHP/SOs against single and family premiums in HMO, PPO, and POS plans). Those are noted specifically in the text. This year, we also changed the type of Chi-square test from the Chi-square test for goodness-of-fit to the Pearson Chi-square test. Therefore, in 2006, the two types of statistical tests performed are the t-test and the Pearson Chi-square test.

For more detail about the 2006 survey, see the Survey Methodology section of that year’s report.

2007

Kaiser/HRET drew its sample from a Survey Sampling Incorporated list (based on an original Dun and Bradstreet list) of the nation’s private and public employers with three or more workers.

In prior years, many variables were imputed following a hotdeck approach, while others followed a distributional approach (where values were randomly determined from the variable’s distribution, assuming a normal distribution). This year, all variables are imputed following a hotdeck approach. This imputation method does not rely on a normal distribution assumption and replaces missing values with observed values from a firm with similar characteristics, in this case, size and industry. Due to the low imputation rate for most variables, the change in methodology is not expected to have a major impact on the results. In some cases, due to small sample size, imputed outliers are excluded. There are a few variables that Kaiser/HRET has decided should not be imputed; these are typically variables where “don’t know” is considered a valid response option (for example, firms’ opinions about effectiveness of various strategies to control health insurance costs).

The survey now contains a few questions on employee cost sharing that are asked only of firms that indicate in a previous question that they have a certain cost-sharing provision. For example, the copayment amount for prescription drugs is asked only of those that report they have copayments for prescription drugs. Because the composite variables are reflective of only those plans with the provision, separate weights for the relevant variables were created in order to account for the fact that not all covered workers have such provisions.

For more detail about the 2007 survey, see the Survey Methodology section of that year’s report.

2008

National Research, LLC (NR), our Washington, D.C.-based survey research firm, introduced a new CATI (Computer Assisted Telephone Interview) system at the end of 2007, and, due to several delays in the field, obtained fewer responses than expected. As a result, an incentive of $50 was offered during the final two and a half weeks the survey was in the field. Kaiser/HRET compared the distribution of key variables between firms receiving the incentive and firms not receiving the incentive to determine any potential bias. Chi-square test results were not significant, suggesting minimal to no bias.

In 2008, we changed the method used to report the annual percentage premium increase. In prior years, the reported percentage was based on a series of questions that asked responding firms the percentage increase or decrease in premiums from the previous year to the current year for a family of four in the largest plan of each plan type (e.g., HMO, PPO). The reported premium increase was the average of the reported percentage changes (i.e., 6.1% for 2007) weighted by covered workers. This year, we calculate the overall percentage increase in premiums from year to year for family coverage using the average of the premium dollar amounts for a family of four in the largest plan of each plan type reported by respondents and weighted by covered workers (i.e., $12,106 for 2007 and $12,680 for 2008, an increase of 5%). A principal advantage of using the premium dollar amounts to calculate the annual change in premiums is that we are better able to capture changes in the cost of health insurance for those firms that are newly in the market or that change plan types, especially those that move to plans with very different premium levels. For example, in the first year that a firm offers a plan of a new plan type, such as a consumer-directed plan, the firm can report the level of the premium they paid, but using the previous method would be unable to report the rate of change from the previous year since the plan was not previously offered. If the premium for the new plan is relatively low compared to other premiums in the market, the relatively low premium amount that the firm reports will tend to lower the weighted average premium dollar amount reported in the survey, but the firm responses would not provide any information to the percentage premium increase question. Another advantage of using premium dollar amounts to examine trends is that these data directly relate to the other findings in the survey and better address a principal public policy issue (i.e., what was the change in the cost of insurance over some past period). Many users noted, for example, that the percentage change calculated from the reported premium dollar amounts between two years did not directly match the reported average premium increase for the same period. There are several reasons why we would not expect these questions to produce identical results: 1) they are separate questions subject to varying degrees of reporting error, 2) firms could report a premium dollar amount for a plan type they might not have offered in the previous year, therefore, contributing information to one measure but not the other, or 3) firms could report a premium dollar amount for a plan that was not the largest plan of that type in the previous year. Although the two approaches have generated similar results in terms of the long-term growth rate of overall family premiums, there are greater discrepancies in trends for subgroups like small employers and self-funded firms. Focusing on the dollar amount changes over time will provide a more reliable and consistent measure of premium change that also is more sensitive to firms offering new plan options.

As we have in past years, this year we collected information on the cost-sharing provisions for hospital admissions and outpatient surgery that is in addition to any general annual plan deductible. However, for the 2008 survey, we changed the structure of the question and now include “separate annual deductible or hospital admissions” as a response option rather than collecting the information through a separate question. We continue to examine and sometimes modify the questions on hospital and outpatient surgery cost sharing because this can be a complex component of health benefit plans. For example, for some plans it is difficult to distinguish a separate hospital deductible from one categorized as a general annual deductible, where office visits and preventive care are covered and the deductible only applies to hospital use. Because this continues to be a point of confusion, we continue to refine the series of questions in order to clearly convey the information we are attempting to collect from respondents.

As in 2007, we asked firms if they offer health benefits to opposite-sex or same-sex domestic partners. However, this year, we changed the response options because during early tests of the 2008 survey, several firms noted that they had not encountered the issue yet, indicating that the responses of “yes,” “no,” and “don’t know” were insufficient. Therefore, this year we added the response option “not applicable/not encountered” to better capture the number of firms that report not having a policy on the issue.

For more detail about the 2008 survey, see the Survey Methodology section of that year’s report.

2009

In the fall of 2008, with guidance from experts in survey methods and design from NORC, we reviewed the methods used for the survey. As a result of this review, several important modifications were made to the 2009 survey, including the sample design and questionnaire. For the first time, this year we determined the sample requirements based on the universe of firms obtained from the U.S. Census rather than Dun and Bradstreet. Prior to the 2009 survey, the sample requirements were based on the total counts provided by Survey Sampling Incorporated (SSI) (which obtains data from Dun and Bradstreet). Over the years, we have found the Dun and Bradstreet frequency counts to be volatile because of duplicate listings of firms, or firms that are no longer in business. These inaccuracies vary by firm size and industry. In 2003, we began using the more consistent and accurate counts provided by the Census Bureau’s Statistics of U.S. Businesses and the Census of Governments as the basis for post-stratification, although the sample was still drawn from a Dun and Bradstreet list. In order to further address this concern at the time of sampling, we now also use Census data as the basis for the sample. This change resulted in shifts in the sample of firms required in some size and industry categories.

This year, we also defined Education as a separate sampling category, rather than as a subgroup of the Service category. In the past, Education firms were a disproportionately large share of Service firms. Education is controlled for during post-stratification, and adjusting the sampling frame to also control for Education allows for a more accurate representation of both Education and Service industries.

In past years, both private and government firms were sampled from the Dun and Bradstreet database. For the 2009 sample, Government firms were sampled in-house from the 2007 Census of Governments. This change was made to eliminate the overlap of state agencies that were frequently sampled from the Dun and Bradstreet database. Each year the survey attempts to repeat interviews with respondents from past years (see “Response Rate” section below), and in order to maintain government firms that had completed the survey in the past (firms that have completed the survey in the past are known as panel firms), government firms from the 2008 survey were matched to the Census of Governments to identify phone numbers. All panel government firms were included in the sample (resulting in an oversample). In addition, the sample of private firms is screened for firms that are related to state/ local governments, and if these firms are identified in the Census of Governments, they are reclassified as government firms and a private firm is randomly drawn to replace the reclassified firm. These changes to the sample frame resulted in an expected slight reduction in the overall response rate, since there were shifts in the number of firms needed by size and industry. Therefore, the data used to determine the 2009 Employer Health Benefits sample frame include the U.S. Census’ 2005 Statistics of U.S. Businesses and the 2007 Census of Governments. At the time of the sample design (December 2008), these data represented the most current information on the number of public and private firms nationwide with three or more workers. As in the past, the post- stratification is based on the most up-to-date Census data available (the 2006 update to the Census of U.S. Businesses was purchased during the survey field period) and the 2007 Census of Governments. The Census of Governments is conducted every five years, and this is the first year the data from the 2007 Census of Governments have been available for use.

Based on recommendations from cognitive researchers at NORC and internal analysis of the survey instrument, a number of questions were revised to improve the clarity and flow of the survey in order to minimize survey burden. For example, in order to better capture the prevalence of combinations of inpatient and outpatient surgery cost sharing, the survey was changed to ask a series of yes or no questions. Previously, the question asked respondents to select one response from a list of types of cost sharing, such as separate deductibles, copayments, coinsurance, and per diem payments (for hospitalization only). We have also expanded the number of questions for which respondents can provide either the number of workers or the percentage of workers. Previously, after obtaining the total number of employees, the majority of questions asked about the percentage of workers with certain characteristics. Now, for questions such as the percentage of workers making $23,000 a year or less or the enrollment of workers in each plan type, respondents are able to respond with either the number or the percentage of workers. Few of these changes have had any noticeable impact on responses.

For more detail about the 2009 survey, see the Survey Methodology section of that year’s report.

2010

New topics in the 2010 survey include questions on eligibility for dependent coverage, coverage for care received at retail clinics, health plan changes as a result of the Mental Health Parity and Addiction Equity Act of 2008, and disease management. As in past years, this year’s survey included questions on the cost of health insurance, offer rates, coverage, eligibility, enrollment patterns, premiums, employee cost sharing, prescription drug benefits, retiree health benefits, wellness benefits, and employer opinions.

Firms in the sample with 3-49 workers that did not complete the full survey are contacted and asked (or re-asked in the case of firms that previously responded to only one question about offering benefits) whether or not the firm offers health benefits. As part of the process, we conduct a McNemar test to verify that the results of the follow-up survey are comparable to the results from the original survey. If the test indicates that the results are comparable, a nonresponse adjustment is applied to the weights used when calculating firm offer rates. This year, for the first time since we began conducting the follow-up survey, the test indicated that the results from those answering the one question about offering health benefits in the original survey and those answering the follow-up survey were different (statistically significant difference at the p<0.05 level between the two surveys), suggesting the results are not comparable. Therefore, we did not use the results of this follow-up survey to adjust the weights as we have in the past. In the past, the nonresponse adjustment lowered the offer rate for smaller firms by one to three percentage points, so not making the adjustment this year makes the offer rate look somewhat higher when making comparisons to prior years. For 2010, we saw a very large and unexpected increase in the offer rate (from 60 percent in 2009 to 69 percent in 2010) overall and particularly for firms with 3 to 9 workers (from 46 percent in 2009 to 59 percent in 2010). While not making the adjustment this year added to the size of the change, there would have been a large and difficult to explain change even if a nonresponse adjustment comparable to previous years had been made.

For more detail about the 2010 survey, see the Survey Methodology section of that year’s report.

2011

New topics in the 2011 survey include questions on stoploss coverage for self-funded plans, cost sharing for preventive care, plan grandfathering resulting from the Affordable Care Act (ACA), and employer awareness of tax credits authorized under the ACA.

This year, we became aware that the way we have been using the data from the Census Bureau for calibration was incorrect and resulted in an over-count of the actual number of firms in the nation. Specifically, firms operating in more than one industry were counted more than once in computing the total firm count by industry, and firms with establishments were counted more than once in computing the total firm count by state (which affects the regional count). Because smaller firms are less likely to operate in more than one industry or state, the miscounts occurred largely for larger from sizes. The error affects only statistics that are weighted by the number of firms (such as the percent of firms offering health benefits or sponsoring a disease management plan). Statistics that are weighted by the number of workers or covered workers (such as average premiums, contributions, or deductibles) were not affected. We addressed this issue by proportionally distributing the correct national total count of firms within each firm size as provided by the U.S. Census Bureau across industry and state based on the observed distribution of workers. This effectively weights each firm within each category (industry or state) in proportion to its share of workers in that category. The end result is a synthetic count of firms across industry and state that sums to the national totals. Firm-weighted estimates resulting from this change show only small changes from previous estimates, because smaller firms have much more influence on national estimates. For example, the estimate of the percentage of firms offering coverage was reduced by about .05 percentage points in each year (in some years no change is evident due to rounding). Estimates of the percentage of large firms offering retiree benefits were reduced by a somewhat larger amount (about 2 percentage points). Historical estimates used in the 2011 survey release have been updated following this same process. As noted above, worker-weighted estimates from prior years were not affected by the miscount and remain the same.

For more detail about the 2011 survey, see the Survey Methodology section of that year’s report.

2012

New topics in the 2012 survey include the use of biometric screening, domestic partner benefits, and emergency room cost sharing. In addition, many of the questions on health reform included in the 2011 survey were retained, including stoploss coverage for self-funded plans, cost sharing for preventive care, and plan grandfathering resulting from the Affordable Care Act (ACA).

There are several variables in which missing data is calculated based on respondents’ answers to other questions (for example, when missing employer contributions to premiums is calculated from the respondent’s premium and the ratio of contributions to premiums). In 2012 the method to calculate missing premiums and contributions was revised; if a firm provides a premium for single coverage or family coverage, or a worker contribution for single coverage or family coverage, that information was used in the imputation. For example, if a firm provided a worker contribution for family coverage but no premium information, a ratio between the family premium and family contribution was imputed and then the family premium was calculated. In addition, in cases where premiums or contributions for both family and single coverage were missing, the hotdeck procedure was revised to draw all four responses from a single firm. The change in the imputation method did not make a significant impact on the premium or contribution estimates.

In 2012, the method for calculating the size of the sample was adjusted. Rather than using a combined response rate for panel and non-panel firms, separate response rates were used to calculate the number of firms to be selected in each strata. In addition, the mining stratum was collapsed into the agriculture and construction industry grouping. In sum, changes to the sampling method required more firms to be included and may have reduced the response rate in order to provide more balanced power within each strata.

To account for design effects, the statistical computing package R and the library package “survey” were used to calculate standard errors. All statistical tests are performed at the .05 level, unless otherwise noted. For figures with multiple years, statistical tests are conducted for each year against the previous year shown, unless otherwise noted. No statistical tests are conducted for years prior to 1999. In 2012 the method to test the difference between distributions across years was changed to use a Wald test which accounts for the complex survey design. In general this method was more conservative than the approach used in prior years.

In 2012, average coinsurance rates for prescription drugs, primary care office visits, specialty office visits, and emergency room visits include firms that have a minimum and/or maximum attached to the rate. In years prior to 2012 we did not ask firms the structure of their coinsurance rate. For most prescription drug tiers, and most services, the average coinsurance rate is not statically different depending on whether the plan has a minimum or maximum.

In 2012 the calculation of the response rates was adjusted to be slightly more conservative than previous years.

For more detail about the 2012 survey, see the Survey Methodology section of that year’s report.

2013

Starting in 2013, information on conventional plans was collected under the PPO section and therefore the covered worker weight was representative of all plan types.

For more detail about the 2013 survey, see the Survey Methodology section of that year’s report.

2014

Starting in 2014, we elected to estimate separate single and family coverage premiums for firms that provided premium amounts as the average cost for all covered workers, instead of differentiating between single and family coverage. This method more accurately accounted for the portion that each type of coverage contributes to the total cost for the 1 percent of covered workers who are enrolled at firms affected by this adjustment.

Several provisions of the ACA took effect on January 1, 2014 which impacted non-grandfathered plans as well as plans renewing in calendar year 2014, such as the requirement to have an out of pocket limit and a waiting period of not more than three months. As a result, firms with non-grandfathered plans that reported that they did not have out-of-pocket limits, or waiting periods exceeding three months, were contacted during our data-confirmation calls. We did not have information on the month in which a firm’s plan or plans was renewed. Many of these firms indicated that they had a plan year starting prior to January 2014, so these ACA provision were not yet in effect for these plans.

Firms with 200 or more workers were asked: “Does your firm offer health benefits for current employees through a private or corporate exchange? A private exchange is one created by a consulting firm or an insurance company, not by either a federal or state government. Private exchanges allow employees to choose from several health benefit options offered on the exchange.” Employers were still asked for plan information about their HMO, PPO, POS and HDHP/SO plan regardless of whether they purchased health benefits through a private exchange or not.

Beginning in 2014, we collected whether firms with a non-final disposition code (such as a firm that requested a callback at a later time or date) offered health benefits. By doing so we attempt to mitigate any potential non-response bias of firms either offering or not offering health benefits on the overall offer rate statistic.

For more detail about the 2014 survey, see the Survey Methodology section of that year’s report.

2015

To increase response rates, firms with 3-9 employees were offered an incentive of $75 in cash or as a donation to a charity of their choice to complete the full survey.

In 2015, weights were not adjusted using the nonresponse adjustment process described in previous years’ methods. As in past years, Kaiser/HRET conducted a small follow-up survey of those firms with 3 to 49 workers that refused to participate in the full survey. Based on the results of a McNemar test, we were not able to verify that the results of the follow-up survey were comparable to the results from the original survey. In 2010, the results of the McNemar test were also significant and we did not conduct a nonresponse adjustment.

The 2015 survey contains new information in several areas, including on wellness and biometric screening. In most cases, information reported in this section is not comparable with previous years’ findings. Data presented in the 2015 report reflect the firm’s benefits at the time they completed the interview. Some firms may report on a plan which took effect in the prior calendar year. Starting in 2015, firms were able to have a contribution and deductible in compliance with HSA requirements for the plan year.

Starting in 2015, employers were asked how many full-time equivalents they employed. In cases in which the number of full-time equivalents was relevant to the question, interviewer skip patterns may have depended on the number of FTEs.

In cases where a firm had multiple plans, they were asked about their strategies for containing the cost of specialty drugs for the plan with the largest enrollment.

Under the Affordable Care Act, non-grandfathered plans are required to have an out-of- pocket maximum. Non-grandfathered plans who indicated that they did not have an out of pocket maximum were asked to confirm whether their plan was grandfathered and whether that plan had an out-of-pocket maximum.

For more detail about the 2015 survey, see the Survey Methodology section of that year’s report.

2016

Between 2015 and 2016, we conducted a series of focus groups that led us to the conclusion that human resource and benefit managers at firms with between 20 and 49 employees think about health insurance premiums more similarly to benefit managers at smaller firms than larger firms. Therefore, starting in 2016, we altered the health insurance premium question pathway for firms with between 20-49 employees to match that of firms with 3-19 employees rather than firms with 50 or more employees. This change affected firms representing 8% of the total covered worker weight. We believe that these questions produce comparable responses and that this edit does not create a break in trend.

Starting in 2016, we made significant revisions to how the survey asks employers about their prescription drug coverage. In most cases, information reported in the Prescription Drug Benefits section is not comparable with previous years’ findings. First, in addition to the four standard tiers of drugs (generics, preferred, non-preferred, and lifestyle), we began asking firms about cost sharing for a drug tier that covers only specialty drugs. This new tier pathway in the questionnaire has an effect on the trend of the four standard tiers, since respondents to the 2015 survey might have previously categorized their specialty drug tier as one of the other four standard tiers. We did not modify the question about the number of tiers a firm’s cost-sharing structure has, but in cases in which the highest tier covered exclusively specialty drugs we reported it separately. For example, a firm with three tiers may only have copays or coinsurances for two tiers because their third tier copay or coinsurance is being reported as a specialty tier. Furthermore, in order to reduce survey burden, firms were asked about the plan attributes of only their plan type with the most enrollment. Therefore, in most cases, we no longer make comparisons between plan types. Lastly, prior to 2016, we required firms’ cost sharing tiers to be sequential, meaning that the second tier copay was higher than the first tier, the third tier was higher than the second, and the fourth was higher than the third. As drug formularies have become more intricate, many firms have minimum and maximums attached to their copays and coinsurances, leading us to believe it was no longer appropriate to assume that a firm’s cost sharing followed this sequential logic.

In cases where a firm had multiple plans, they were asked about their strategies for containing the cost of specialty drugs for the plan type with the largest enrollment. Between 2015 and 2016, we modified the series of ‘Select All That Apply’ questions regarding cost containment strategies for specialty drugs. In 2016, we elected to impute firms’ responses to these questions. We removed the option “Separate cost sharing tier for specialty drugs” and added specialty drugs as their own drug tier questionnaire pathway. We added question options on mail order drugs and prior authorization.

In 2016, we modified our questions about telemedicine to clarify that we were interested in the provision of health care services, and not merely the exchange of information, through telecommunication. We also added dependent and spousal questions to our health risk assessment question pathway.

For more detail about the 2016 survey, see the Survey Methodology section of that year’s report.

2017

While the Kaiser/HRET survey similar to other employer and household surveys has seen a general decrease in response rates over time, the decrease between the 2016 and 2017 response rates is not solely explained by this trend. In order to improve statistical power among sub-groups, including small firms and those with a high share of low income workers, the size of the sample was expanded from 5,732 in 2016 to 7,895 in 2017. As a result, the 2017 survey includes 204 more completes than the 2016 survey. While this generally increases the precision of estimates (for example, a reduction in the standard error for the offer rate from 2.2% to 1.8%), it has the effect of reducing the response rate. In 2017, non-panel firms had a response rate of 17%, compared to 62% for firms that had participated in one of the last two years.

To increase response rates, firms with 3-9 employees were offered an incentive for participating in the survey. A third of these firms were sent a $5 Starbucks gift card in the advance letter, a third were offered an incentive of $50 in cash or as a donation to a charity of their choice after completing the full survey, and a third of firms were offered no incentive at all. Our analysis does not show significant differences in responses to key variables among these incentive groups.

In 2017, weights were not adjusted using the nonresponse adjustment process described in previous years’ methods. As in past years, Kaiser/HRET conducted a small follow-up survey of those firms with 3-49 workers that refused to participate in the full survey. Based on the results of a McNemar test, we were not able to verify that the results of the follow-up survey were comparable to the results from the original survey. In 2010 and 2015, the results of the McNemar test were also significant and we did not conduct a nonresponse adjustment.

To reduce the length of survey, in several areas, including stoploss coverage for self-funded firms and cost sharing for hospital admissions, outpatient surgery, and emergency room visits, we revised the questionnaire to ask respondents about the attributes of their largest health plan rather than each plan type they may offer. This expands on the method we used for prescription drug coverage in 2016. Therefore, for these topics, aggregate variables represent the attributes of the firm’s largest plan type, and are not a weighted average of all of the firms plan types. In previous surveys, if a firm had two plan types, one with a copayment and one with a coinsurance for hospital admissions, the covered worker weight was allotted proportionally toward the average copayment and coinsurance based on the number of covered workers with either feature. With of this change, comparison among plans types is now a comparison of firms where any given plan type is the largest. The change only affects firms that have multiple plan types (58% of covered workers). After reviewing the responses and comparing them to prior years where we asked about each plan type, we find that the information we are receiving is similar to responses from previous years. For this reason, we will continue to report our results for these questions weighted by the number of covered workers in responding firms.

Starting in 2017, respondents were allowed to volunteer that their plans did not cover outpatient surgery or hospital admissions. Less than 1% of respondents indicated that their plan did not include coverage for these services. Cost sharing for hospital admissions, outpatient surgery and emergency room visits was imputed by drawing a firm similar in size and industry within the same plan type.

For more detail about the 2017 survey, see the Survey Methodology section of that year’s report.

2018

As in past years, we conducted a small follow-up survey of those firms with 3-49 workers that refused to participate in the full survey. Based on the results of a McNemar test, we were not able to verify that the results of the follow-up survey were comparable to the results from the original survey, and weights were not adjusted using the nonresponse adjustment process described in previous years’ methods. In 2010, 2015, and 2017, the results of the McNemar test were also significant and we did not conduct a nonresponse adjustment.

In light of a number of regulatory changes and policy proposals, we included new questions on the anticipated effects of the ACA’s individual mandate penalty repeal on the firm’s health benefits offerings, and the impact of the delay of the high cost plan tax, also known as the Cadillac tax, on the firm’s health benefits decisions. Also new in 2018 are questions asking about smaller firms’ use of level-funded premium plans, an alternative self-funding method with integrated stop loss coverage and a fixed monthly premium.

In 2018, we moved the battery of worker demographics questions from near the beginning of the survey to the end of the survey in an effort to improve the flow. There is no evidence that this move has impacted our survey findings and we will continue to monitor any suspected impacts.

The 2018 survey also expands on retiree health benefits questions, asking firms about cost reduction strategies, whether they contribute to the cost of coverage, and how retiree benefits are offered (e.g., through a Medicare Advantage contract, a traditional employer plan, private exchange, etc.).

Starting in 2018, we allowed respondents who did not know the combined maximum incentive or penalty an employee could receive for health screening and/or wellness and health promotion to answer a categorical question with specified ranges. This method is consistent with how we handle the percent of low-wage and high-wage workers at a firm. In 2018, 18% of respondents did not know the dollar value of the their incentive or penalty and 39% were able to estimate a range.

Starting in 2018, the survey began asking small firms who indicated that their plan was fully-insured whether the plan was level-funded. In a level-funded plan, employers make a set payment each month to an insurer or third party administrator which funds a reserve account for claims, administrative costs, and premiums for stop-loss coverage. These plans are often integrated and firms may not understand the complexities of the self-funded mechanisms underlying them. Some small employers who indicate that their plan is self-funded may also offer a plan that meets this definition. Respondents offering level funded plans were asked about any attachment points applying to enrollees. These firms were not less likely to answer this question, and including them doesn’t not substantially change the average. Prior to 2018, all firms reporting coverage as underwritten by an insurer were excluded from the stop-loss calculations.

The response option choices for the type of incentive or penalty for completing biometric screening or a health risk assessment changed from 2017 to 2018.

For more detail about the 2018 survey, see the Survey Methodology section of that year’s report.

2019

Starting in 2019, we discontinued a weighting adjustment informed by a follow-back survey of firms with 3-49 workers that refused to participate in the full survey. This adjustment was intended to reduce non-response bias in the offer rate statistic, under the assumption that firms that did not complete the survey were less likely to offer health benefits. The adjustment involves comparing the distribution of offering to non-offering firms in the full survey and the follow-back sample in the three smallest size categories (3-9, 10-24, 25-49). The adjustment is based on the differences between the two groups of firms and generally operates to adjust the weights of offering firms and non-offering firms to bring the counts closer together. However, if the distributions of the two groups differ to a statistically significant extent, we consider the follow-back survey to be a different population from the full survey and do not make any adjustment to the weights.

Although we cannot be sure of the reason, we are no longer witnessing the systematic upward bias on estimates for the offer rates of small firms that gave rise to the adjustment. Looking at the decade from 2010 to 2019, offer rates among firms responding to the follow-up survey have been higher for five of ten surveys. Firms with 3-49 employees responding to this follow-up survey have reported a higher offer rate than the full EHBS survey during the 2014, 2016, 2017, 2018, and 2019 surveys. An alternative way to measure non-response bias is to compare estimates throughout the fielding period.

In 2019, the percent of firms offering health benefit was similar in the last month of fielding to offer rates throughout the entire fielding period. Changes in both the survey methodology and the health insurance market have led us to become increasingly cautious about assuming that the follow back survey is a suitable proxy for the true population. Since 2014, we have collected offer rate information from firms before a final disposition is assigned. This method was introduced to reduce a bias in which firms who offer health benefits face a longer average survey than non-offering firms. This had the effect of increasing the percentage of firms for whom contact was made from whom we collected offer rate information. Additionally, we have also attempted to reduce non-response bias by increasing our data collection.

Recent changes in the marketplace also raise some concerns about the validity of the follow-back survey to be the basis for a weight adjustment. We have in recent years seen an increase in non-offering firms reporting that they are providing funds to employees to purchase non-group health insurance. We do not consider this to be an offer of health insurance by the firm, but we are concerned that the person who responds to the follow-back survey may not be able to make that distinction. The follow-back survey is a very simple set of questions asked to whoever answers the phone at a firm that refused to participate in the survey. In contrast, during the full-survey, we attempt to talk to the person most knowledgeable about health benefits, and the interviewers are trained to distinguish between types of benefit programs.

For 2019, making the weight adjustment would change offer rate statistic for all firms from 57% to 60%. Neither estimate is different than the 57% we reported last year (when the weight adjustment was not made because the statistical test indicated that the follow-back group was significantly different than the full survey group).

Starting in 2019, all presented calculations of out-of-pocket maximums strictly relied on an arithmetic average across all plans weighted by covered worker plan enrollment. In prior surveys, some figures (for example Figures 7.43, 7.45, and 7.46 in the 2018 report) were calculated based on the out-of-pocket maximum of the largest plan. This change did not meaningfully change any findings and ensured consistency within the out-of-pocket maximum section of the Employee Cost Sharing section.

For prescription drug coverage, similar to years past, if the firm reports that the worker pays the full cost for drugs on a particular tier and/or that the plan only offers access to a discount program, we do not consider this as offering covering for that drug tier. Starting this year, firms with multiple tiers that cover exclusively specialty drugs, were asked about the cost-sharing of the tier that is used most often. Cost-sharing for prescription drugs does not typically include mail order. Hospital, outpatient surgery and prescription drug cost-sharing was only asked of a firm’s largest plan type.

For 2019, we clarified the question that we use to ask firms whether or not they provide retiree health benefits; particularly, we added language that explicitly stated that firms that had terminated retiree health benefits but still has some retirees currently getting coverage, or that had current employees who will get retiree health coverage in the future, should answer yes to the question. We made this clarification in response to a large decline in the 2018 survey in the prevalence of retiree coverage (from 25% in 2017 to 18% in 2018). In the 2018 survey, we expressed concern that the then current public focus on public entities eliminating retiree benefits for future (not existing) retirees may be influencing the responses we were getting and said that we were going to add clarifying language to the survey question in future years.

For 2019, two open-ended questions were added to the survey in order to examine employer responses to the opioid crisis and obstacles preventing firms from adopting narrow network health plans. All responses to these questions were reviewed in a consistent manner by KFF staff to determine whether they could be recoded as an earlier multiple choice item, or if they could be sorted into new categories.

To increase participation in the final two weeks of the survey, a financial incentive was offered to firms with 3-9 employees, but only 6 firms that completed the survey within that time period qualified for the incentive. All respondents received a printed copy of the survey findings.

For more detail about the 2019 survey, see the Survey Methodology section of that year’s report.

2020

2020 was a challenging year both in administering the survey, as well as for many of our respondents who were scrambling to respond to the pandemic and the ensuing economic downturn. Our questionnaire was developed before the extent of the pandemic became apparent and the fielding period included response from both before and after. We asked respondents about their plans at the time of the interview, with approximately half of the responses (composing 50% of the covered worker weight) collected between January and March. The remaining interviews were completed before the middle of July. The survey is designed to track changes in benefit and cost between years and is not well suited to answer many of the important questions that emerged this year for a couple of reasons. Firstly, employers make decisions about their plans before the plan year begins. Premiums for self-funded employers are usually reported as the cost for a former worker to enroll in COBRA (deflated by an administrative fee) and do not reflect real-time spending. Many other plan features, including provider networks and cost-sharing, are set before a plan’s open enrollment period. We expect to learn more about how changes in benefits and utilization affected cost in the 2021 survey. Secondly, the month in which a respondent completes the survey is not random, the data collection firm completes interviews with larger panel firms first. We do not believe that these firms are similar to the non-panel firms that complete the survey later in the year. We believe these firms differ in ways which are not corrected for by weighting, which means we cannot look at how responses changed over the period to detect patterns of change. Thirdly, our sample is not sufficient to make many comparisons across fielding period. We plan to ask employers about changes to their plans and the impact of COVID-19 on their decision making in the 2021 survey.

In the summer of 2019, National Research LLC, which had conducted the Employer Health Benefit Survey since its inception, ceased operation. We engaged in a search to identify a new firm to conduct the 2020 survey and selected Davis Research LLC, based on their extensive experience in research on firms and establishments. While we believe that the sampling methodology, questionnaire and survey procedures were consistent between years, readers are strongly encouraged to consider “total survey error” when drawing conclusions about differences between statistics. Survey-adjusted standard errors (and statistical testing) measure uncertainty in estimates based on the sampling strategy, but do not measure biases that may be introduced through the data collection process such as interviewer or house effects. House effects refer to the impact of a data collection firm’s management and workflow processes on final statistics. We do not know how, or if at all, changing the data collection firm from National Research to Davis impacted estimates. Empirical studies of house effects vary greatly, with some reporting almost no impact and others observing significant differences in point estimates.

In order to minimize house effect impacts, we conducted extensive interview training with managers and interviewers at Davis, including sessions lead by interviewers with prior experience on the project. In addition, KFF pretested and observed interviews to verify that Davis’ quality assurance process was consistent with our understanding of how the survey had been conducted historically.

Starting in 2020, we limited the number of margins used to calibrate weights and adjust for non-response. Until 2019, our weighting procedure incorporated offer status, firm size, geographic region, and metropolitan status to adjust for unit nonresponse. Our 2020 weighting algorithm no longer relies on metropolitan vs. non-metropolitan as part of the non-response calculation. Separately, earlier surveys post-stratified each firm’s set of weights to industry, firm size, census division, and panel versus non-panel margins. Starting in 2020, we reduced this weight calibration to only industry and firm size controls. Finally, we collapsed industries in our 5,000+ employee firm size category, owing to the fact that many large businesses operate across multiple industries. All three of these changes were prompted by an increase in the number of calibration cells with low sample, which can result in individual firms with highly influential weights if not revised. Without this revision, some 2020 statistics would had been driven by a small number of firms with overly influential weights. Reducing the number of variables in these improves the stability of some published estimates. This issue arose in part due to the smaller number of completed interviews in 2020 relative to 2019.

Historically we measured the annual changes in workers’ wages and in inflation by comparing the values for April of the previous year and April of the current year. This year the labor market underwent significant disruptions in March and April as employers laid off and furloughed large numbers of workers in response to the COVID-19 pandemic. A relatively high share of lower-wage workers were furloughed and laid off during these months, resulting in a high change in wages as measured from April to April. In response to this unprecedented change in the labor market, we have elected to change how we calculate workers wages and inflation. Beginning with our 2020 publication, we are now calculating the change in workers wages and inflation based on an average of the first quarter of each year. Using this method, workers wages increased 3.4% compared to 7.7% between April and April. And similarly inflation increased 2.1% compared to 0.3%. Prior to 2020, both methods produced very similar estimates.

For more detail about the 2020 survey, see the Survey Methodology section of that year’s report.

2021

This year we made several changes to the survey questionnaire in order to reduce the length and burden of the survey; rather than asking benefit managers about the characteristics of up to four plan types, we asked for the premiums and deductibles of the largest two plan types and other cost information for only the largest. We now only ask about cost-sharing for prescription drugs, hospitalizations, outpatient surgery and office visits for the plan type with the most enrollment. This change mostly impacts the largest firms which are more likely to sponsor multiple plan types. As in prior years, if a firm sponsors multiple plans, of the same plan type, for example, several PPOs across the country, we ask about only the one with the most enrollment. In 2021, 13% of respondents offered three or more plan types – in total the largest plan type accounts for 82% of workers covered by health benefits and the largest two plan types represents 98%. For this reason, this change will only have a minimal impact on most estimates. Furthermore, in prior years we observed no systematic bias in key metrics across the plan type rank at each firm. For example, in 2020, among firms with three or more plan types, the third-largest plan had statistically similar premiums and deductibles to the larger plan types on average. This change did not require a change in how many of the the all firm variables are calculated. To determine the all plan value for categorical variables describing plans, we continue to use the largest type as a proxy. To do so, we identify the plan type that has the largest enrollment within the observation and use data from that plan as a proxy for the all-plan aggregate for that firm. For example, in previous years, we would ask an employer whether their HMO, PPO, POS and HDHP/SO were self-funded, and then report the response from largest plan type as the all firm response.

For the first time, a subset of employers were invited to complete the survey online, though in total 99% of the interviews were completed through computer-assisted telephone interviewing.

For more detail about the 2021 survey, see the Survey Methodology section of that year’s report.

2022

In 2022, we incorporated the California Employer Health Benefits Survey (CHBS) from the California Health Care Foundation (CHCF) into EHBS by oversampling firms with any presence in California and including new questions into the main EHBS instrument to determine firms with any employment in the state of California. Unlike other years, the 2022 EHBS used as its panel both respondents to either of the prior two years of EHBS (2020 and 2021) and also respondents to either of the prior two years of CHBS (2018 and 2020). Since many larger firms operate across state lines, the integration of CHBS with EHBS aimed to reduce survey burden among firms that had previously responded to both surveys. Among the N=1,140 large firms responding to the 2022 EHBS, 419 of those responding firms (37%) had any presence in California, highlighting the overlap across these two projects. Given the size of the California oversample needed to assure statistical reliability both nationally and within California, firm weights were calibrated to California-specific targets from the SUSB.

In 2022, Davis extended computer assisted web interview (CAWI) capacity, offering most respondents the opportunity to complete the survey using an online questionnaire rather a telephone interview. Although only 1% of respondents used this survey mode during the initial 2021 attempt, 43% of 2022 survey respondents answered EHBS via CAWI.

Survey mode did not impact the survey results in a systematic or obvious manner. The effects of mode and firm size on major firm characteristics such as annual premiums, contributions, and deductibles was modeled using standard linear regression. For certain plan types, survey implementation through telephone interview had a negative effect on the reported value. However, the plan types affected were random, so this effect is more likely due to confounding variables. When examining demographic characteristics between the two modes, there were small differences in the distribution of categorical variables such as region and age. However, without multiple years of data utilizing the two modes, it is impossible to determine if these are systematic biases. The imputation rate between the two modes was almost identical.

For more detail about the 2022 survey, see the Survey Methodology section of that year’s report.

2023

The 2023 survey features questions which have not been asked for several years including questions on spousal benefits, voluntary benefits, such as dental and vision coverage, waiting periods and emergency room cost-sharing. In addition, the survey includes new questions, on abortion coverage, prior authorization, coverage limits and coverage for gender-affirming care.

In 2022, the Employer Health Benefit Survey over-sampled California based firms in order to provide estimates for the California Health Care Foundation’s Health Benefit Survey. For more information please see: https://www.chcf.org/publication/2023-edition-california-employer-health-benefits/. As a result, both our weighting and sampling in 2022 took into account whether a firm was located in California. In 2023, we sampled non-panel firms based on whether they were located in California. Our 2023 sampling method is similar to the methods used prior to 2022, and is based on a firms size and industry.

As in previous years, modification were made to existing survey questions, both to improve clarity or respond to changes in the marketplace. Starting in 2023, respondents are able to provider either monthly or annual HSA contribution amounts. The interview notes for the question on level-funding was editing to clarify that employing a third-party administrator does not necessarily mean a plan is level-funded. The question on whether a firm or plan gives workers the opportunity to complete biometeric screening was edited to include take-home kits, which collect biometetric data, not just in-person exams. The distribution of cost-sharing for hospital admission and outpatient surgery was edited, to make the categories “copay”, “coninsurance”, and “both copay and coinsurance”, mutually exclusive. The both category may include covered workers who face either both cost-sharing requirements or which ever is greater. Based on interview debriefs, we elected not to report the share of employers who believe that telehealth will be important for enrollees in remote settings. We continue to revise our data cleaning and editing procedures.

For more detail about the 2023 survey, see the Survey Methodology section of that year’s report.


  1. HDHP/SO includes high-deductible health plans with a deductible of at least $1,000 for single coverage and $2,000 for family coverage and that offer either a Health Reimbursement Arrangement (HRA) or a Health Savings Account (HSA). Although HRAs can be offered along with a health plan that is not an HDHP, the survey collected information only on HRAs that are offered along with HDHPs. For specific definitions of HDHPs, HRAs, and HSAs, see the introduction to Section 8.↩︎
  2. Response rate estimates are calculated by dividing the number of completes over the number of refusals and the fraction of the firms with unknown eligibility to participate estimated to be eligible. Firms determined to be ineligible to complete the survey are not included in the response rate calculation.↩︎
  3. Estimates presented in [Figure 2.1], [Figure 2.2], [Figure 2.3], [Figure 2.4], [Figure 2.5], and [Figure 2.6] are based on the sample of both firms that completed the entire survey and those that answered just one question about whether they offer health benefits.↩︎
  4. Seasonally Adjusted Data from the Current Employment Statistics Survey. Bureau of Labor Statistics. Current Employment Statistics—CES (National) [Internet]. Washington (DC): BLS; [cited 2023 Aug 1]. Available from: https://www.bls.gov/ces/publications/highlights/highlights-archive.htm↩︎
  5. Bureau of Labor Statistics, Mid-Atlantic Information Office. Consumer Price Index historical tables for, U.S. City Average (1967 = 100) of Annual Inflation [Internet]. Washington (DC): BLS; [cited 2023 Aug 1]. Available from: https://www.bls.gov/regions/mid-atlantic/data/consumerpriceindexhistorical1967base_us_table.htm↩︎
  6. Seidenberg, Andrew B, Richard P Moser, and Brady T West. 2023. “Preferred Reporting Items for Complex Sample Survey Analysis (PRICSSA).” Journal of Survey Statistics and Methodology 11 (4): 743–57. https://doi.org/10.1093/jssam/smac040.↩︎
Summary of Findings Section 1: Cost of Health Insurance

KFF Headquarters: 185 Berry St., Suite 2000, San Francisco, CA 94107 | Phone 650-854-9400
Washington Offices and Barbara Jordan Conference Center: 1330 G Street, NW, Washington, DC 20005 | Phone 202-347-5270

www.kff.org | Email Alerts: kff.org/email | facebook.com/KFF | twitter.com/kff

The independent source for health policy research, polling, and news, KFF is a nonprofit organization based in San Francisco, California.