XIII. HWSM Validation, Strengths and Limitations, and Improvement

Published 2025

In this module:

HWSM validation
HWSM strengths and limitations
HWSM improvement

Annual updates of the Health Workforce Simulation Model (HWSM) incorporate the most recent available supply and demand determinants. Each annual update includes a review of the latest literature on the occupations being modeled, trends in health care use and delivery, and health workforce modeling. The latest available data are analyzed, and new data sources that could improve the theoretical underpinnings of HWSM and its projections are explored.

Outreach efforts to the associations representing the modeled health occupations and training institutions give stakeholders the opportunity to offer feedback on the data, methods, and assumptions used to develop workforce projections. Some stakeholders provide data on starting year supply of health workers and the number and characteristics of graduates from training programs. Some associations have provided data on retirement expectations, hours worked, and capacity (used to estimate starting year gap between supply and demand) from association-sponsored surveys. Associations that have provided data include the American Psychological Association, American Podiatric Medical Association, American Physical Therapy Association, and the Pharmacy Technician Certification Board. States are invited to provide data from their licensure files, and supply projections for some states use licensure data rather than relying on survey data.

This module summarizes activities undertaken to validate the model and projections, discusses HWSM strengths and limitations, and explores efforts to improve the model.

HWSM validation

A model, by definition, is a simplified version of reality. Validation activities are vital to ensuring that the model reflects reality accurately. Validation of HWSM is a continual process that will proceed as the model is updated with new data for different health workers.

Following International Society for Pharmacoeconomics and Outcomes Research (ISPOR) guidelines on best practices, validation activities in HWSM included the following [1]:

Review by subject matter experts (face validity). The model framework should conform to observations about how the system works and be consistent with theory. Expert review helps ensure that the model uses the best available inputs and parameters. Model outputs should be consistent with expectations of subject matter experts.
A technical evaluation panel of health care workforce experts from HRSA, HRSA-sponsored workforce centers, and elsewhere in the field has reviewed the model framework. The microsimulation approach is well suited to analyzing complex systems, such as the health care system, that feature decentralized and autonomous decision-making. For supply modeling, each individual makes career and labor force participation decisions based on their unique characteristics and on external factors such as earnings potential and unemployment risk. For demand modeling, individuals decide to use health care services based on their health risks and financial constraints. How fully the model can capture the complex, dynamic, interactive processes that characterize the demand for and supply of health care providers remains an area of ongoing exploration.
Internal validation (verification). This set of activities includes:
1. Review computer code for accuracy
2. Validate parameters in the model against their source
3. “Stress test” HWSM by modeling extreme input values to test whether the model produces expected results
4. Assess and compare regression coefficients for the health care use patterns to prior year coefficients to determine consistency
External and predictive validation. This form of validation compares model outputs to external data sources that were not used in model development.
As an example, the health-related characteristics of the constructed population database are calibrated by comparing the prevalence estimates to published sources. State-level projections of hospital inpatient days are compared to levels reported by the American Hospital Association.
Between-model validation (cross validation). This type of validation compares model outputs with results of other models.

There are few models for comparison, but some states and associations produce workforce projections. Demand estimates are typically simple extrapolations of provider-to-population ratios showing how local supply compares to the national average. Demand projections extrapolate current provider-to-population ratios to the future population. Some associations produce supply projections, often using a cohort model (or “stock and flow” model) with estimates of new entrants and exits from the workforce.
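The two comparison approaches described above can be sketched in a few lines. This is a minimal illustration, not the method of any particular state or association: the function names, the flat attrition rate, and every input value are hypothetical and chosen only to show the arithmetic of a stock-and-flow supply projection and a fixed-ratio demand projection.

```python
def project_supply(start_supply, annual_entrants, attrition_rate, years):
    """Stock-and-flow projection: next stock = current stock - exits + entrants."""
    supply = [float(start_supply)]
    for _ in range(years):
        exits = supply[-1] * attrition_rate  # assumed flat share leaving each year
        supply.append(supply[-1] - exits + annual_entrants)
    return supply


def project_demand(providers_now, population_now, future_populations):
    """Hold today's provider-to-population ratio fixed and apply it forward."""
    ratio = providers_now / population_now
    return [ratio * pop for pop in future_populations]


# Hypothetical inputs, for illustration only.
supply = project_supply(start_supply=10_000, annual_entrants=500,
                        attrition_rate=0.04, years=3)
demand = project_demand(providers_now=10_000, population_now=1_000_000,
                        future_populations=[1_010_000, 1_020_000, 1_030_000])

for year, (s, d) in enumerate(zip(supply[1:], demand), start=1):
    print(f"year {year}: supply={s:,.0f} demand={d:,.0f} gap={s - d:,.0f}")
```

Because the demand side simply carries the current ratio forward, it cannot reflect changes in population health, insurance coverage, or care delivery, which is the main contrast with an individual-level model such as HWSM.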

HWSM strengths and limitations

The main strengths of HWSM are the use of recent data sources and a sophisticated microsimulation model for projecting health workforce supply and demand.

Compared to population-based approaches to modeling, this approach has several advantages:

More predictive variables can be used in modeling: This both enhances the accuracy of results and allows for scenario modeling such as the Reduced Barriers demand scenario described in other modules.
Lower levels of geography can be modeled: HWSM demand projections take into consideration geographic variation in demographics; prevalence of disease and health risk factors; socioeconomic factors, such as household income and prevalence of medical insurance coverage; and level of rurality. This supports HRSA’s goal of building more accurate state-level projections.
Projection models can be consolidated across occupations: Profession-specific equations can be integrated into a single platform. Annual updates of HWSM allow for updates of all occupations modeled. Still, modeling the integration of different occupations in care delivery is an ongoing process as care delivery patterns continue to evolve and data become available on the degree to which roles and scopes of practice overlap or complement the roles of other occupations.
HWSM uses individuals as the unit of analysis: Modeling at the individual level allows for improvement in the theoretical underpinnings of HWSM. For supply modeling, individual health care workers make their own labor force decisions about whether to work, how many hours to work, whether to move to another state, or whether to change profession. For demand modeling, health care use is determined by the needs and circumstances of individual patients. Modeling at the individual level provides added flexibility for modeling the workforce implications of changes in policy. A past example from earlier HRSA reports is expanded health insurance coverage under the Affordable Care Act. This flexibility facilitates modeling the Reduced Barriers demand scenario, where a person can be simulated as having the health care use patterns of an insured, non-Hispanic, white person living in a metropolitan area (that is, a population perceived to have lower barriers to receiving care).

Many of the limitations of HWSM stem from data limitations. The following are key limitations:

Setting demand equal to supply in the starting year: Historically, HWSM operated under the assumption that national demand equals national supply in the starting year for most health professions. The exceptions were primary care physicians and psychiatrists, where the number of additional providers required to remove health profession shortage designations served as a proxy for starting year shortfall. Alternative scenarios, such as the Reduced Barriers scenario and the Unmet Needs scenario for behavioral health, began with national demand starting higher than supply. Growing evidence of national shortfalls across many health professions, partly attributable to the COVID-19 pandemic, has prompted significant updates to this approach starting in 2025. Occupation-specific shortfall estimates and sources are discussed in individual occupation chapters.
Physicians and Advanced Practice Providers: For select physician specialties, published shortfall estimates are incorporated into demand projections. Additionally, estimates of increased demand for health care services as COVID-19 becomes endemic are used to shift provider demand and estimate starting year shortfalls. For family medicine and general internal medicine physicians, the model now accounts for substantial time spent providing behavioral health services, as growing demand for behavioral health care increasingly crowds out time for other preventive care services.
Nursing: Recent vacancy data from hospitals and academic institutions are used to quantify starting year shortfalls.
Allied Health Professions: For physical therapists, national shortfall estimates derived from recent association-sponsored provider capacity surveys are incorporated. For select allied health and long-term care occupations (including dietetic technicians, nuclear medicine technologists, occupational therapy aides, physical therapy aides, podiatrists, radiation therapists, recreational therapists, medical transcriptionists, and nursing assistants), shortage estimates incorporate both supply-side constraints and demand increases associated with COVID-19 becoming endemic. These supply-side shortage estimates are based on analysis of annual employment trends in the combined 2019-2024 BLS OEWS data.
Dental Professions: Published shortfall estimates are used for dental hygienists and dental assistants.
These shortfall estimates are likely conservative, as many professions show anecdotal evidence of current shortfalls without quantified estimates of shortfall magnitude.
Since demand projections for individual states reflect national averages for health care use and delivery applied to state populations, supply may exceed demand in some states while demand exceeds supply in others. Even in states where supply exceeds national average demand levels, perceived provider shortfalls may exist for two reasons: approximately half the states will exceed national averages by definition, and some states utilize different provider mixes than the national average. For example, states that rely more heavily on licensed practical nurses (LPNs) relative to registered nurses (RNs) compared to the national average may show an RN shortfall and LPN surplus in HWSM, while the state perceives no imbalances.
Omission of market forces and economic concepts in HWSM: HWSM currently lacks a market mechanism where labor costs respond to imbalances between supply and demand. A growing shortfall of providers, for example, presumably would cause wages to increase, which would make working in the profession more desirable (thus increasing supply) while raising labor expenses (thus decreasing demand). Market forces, therefore, will tend to alleviate severe imbalances between supply and demand. Nevertheless, time lags such as the years required to adjust the training pipeline can lead to inefficiencies if relying on market forces alone. HWSM projections serve as market “signals” of growing imbalances between supply and demand, so that imbalances can be identified far enough in advance to inform career and policy decisions that might help mitigate their severity.
Use of survey data in lieu of population data for supply modeling: HWSM uses the American Community Survey (ACS), Occupational Employment and Wage Statistics (OEWS), and National Sample Survey of Registered Nurses (NSSRN) to estimate the starting year supply of many health occupations. Many states, however, have access to more complete supply data collected through the licensure/certification processes. Without comprehensive state-level data, HWSM continues to use estimates based on surveys. However, even state licensure files have data limitations. One limitation is that state licensure boards vary in the types and completeness of information they collect. While licensure files indicate whether the license is active, many licensure boards do not collect information on whether the licensed person is active in their profession and whether the person is active in that particular state. (This is especially true for the registered nurse workforce, where many states belong to compacts that allow the nurse to work in other states.)
Omitted data elements from the constructed population file: The population file starts with people in households who responded to the ACS. However, the ACS lacks health-related information such as whether the person has various chronic diseases, health risk factors such as smoking and obesity, and information on the person’s mental health status or use of addictive substances. Information on chronic disease and health risk factors is obtained by statistically merging ACS with other sources of data: the Behavioral Risk Factor Surveillance System (BRFSS) for people who live in the community, the Medicare Current Beneficiary Survey for the subset of people living in residential communities, and the Nursing Home Minimum Dataset for people residing in nursing homes. Even these health-related data sources lack standardized measures of mental health status, addiction status, and dental insurance coverage. Such information could improve projections for mental health workers, addiction counselors, and oral health providers, respectively. Dental insurance is therefore unavailable for inclusion in the constructed population file. However, as discussed later, using medical insurance as a proxy for dental insurance performed equally well for modeling demand for oral health services.
Demand modeled where people reside: There is little information on consumer care migration patterns. Many large metropolitan areas cross state boundaries, and where people live is not necessarily where they receive care. Comparison of HWSM projections of hospital inpatient days to inpatient days reported by the American Hospital Association shows deviation across states between actual and projected inpatient days. It is unclear to what extent such deviation results from uncaptured demand determinants that affect how each state’s population uses health care services, and to what extent it occurs because care is less accessible in patients’ state of residence, leading them to seek care in another state.
Supply projections unavailable for some health occupations: For occupations such as aides and assistants, where there is easy entry and exit from the workforce, insufficient data exist to develop accurate supply projections. There are multiple paths to entry for many of these occupations, and some states do not require licensure or graduation from a formal training program. These occupations typically have low pay, so attrition is high and generally occurs well before the traditional retirement age.
Uncertainty about changes in health care use and delivery over time: A criticism of workforce models in general is that they rely on historical data that are extrapolated into the future. Hence, such models might not accurately capture how the health care system will evolve over time, including evolving patterns of care use and delivery. Key trends to consider are the following:
1. Artificial intelligence: AI has the potential to profoundly transform health care, reshaping how care is delivered and utilized across the system. By analyzing vast datasets—including medical images, lab results, and patient histories—AI can assist with early detection and more accurate diagnoses. It also facilitates the shift toward personalized medicine, tailoring treatment plans to individual patients based on their unique data profiles.
  AI-driven robotic systems are already being developed to enhance surgical precision, while AI-powered virtual health assistants and chatbots improve patient engagement, offering 24/7 monitoring, medication reminders, and basic medical advice. These innovations extend into remote patient monitoring and telemedicine, allowing patients to receive care from home while maintaining continuous oversight of chronic conditions.
  AI can help optimize health care administration by streamlining workflows and reducing wait times. Additionally, AI is accelerating drug discovery by analyzing complex data to identify potential new treatments and bringing them to market faster. In the realm of mental health, AI is being utilized through therapy bots, monitoring systems, and predictive analytics to offer timely support and intervention. AI further assists clinicians in making more informed decisions by processing vast amounts of medical literature and real-time patient data. In short, AI has the potential to revolutionize every aspect of health care delivery, driving improvements in patient care, operational efficiency, and medical innovation.
2. Advances in medicine and technology: Such advances have the potential to change need, utilization, and delivery patterns. For example, FDA approval of and growing access to GLP-1 medications to treat obesity have the potential to dramatically reduce the incidence of type 2 diabetes, cardiovascular disease, certain cancers, and a host of other chronic conditions, with implications for the health occupations and specialties that treat patients with these conditions. Many advances in medicine and technology are disease specific, but other advances have the potential to affect large segments of the population.
3. Budget pressures and cost containment: Budget pressures are intensifying for Medicare, Medicaid, and commercial payers as the aging population drives increased demand for health care services, while health care costs continue to consume a growing portion of national expenditures. Advances in medicine, which enable the delivery of more care, further contribute to these financial strains. Additionally, political and social efforts to address health disparities and expand access to care contribute to budgetary challenges. Budget constraints could cause payers to reduce coverage for certain types of care or shift more of the cost to patients, thus reducing demand for some services and the workforce required to provide such services.
  Budget pressures also threaten the healthcare workforce pipeline by constraining educational capacity. Cuts to research funding could affect academic medical institutions' ability to train future healthcare providers. Reduced access to student loans and financial aid may deter potential healthcare workers who rely on educational financing. Additionally, budget constraints can force training institutions to reduce faculty, limit training slots, or eliminate specialized programs, particularly in underserved areas where workforce shortage programs face funding cuts.
4. Team-based care, complementary care, and substitution across provider types: Care is increasingly specialized, with each member of the care delivery team providing services to address both efficiency and quality dimensions. Many providers overlap in the scope of services that they provide (for example, physicians and advanced practice providers), and there is a lack of data on the degree to which they can substitute for one another. There are also strong political and regulatory dimensions to this issue. The lack of provider supply in one profession (for example, phlebotomists) can have implications for the workload of other professions (for example, nurses), who perform the work that otherwise would have been performed by the profession in short supply.
5. Increased demand for behavioral health services: As the prevalence of mental health disorders such as anxiety and depression continues to rise, and as society becomes more open to discussing and seeking treatment, the demand for mental health services will also rise. The growing prevalence of substance abuse, particularly in the context of the opioid epidemic, will increase the need for addiction treatment services.

Additional data limitations are occupation specific and include imprecision in supply and demand determinants.
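The provider-mix point made under the first limitation above (a state relying more on LPNs than the national average can show an RN shortfall and an LPN surplus) can be illustrated with simple arithmetic. The sketch below is deliberately simplified: it uses flat per-capita staffing rates in place of HWSM's individual-level equations, and the rates, population, supply figures, and function name are all hypothetical.

```python
# Hypothetical national staffing rates (FTEs per 1,000 residents); not HWSM values.
NATIONAL_RATE_PER_1000 = {"RN": 9.0, "LPN": 2.0}


def state_gap(population, state_supply, rates=NATIONAL_RATE_PER_1000):
    """Return supply minus demand by occupation, with demand set at national rates."""
    gaps = {}
    for occupation, rate in rates.items():
        demand = rate * population / 1000
        gaps[occupation] = state_supply[occupation] - demand
    return gaps


# A hypothetical state that staffs with relatively more LPNs and fewer RNs.
gaps = state_gap(population=1_000_000, state_supply={"RN": 8_000, "LPN": 3_000})
for occupation, gap in gaps.items():
    label = "surplus" if gap >= 0 else "shortfall"
    print(f"{occupation}: {abs(gap):,.0f} FTE {label}")
```

In this example the combined nurse count matches demand at national rates, so the state may perceive no imbalance even though each occupation taken separately shows a gap.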

HWSM improvement

HRSA continues to explore improvements to HWSM, model inputs, and projections. Questions regarding technical accuracy and suggestions for improving the model have been thoroughly investigated to provide the highest quality projections. The following are examples of improvements to HWSM:

In 2023, a career change component was added to HWSM. Analysis of the Current Population Survey (CPS) Annual Social and Economic Supplement data produced estimates of the probability that people under age 50 in each health care occupation would change careers and leave the occupation. This lowered projected growth in supply—particularly for occupations with lower education and training requirements and occupations with lower pay. In 2024 and 2025, this career change analysis was updated to include additional years of data and refinements in methodology.

Also starting in 2023, projections of the health workforce demand implications of COVID-19 becoming endemic were added to HWSM. As new data became available on the prevalence of acute COVID-19 and long-COVID symptoms, model inputs were updated.

For many occupations, there is growing evidence of a current shortage. The projections include a starting year shortfall for professions where there is sufficient information to quantify a shortfall. As noted above, shortfall information comes from published profession-specific studies, vacancy data from surveys, and other calculations based on shifts in demand associated with COVID-19 becoming endemic and increased demand for behavioral health services. Some professional associations, such as the American Physical Therapy Association, have started to include capacity questions in their member surveys to quantify any shortages or excess supply.

In 2019, in response to questions, possible overdispersion in the Poisson models used to develop predictive equations of annual visits to various types of providers and care delivery settings was investigated, and potential alternatives to Poisson models were examined. If data follow a Poisson distribution, their mean equals their variance. However, the Medical Expenditure Panel Survey (MEPS) data on the number of annual visits to various providers tend to contain more zeroes than would be expected in a Poisson distribution, a common symptom of overdispersion (variance exceeding the mean). Fitting a Poisson model to data exhibiting overdispersion tends to produce understated standard errors.

Potentially better fitting models for count data that contain large numbers of zeroes include negative binomial, zero-inflated, and zero-altered models. After exploring these alternative regression specifications, HWSM switched from using Poisson to negative binomial models for estimating office visits, outpatient visits, and home health visits. This change had a negligible impact on the projections but is a conceptual improvement in the prediction equations for health care use.
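A quick descriptive check for the overdispersion discussed above compares the sample variance to the sample mean and the observed share of zeroes to the share a Poisson distribution with the same mean would produce. The sketch below is illustrative only: the visit counts are made-up values rather than MEPS data, the function name is hypothetical, and the dispersion parameter uses the standard method-of-moments estimate for a negative binomial with variance mu + alpha * mu^2, which is not necessarily how HWSM estimates its equations.

```python
import math
from statistics import mean, pvariance


def overdispersion_summary(counts):
    """Summarize how far count data depart from the Poisson mean = variance property."""
    m = mean(counts)
    v = pvariance(counts)
    return {
        "mean": m,
        "variance": v,
        "dispersion_ratio": v / m,  # about 1 under a Poisson distribution
        "observed_zero_share": sum(1 for c in counts if c == 0) / len(counts),
        "poisson_zero_share": math.exp(-m),  # P(0) for a Poisson with this mean
        # Method-of-moments alpha for variance = mu + alpha * mu**2.
        "nb_alpha": max(0.0, (v - m) / m ** 2),
    }


# Made-up annual visit counts with many zeroes and a long right tail.
visits = [0] * 60 + [1] * 15 + [2] * 10 + [5] * 8 + [12] * 5 + [20] * 2
summary = overdispersion_summary(visits)
for name, value in summary.items():
    print(f"{name}: {value:.3f}")
```

A dispersion ratio well above 1, together with far more zeroes than the Poisson share, is the pattern that motivates negative binomial, zero-inflated, or zero-altered specifications.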

Also in 2019, the suggestion to use dental insurance rather than medical insurance as a predictor of the number of annual visits to oral health care providers was evaluated. As discussed previously, dental insurance is unavailable in the constructed population file, so predictive equations of oral health use that include a dental insurance variable cannot be applied to the population file. Because dental insurance is available in the MEPS file, however, it was possible to test how dental insurance compared to medical insurance as a predictive variable for oral health modeling.

Our analysis looked at the root mean square error (RMSE), a measure of accuracy of the resulting predictions, when using dental insurance versus medical insurance as a predictor of annual visits to oral health providers:

\[
\mathrm{RMSE} = \sqrt{\frac{1}{n}\sum_{j=1}^{n}\left(y_j - \hat{y}_j\right)^2}
\]

where y_j = observed visits and ŷ_j = visits predicted by the model for each observation j, and n = the number of observations. RMSE was equivalent to two decimal places for the two formulations in regressions of visits to both dentists and hygienists.
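As a small worked example of the RMSE measure described above, the following sketch computes it directly from paired observed and predicted values. The function name and the four data points are hypothetical and serve only to show the calculation.

```python
import math


def rmse(observed, predicted):
    """Root mean square error between paired observed and predicted values."""
    if len(observed) != len(predicted) or not observed:
        raise ValueError("observed and predicted must be non-empty and equal in length")
    squared_errors = [(y - y_hat) ** 2 for y, y_hat in zip(observed, predicted)]
    return math.sqrt(sum(squared_errors) / len(observed))


# Hypothetical annual visit counts and model predictions.
observed_visits = [0, 2, 1, 3]
predicted_visits = [1, 2, 1, 1]
error = rmse(observed_visits, predicted_visits)
print(f"RMSE = {error:.3f}")
```

Here the squared errors are 1, 0, 0, and 4, so the mean squared error is 1.25 and the RMSE is its square root.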

Additional comparisons were then performed. For each oral health worker designation, the data were split 10 times into a training set (75%, picked randomly) and a testing set (the remaining 25%). In each split (and for each profession), the percentage of total prediction error using the medical insurance coverage variable was compared to the percentage of total prediction error using the dental insurance coverage variable. The two error percentages were always within 0.5% of each other; the error was sometimes higher for the model with dental insurance and sometimes higher for the model with medical insurance. Thus, no evidence was found that dental insurance performed differently from medical insurance, and medical insurance was retained as a predictor variable for annual visits to oral health care providers.
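The repeated-split comparison described above can be sketched as follows. This is an assumption-laden illustration rather than the analysis actually run: the records are synthetic, the field and function names are hypothetical, and a simple group-mean predictor stands in for the regression equations so that the example stays self-contained.

```python
import random


def group_mean_model(train, predictor):
    """Predict visits as the training-set mean within each predictor group."""
    sums, counts = {}, {}
    for row in train:
        key = row[predictor]
        sums[key] = sums.get(key, 0) + row["visits"]
        counts[key] = counts.get(key, 0) + 1
    overall = sum(r["visits"] for r in train) / len(train)
    means = {k: sums[k] / counts[k] for k in sums}
    return lambda row: means.get(row[predictor], overall)


def total_error_pct(model, test):
    """Absolute gap between predicted and actual total visits, as a percentage."""
    actual = sum(r["visits"] for r in test)
    if actual == 0:
        raise ValueError("test set has no visits; percentage error is undefined")
    predicted = sum(model(r) for r in test)
    return 100 * abs(predicted - actual) / actual


def compare_predictors(data, predictors, n_splits=10, train_share=0.75, seed=0):
    """Repeat a random train/test split and score each predictor on every split."""
    rng = random.Random(seed)
    results = []
    for _ in range(n_splits):
        shuffled = data[:]
        rng.shuffle(shuffled)
        cut = int(train_share * len(shuffled))
        train, test = shuffled[:cut], shuffled[cut:]
        results.append({p: total_error_pct(group_mean_model(train, p), test)
                        for p in predictors})
    return results


# Synthetic records, for illustration only.
data_rng = random.Random(1)
data = []
for _ in range(400):
    medical = data_rng.random() < 0.8
    dental = medical and data_rng.random() < 0.7
    visits = data_rng.choice([0, 0, 1, 2]) + (1 if dental else 0)
    data.append({"medical_ins": medical, "dental_ins": dental, "visits": visits})

results = compare_predictors(data, ["medical_ins", "dental_ins"])
for i, scores in enumerate(results, start=1):
    print(i, {name: round(pct, 2) for name, pct in scores.items()})
```

Comparing the two error percentages split by split, rather than on a single split, shows whether one predictor is consistently better or whether the ordering simply fluctuates with the random partition.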

Recent analyses have explored potential improvements to HWSM. Some have already been implemented in HWSM, such as improvements to attrition rates for RNs and LPNs under age 50 and improvements to state-level estimates of the number of new nurses entering the nurse workforce.

Other analyses continue to be explored including:

Incorporating economic factors: Health workforce supply and demand respond to economic forces. Labor economic theory and empirical research find that economic factors such as compensation and wealth can affect individual and household employment decisions. Likewise, economic factors affect the decisions made by individuals, households, payers, health care provider organizations, and other entities (for example, medical device companies, pharmaceutical companies) on health care products and services to consume and provide. Economic factors affect resource allocation decisions to meet the demand for health care services, and often signal any growing gap between supply and demand. Research explored the impact of nurse wages on labor force participation decisions, weekly hours worked, and cross-state migration. This research added cost-of-living-adjusted state data on mean wages for RNs and LPNs and used them as explanatory variables in the regression equations predicting labor force decisions. While most findings were in the expected direction, many were either not statistically significant or had minimal impact. Additional research on other health occupations is required before incorporating such economic factors into HWSM. A review of the literature explored how economic factors might affect demand for health care services and providers. Little recent research has been published on this topic.
Using dynamic staffing patterns: HWSM uses national staffing ratios when modeling demand for health care workers based on projected demand for health care services. Our research explored how nurse staffing in nursing homes (resident-to-nurse ratio) and hospitals (nurse-to-inpatient days ratio) varies across states as a function of nurse wages. The analysis used a regression approach with individual hospitals and individual nursing homes as units of analysis. Explanatory variables included the ratio of state mean RN wages to state mean LPN wages. The hypothesis is that as RN wages rise relative to LPN wages, some employers will substitute LPN labor for RN labor at the margin. (Higher wages could also indicate a provider shortfall.) Findings were consistent with expected results. However, the overall impact on demand for nurses is small. Future research will continue to explore this topic, seeking to improve data sources available for analysis and extending to other health occupations.
Modeling more detailed hospital care delivery settings: HWSM models demand for hospital inpatient care based on total inpatient days, without accounting for care delivery setting. Research used State Inpatient Databases from five states where a revenue code allowed for tracking the number of inpatient days across different units of the hospital. For analysis, units were grouped under the categories critical care, medical/surgical, obstetrics and newborn (excluding intensive care units), psychiatric, and other. Nurse staffing levels differ by unit type. Also, demand for some occupations, such as critical care physicians and respiratory therapists, is concentrated in critical care units. Modeling of these more detailed settings could improve demand projections for some occupations. This is an area of ongoing research as new data become available.
Growing per capita use of behavioral health services: Starting in 2023, HWSM projections of behavioral health demand incorporate utilization trend scalars that reflect growing per capita use of behavioral health services within each age category. The trend analysis is based on historical data in the National Health Interview Survey (NHIS).
Modeling supply and demand for public health workers: In 2024, research was conducted on how a public health workforce component might be added to HWSM. This research is ongoing.
State-specific versus national level utilization data: Ongoing research is exploring how well national patterns of healthcare utilization predict state-level outcomes by comparing national projections of hospital inpatient days and emergency visits by ICD-10 diagnosis grouping to actual utilization in State Inpatient Databases.
Analysis reveals several reasons for divergence between state-level actual utilization and model-predicted totals. First, utilization factors may remain unaccounted for in the national model even when controlling for state-specific demographics and health risk factors. Second, cross-state utilization patterns occur when residents of one state receive care in hospitals located in another state. This cross-border care seeking behavior stems from multiple factors: convenience when communities sit on state borders, vacation-related medical needs when visitors require unexpected care in destination states, and specialized care provision by academic hospitals that attract patients from broader geographic regions.
Diagnosis grouping for hospital workforce demand modeling: National data sources on hospital utilization, such as MEPS and the National Inpatient Sample (NIS), do not identify which healthcare professionals provided care during a patient's hospital stay. Currently, the presumed physician medical specialty is defined by the primary ICD-10 diagnosis code. For example, a patient admitted with a primary diagnosis code indicating stroke is presumed to have been seen by a neurologist, and projected growth over time in inpatient days related to neurological services drives demand growth projections for hospital-based FTE neurologists.
However, this approach has limitations. A patient admitted to the hospital for a stroke may be seen by multiple types of physicians during their stay, including internists, cardiologists, or other specialists depending on comorbidities and complications. Ongoing research is exploring whether both primary and secondary diagnosis codes might be used to better categorize hospital inpatient days for modeling demand for hospital-based physician services, potentially providing a more accurate representation of the diverse specialist care patients actually receive.
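The difference between attributing inpatient days by primary diagnosis alone and also using secondary diagnoses can be sketched as below. This is only an illustration of the idea under exploration: the diagnosis labels, the specialty mapping (other than the stroke example from the text), the equal split of days across specialties, and all names are hypothetical assumptions, not HWSM's actual grouping rules.

```python
from collections import defaultdict

# Illustrative mapping from diagnosis group to presumed physician specialty.
SPECIALTY_BY_DIAGNOSIS = {
    "stroke": "neurology",                 # example given in the text
    "atrial_fibrillation": "cardiology",   # illustrative assumption
    "pneumonia": "internal medicine",      # illustrative assumption
}


def days_by_specialty(stays, use_secondary=False):
    """Attribute inpatient days to specialties implied by each stay's diagnoses."""
    totals = defaultdict(float)
    for stay in stays:
        diagnoses = [stay["primary"]]
        if use_secondary:
            diagnoses += stay.get("secondary", [])
        specialties = []
        for diagnosis in diagnoses:
            specialty = SPECIALTY_BY_DIAGNOSIS.get(diagnosis, "unassigned")
            if specialty not in specialties:
                specialties.append(specialty)
        share = stay["days"] / len(specialties)  # assumed equal split of days
        for specialty in specialties:
            totals[specialty] += share
    return dict(totals)


# Hypothetical hospital stays.
stays = [
    {"primary": "stroke", "secondary": ["atrial_fibrillation"], "days": 6},
    {"primary": "pneumonia", "secondary": [], "days": 4},
]
primary_only = days_by_specialty(stays)
with_secondary = days_by_specialty(stays, use_secondary=True)
print("primary only:  ", primary_only)
print("with secondary:", with_secondary)
```

Total inpatient days are the same under both approaches; only their allocation across specialties changes, which is why the choice of grouping rule matters for specialty-level demand projections.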

HWSM continues to evolve as newer and better data become available, and as published research helps inform model parameters and scenarios.

Date Last Reviewed:

December 2025

1 Eddy DM, Hollingworth W, Caro JJ, Tsevat J, McDonald KM, Wong JB. Model Transparency and Validation: A Report of the ISPOR-SMDM Modeling Good Research Practices Task Force-7. Value in Health. 2012;15(6):843-850. https://doi.org/10.1016/j.jval.2012.04.012