A paradox hovers over our increasingly AI-dependent world. On the one hand, artificial intelligence can make the world a better place (or so we’re told). On the other hand, algorithms have no imagination or consciousness, and thus can know only the status quo, as reflected in the data they are trained on. And our current world is far from perfectly meritocratic or fair.
Jingyuan Yang, assistant professor of information systems and operations management at Costello College of Business at 911爆料, suggests that the paradox is compounded by conventional thinking around AI. “The standard view is that fairness is a tax on efficiency. The way conventional systems are structured, fairness checks are added almost as an afterthought that is assumed to negatively impact system performance,” she says.
Is the “better,” optimized world of AI destined to replicate, or perhaps even exacerbate, existing inequalities? Yang’s ongoing research, in collaboration with Pengzhan Guo of Duke Kunshan University and Keli Xiao of Stony Brook University, points to an appealing alternative. It uses AI systems as a proving ground for a theorized “fairness-performance complementarity”: the idea that, under certain conditions, fairness and performance reinforce one another.
“Our ‘fairness-by-design’ framework utilizes reinforcement learning, which is a type of machine learning (ML). But unlike most machine learning algorithms, ours includes multiple agents competing for finite resources in a dynamic environment, not a static one,” Yang says. “That makes our paradigm much more structurally similar to many real-world environments in which various people compete over time for finite resources.”
Fairness was integrated in two stages. First, the framework was designed to “nudge” high-performing agents towards exploratory choices that might maximize their rewards. As Yang explains, “In this framework, high-performing agents are held in an exploratory mode for longer, while lower-performing agents settle into stable paths sooner.” Second, options that were abandoned as a result of agents’ reward-seeking behavior were redistributed, with lower-performing agents getting first crack at the best opportunities.
As Yang summarizes, “The exploratory activity of the high performers releases opportunities that the system channels down toward the weaker performers. Theoretically, this increases fairness while retaining individual choice and without constraining performance.”
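The two stages described above can be illustrated with a small toy sketch. This is not the researchers' algorithm: the `Agent` class, the linear exploration schedule, the `low` and `high` bounds, and the greedy hand-down rule are all assumptions made purely for illustration.

```python
from dataclasses import dataclass


@dataclass
class Agent:
    name: str
    score: float    # cumulative reward so far (hypothetical measure)
    holding: float  # value of the opportunity the agent currently holds


def exploration_rates(agents, low=0.1, high=0.5):
    """Stage 1 (assumed form): the higher an agent ranks, the more it explores.

    Rates are spaced linearly between `low` and `high` by score rank.
    """
    ranked = sorted(agents, key=lambda a: a.score)
    n = len(ranked)
    step = (high - low) / (n - 1) if n > 1 else 0.0
    return {a.name: low + i * step for i, a in enumerate(ranked)}


def redistribute(agents, released):
    """Stage 2 (assumed form): lowest performers get first pick.

    Each agent, visited from lowest to highest score, takes the best
    remaining released opportunity if it beats what the agent holds.
    For simplicity, the opportunity an agent gives up is not recycled.
    Returns the opportunities nobody took.
    """
    pool = sorted(released, reverse=True)
    for agent in sorted(agents, key=lambda a: a.score):
        if pool and pool[0] > agent.holding:
            agent.holding = pool.pop(0)
    return pool


agents = [
    Agent("A", score=10.0, holding=5.0),
    Agent("B", score=2.0, holding=1.0),
    Agent("C", score=5.0, holding=3.0),
]
rates = exploration_rates(agents)
leftover = redistribute(agents, released=[2.0, 4.0])
```

In this toy run, the top-scoring agent receives the highest exploration rate, and the best released opportunity goes to the lowest-scoring agent rather than back to the top of the ranking.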
“Our ‘fairness-by-design’ framework utilizes reinforcement learning, which is a type of machine learning (ML). But unlike most machine learning algorithms, ours includes multiple agents competing for finite resources in a dynamic environment, not a static one. That makes our paradigm much more structurally similar to many real-world environments in which various people compete over time for finite resources.”
Jingyuan Yang, assistant professor of information systems and operations management at Costello College of Business at 911爆料
To test the framework, the researchers used a dataset comprising detailed information on the job histories of 6.5 million professionals across a 20-year timeframe. “In the real-world data, we see a high degree of disparity, without very much redistribution of elite opportunities from relatively advantaged to disadvantaged employees,” Yang says.
The algorithm converted the real-world job information into opportunities offered to hypothetical agents. The resulting career paths were analyzed in terms of both performance and fairness. Performance was defined by aggregate rewards earned by all agents across all periods. Fairness was defined by the degree to which initial performance disparities were resolved over successive decisions.
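As a rough illustration of these two definitions, the sketch below computes an aggregate reward and a disparity-reduction score. The article does not give the researchers' exact formulas, so the max-minus-min gap and the function names here are assumptions chosen for simplicity.

```python
def performance(rewards):
    """Aggregate reward: sum over all agents and all periods.

    `rewards` maps each agent name to its list of per-period rewards.
    """
    return sum(sum(per_period) for per_period in rewards.values())


def disparity(scores):
    """Gap between the best and worst score (an assumed disparity measure)."""
    return max(scores) - min(scores)


def disparity_reduction(initial_scores, final_scores):
    """Share of the initial gap that was closed; 1.0 means fully closed.

    Returns 0.0 when there was no initial gap to close.
    """
    start = disparity(initial_scores)
    if start == 0:
        return 0.0
    return 1 - disparity(final_scores) / start


total = performance({"a": [1, 2], "b": [3, 4]})
closed = disparity_reduction([10, 2], [12, 8])
```

Here the initial gap of 8 shrinks to 4, so half of the initial disparity is resolved, while the aggregate reward is tracked separately as a plain sum.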
The “fairness-by-design” framework’s results, for both fairness and performance, were better than those of eight alternative ML methods drawn from three different methodological families.
The researchers also adjusted the system to account for people’s changing preferences. Early-career professionals tend to value employer reputation and advancement potential; in late career, rewards pertaining to job stability and security are more salient. Even with these restrictions implemented, the framework functioned as intended, improving the average quality of overall career paths while fueling upward mobility.
In a follow-up study utilizing the , the framework was tasked with generating route recommendations to hypothetical “agents,” i.e., cab drivers, with varying performance records. In this domain, the choice set was much smaller (263 locations, as compared to 4,282 companies), and the timeframe far shorter (two hours as opposed to 20 years). As with the career-planning example, the taxi study found that more equitable distribution of high-quality routes led to higher average income per minute for the system as a whole.
“Because the framework proved adaptable to different domains and agent preferences, we think it could be used in future as a governance mechanism for a variety of AI contexts,” Yang says. Health care scheduling, course registration in higher education and provision of digital services are a few areas Yang sees as likely candidates.
While emphasizing that her research is still ongoing, she argues that it poses a serious challenge to standard ways of thinking about AI. “Our formal proof establishes the conditions under which fairness and performance reinforce each other, and our experiments show those conditions are achievable in realistic settings. That gives our work both theoretical and experimental grounding,” Yang says.