Study highlights impact of demographics on AI training

A collaborative study conducted by Prolific, Potato, and the University of Michigan has shed light on a critical aspect of AI model development: the profound influence of annotator demographics on the training of AI systems. This study delved into the intricate interplay of age, race, and education on the data used to train AI models, revealing potential pitfalls where biases might seep into the very fabric of these systems.

In today’s world, AI models like ChatGPT have become integral to everyday tasks for many individuals. However, as Assistant Professor David Jurgens of the University of Michigan School of Information points out, we need to critically examine whose values are being embedded into these trained models. If we overlook differences and fail to consider diverse perspectives, we risk perpetuating marginalization of certain groups in the technologies we rely on.

The process of training machine learning and AI systems often involves human annotation to guide and refine their performance. This “Human-in-the-loop” approach, also known as Reinforcement Learning from Human Feedback (RLHF), entails individuals reviewing and categorizing the outputs of language models to enhance their accuracy and appropriateness.

One of the standout findings of this study revolves around the impact of demographics on assessing offensiveness. Intriguingly, the research uncovered that different racial groups held distinct perceptions of what constitutes offensive online comments. For instance, Black participants tended to rate comments as more offensive compared to individuals from other racial backgrounds. Age also played a role, with participants aged 60 and above being more inclined to label comments as offensive compared to their younger counterparts.

The study, encompassing an analysis of 45,000 annotations contributed by 1,484 annotators, spanned a diverse array of tasks. These tasks ranged from detecting offensiveness and answering questions to assessing politeness. The research outcomes highlighted that demographic factors extend their influence even into ostensibly objective tasks like question answering. It was particularly intriguing to observe that factors such as race and age influenced the accuracy of question responses, reflecting disparities that stem from differences in educational opportunities.

Politeness, a pivotal aspect of interpersonal communication, was also found to be significantly influenced by annotator demographics. The study revealed that women tended to attribute lower levels of politeness to messages compared to men. Moreover, older participants were more inclined to assign higher politeness ratings. Interestingly, participants with higher levels of education were more likely to assign lower politeness ratings, and variations were evident among different racial groups and Asian participants.

The implications of this study are profound. As AI systems become more integrated into various aspects of our lives, it becomes imperative to address and mitigate the biases that may arise from the data used to train them. Acknowledging the influence of annotator demographics underscores the importance of diverse and representative input during the development of AI models. By fostering inclusivity and considering a wide range of perspectives, we can pave the way for AI systems that are fair, accurate, and respectful to all users.

Posted in

Aihub Team

Leave a Comment





OpenAI is not currently training GPT-5

OpenAI is not currently training GPT-5

Microsoft’s AI chatbot is ‘unhinged’ and wants to be human

Microsoft’s AI chatbot is ‘unhinged’ and wants to be human

Machine learning expert Jordan bemoans use of AI as catch-all term

Machine learning expert Jordan bemoans use of AI as catch-all term

ITN to explore how AI can be a force for good at the AI & Big Data Expo this November

ITN to explore how AI can be a force for good at the AI & Big Data Expo this November

Fiverr create Demand for AI expertise surges by 1,000%

Fiverr create Demand for AI expertise surges by 1,000%

Databricks acquires LLM pioneer MosaicML for $1.3B

Databricks acquires LLM pioneer MosaicML for $1.3B

AI think tank calls GPT-4 a risk to public safety

AI think tank calls GPT-4 a risk to public safety

AI vs Machine Learning

AI vs Machine Learning

US: AI Begins Taking Over Thousands of Human Jobs | Vantage on Firstpost

US: AI Begins Taking Over Thousands of Human Jobs | Vantage on Firstpost

Snowpark, Input Tables, & Sigma AI: The Future of Analytics

Snowpark, Input Tables, & Sigma AI: The Future of Analytics

How to Scale Service with Generative AI and Einstein GPT

How to Scale Service with Generative AI and Einstein GPT

Fight AI with AI: Going Beyond ChatGPT

Fight AI with AI: Going Beyond ChatGPT

Can China’s ChatGPT clones give it an edge over the U.S. in an A.I. arms race?

Can China’s ChatGPT clones give it an edge over the U.S. in an A.I. arms race?

What Is AI Artificial Intelligence What is Artificial Intelligence

What Is AI Artificial Intelligence What is Artificial Intelligence

Trustworthiness of AI applications in public sector

Trustworthiness of AI applications in public sector

Bringing AI closer to citizens – smart communities

 Bringing AI closer to citizens – smart communities

AI in practice and implementation strategies

AI in practice and implementation strategies

At July 4 cookouts with financial experts, AI takes centre stage while there are burgers, beers, and brainy bots.

At July 4 cookouts with financial experts, AI takes center stage while there are burgers, beers, and brainy bots.

Efficient Generative AI Summit

 Efficient Generative AI Summit

CDAO Chicag

CDAO Chicag

AI Hardware & Edge AI

AI Hardware & Edge AI

AI and the Future of Work

AI and the Future of Work

AI in Art and Creativity

AI in Art and Creativity

Exploring the Ethics of Artificial Intelligence

Exploring the Ethics of Artificial Intelligence

Demystifying Machine Learning

Demystifying Machine Learning

AI in healthcare

AI in Healthcare

New WEF research identifies revolutionary healthcare AI applications

New WEF research identifies revolutionary healthcare AI applications

Tesla’s AI supercomputer tripped the power grid

Tesla’s AI supercomputer tripped the power grid

Stephen Almond, ICO: Prioritise privacy when adopting generative AI

Stephen Almond, ICO: Prioritise privacy when adopting generative AI

Sony has a new ‘AI robotics’ drone division called Airpeak

Sony has a new ‘AI robotics’ drone division called Airpeak