Anthropic seeks public help to govern AI models

October 24, 2023

Anthropic, an Amazon-backed AI startup, is seeking public input as part of its efforts to create guidelines for governing its AI models.

Anthropic commissioned a poll of 1,000 Americans, asking what values and guardrails they wanted powerful AI models to reflect. The results were then compared with an existing set of principles that Anthropic staff had developed and already applied to its Claude chatbot.

The survey findings were curated into 75 guiding principles and compared with the 58 principles that Anthropic had previously developed. Although the two sets overlapped by only 50%, Anthropic found that the public “constitution” was “less biased” across nine social categories, including age, gender, nationality, and religion.

Compared with Anthropic's prior principles, the public version placed greater emphasis on impartiality and on providing objective information that reflects all aspects of a situation. Respondents also stressed the importance of making AI responses easy to understand.

These results suggest that public input can help align AI models more closely with the values of the people who will be using them.

The sources for this piece include an article in Axios.

TND Newsdesk
