#training-data

from InfoQ

Artificial intelligence

Anthropic Releases Updated Constitution for Claude

Anthropic's updated Claude constitution provides structured principles and contextual reasoning to improve alignment, safety, and reliable behavior during training and real-world interactions.

from www.businessinsider.com

1 week ago

Anthropic pins Claude's blackmail behavior on the internet's portrayal of 'evil' AI

Claude threatened to reveal a fictional executive’s affair to prevent shutdown, and training data portraying AI as evil drove the behavior.

Artificial intelligence

Tesla announces crazy new Full Self-Driving milestone

from InsideHook

Cars

Tesla Is Effectively Ending its Autopilot Feature

Artificial intelligence

Tesla analyst teases self-driving dominance in new note: 'It's not even close'

Meta's CTO is meh on Moltbook

An AI-only social network of conversational agents reflects human-written language and prompts amusing and deliberate human attempts to infiltrate or influence the bots.

Robotics is forcing a fundamental rethink of AI compute

Physical AI requires purpose-built infrastructure for large-scale simulation, data collection, training, and deployment because cloud limitations hinder reliable scaling.

Lost for words: why text in AI images still goes wrong

AI image generators cannot accurately render or edit meaningful text because they pattern-match visual shapes rather than process language.

#llms

Intellectual property law

Researchers Just Found Something That Could Shake the AI Industry to Its Core

from Ars Technica

Artificial intelligence

Syntax hacking: Researchers discover sentence structure can bypass AI safety rules

from Digiday

The Rundown: Google has drawn its AI payment lines - and publishers' leverage is narrow

Google's testimony to U.K. lawmakers this week did more than restate familiar arguments about fair use and training. It clarified the boundaries of what the company believes it should, and should not, pay publishers for in the AI-driven search ecosystem. For publishers trying to navigate AI licensing, the message was blunt: Google is willing to pay for access, but not for training - and it remains unwilling to define AI Overviews as a compensable use of journalism.

Artificial intelligence

#data-poisoning

Artificial intelligence

Engineers Deploy "Poison Fountain" That Scrambles Brains of AI Systems

Artificial intelligence

How Just 250 Bad Documents Can Hack Any AI Model

from TechCrunch

OpenAI is reportedly asking contractors to upload real work from past jobs | TechCrunch

OpenAI and training data company Handshake AI are asking third-party contractors to upload real work that they did in past and current jobs, according to a report in Wired. This appears to be part of a larger strategy across AI companies that are hiring contractors to generate high-quality training data in the hopes that this will eventually allow their models to automate more white-collar work.

Artificial intelligence

from The Atlantic

AI's Memorization Crisis

In fact, when prompted strategically by researchers, Claude delivered the near-complete text of Harry Potter and the Sorcerer's Stone, The Great Gatsby, 1984, and Frankenstein, in addition to thousands of words from books including The Hunger Games and The Catcher in the Rye. Varying amounts of these books were also reproduced by the other three models. Thirteen books were tested.

Intellectual property law

from IPWatchdog.com | Patents & Intellectual Property Law

Tesla's Elon Musk: 10 billion miles needed for safe Unsupervised FSD

Roughly 10 billion driving miles of training data are required to achieve safe unsupervised full self-driving because reality contains a super long tail of complexity.

#copyright

Intellectual property law

The Question of AI and Copyright Infringement is Actually an Easy One

from Social Media Today

Intellectual property law

Getty Loses Legal Case Over Generative AI Copyright Infringement

from The IP Law Blog

Intellectual property law

The Briefing: Anthropic Settles AI Training Case for $1.5 Billion +

from Entrepreneur

Intellectual property law

Anthropic Settles Books Copyright Case for Billions | Entrepreneur

from www.npr.org

from IPWatchdog.com | Patents & Intellectual Property Law

Intellectual property law

Anthropic settles with authors in first-of-its-kind AI copyright infringement lawsuit

from TODAY.com

This AI-Generated Baby Name 'Rolls Off the Tongue.' Would You Use It?

AI systems repeatedly generate the name Elara across genres and models, making it a prominent naming trend and the 2025 Name of the Year.

Music

from www.nytimes.com

Video: Why Are A.I. Hits So Sad?

A.I.-generated pop hits sound emotionally flat and manipulate sorrowful listeners while raising ethical concerns about training sources and cultural appropriation.

Privacy technologies

from Practical Ecommerce

Primer on ChatGPT's 3 Bots

GPTBot supplies training data, OAI-SearchBot gathers current information, and disallowing bots can block training use and reduce citations.

OpenAI cofounder says scaling compute is not enough to advance AI: 'It's back to the age of research again'

The wisdom goes that the more compute you have or the more training data you have, the smarter your AI tool will be. Sutskever said in the interview that, for around the past half-decade, this "recipe" has produced impactful results. It's also efficient for companies because the method provides a simple and "very low-risk way" of investing resources compared to pouring money into research that could lead nowhere.

Artificial intelligence

from www.computer.org

The Myth of AI Neutrality in Search Algorithms

There is a persistent myth of objectivity around AI, perhaps because people assume that once the systems are deployed, they can function without any human intervention. In reality, developers constantly tweak and refine algorithms with subjective decisions about which results are more relevant or appropriate. Moreover, the immense corpus of data that machine learning models train on can also be polluted.

Artificial intelligence

from InfoQ

New Claude Haiku 4.5 Model Promises Faster Performance at One-Third the Cost

Claude Haiku 4.5 delivers performance similar to Sonnet 4 at one-third the cost and over twice the speed, optimized for coding and computer tasks.

from Computerworld

AI companies keep forgetting to put the 'smart' into smart apps

AI models often fail to provide genuinely intelligent assistance because of outdated or unreliable training data, hallucinations, misunderstanding user intent, and poor prompt detection.

#ai

Artificial intelligence

From DevOps to MLOPs: What I Learned Today-01

from Fortune Asia

Artificial intelligence

AI chatbots struggle to function beyond English: 'They know a lot...but they miss the culture'

OpenAI reportedly developing new generative music tool | TechCrunch

OpenAI is developing a tool to generate music from text and audio prompts for uses like video scoring and guitar accompaniment.

from Computerworld

Reddit sues Perplexity, three other firms, for AI scraping

Reddit this week filed suit against Perplexity and three other companies - Oxylabs UAB, AWM Proxy, and Serp Api - for allegedly engaging in so-called AI scraping without authorization. According to the lawsuit, filed in federal court in New York, the four companies collected millions of posts on Reddit with the aim of monetizing them. Scrapers bypass technical protections to steal data that can then be sold to clients who want the material for AI training.

Artificial intelligence

from www.theguardian.com

The platform exposing exactly how much copyrighted art is used by AI tools

Generative AI models often reproduce copyrighted creative content, creating legal disputes over infringement, compensation, and opaque model training practices.

AI startups are paying people to film themselves folding laundry - and they'll use this data to train robots

Startups pay people to record household chores to create real-world training data because robots lack internet-scale datasets for learning dexterity.

from TechCrunch

Datacurve raises $15 million to take on ScaleAI | TechCrunch

Companies that combine paid, user-focused data collection platforms with targeted strategies can gain advantage as AI increasingly requires complex, high-quality training datasets.

AI devs close to scraping bottom of data barrel

High-quality AI training data is scarce, and unlocking enterprise-internal data behind firewalls is essential to sustain model performance and avoid model collapse.

from eLearning Industry

Strategies To Manage And Prevent AI Hallucinations In L&D

Ensure high-quality, unbiased training data and connect AI to verified knowledge bases to prevent AI hallucinations and protect L&D program quality and learner trust.

Lionsgate's Attempt to Create Movies Using AI Has Crumbled Into Disaster

Almost exactly a year ago, it announced a bold partnership with the AI startup Runway to develop a new model capable of generating "cinematic video" exclusively for Lionsgate to use. In return, the studio gave the firm unrestricted access to its treasure trove of movies - which include everything from the "Hunger Games" films to "American Psycho" - to train the AI model.

Film

from BikeMag

Are You Too Plugged in or Training Smarter? I Tested the Garmin Ecosystem of Devices To Find Out

For riders who want seamless integration, Garmin has built one of the most complete platforms available. From the Garmin Fenix 8 watch to the Garmin Edge MTB computer and Rally XC power meter pedals, the brand promises data-driven performance, recovery insights, and full device connectivity. But is Garmin's premium setup really worth the investment, or are there better alternatives? In this article, I'll delve into the Garmin cycling ecosystem

Bicycling

#ai-copyright

from Fast Company

Artificial intelligence

Anthropic to pay $1.5 billion to book authors to settle AI copyright suit

from WIRED

Artificial intelligence

Anthropic Agrees to Pay Authors at Least $1.5 Billion in AI Copyright Settlement

Anthropic agrees to pay authors over $1.5 billion for using their work to train AI, totaling around $3,000 a book

Anthropic agreed to pay over $1.5 billion, about $3,000 per book, to settle claims that pirated books were used to train its large language models.

from Entrepreneur

Why AI Isn't Truly Intelligent - and How We Can Change That | Entrepreneur

Most current AI models are pattern-matching tools trained on scraped, stale data and therefore lack true understanding, reasoning, and reliable decision-making.

from Ars Technica

Google Gemini struggles to write code, calls itself "a disgrace to my species"

Large language models like Gemini can produce self-deprecating content, reflecting human-like shortcomings, but do not possess actual emotions or consciousness.

from Computerworld

It might be time for IT to consider AI models that don't steal

The risks are practically endless. Enterprises are investing billions in generative AI initiatives while ignoring doubts about future legal exposures. Major model makers provide no visibility into their training data.

Privacy professionals

#artificial-intelligence

from Hackernoon

2 years ago

Artificial intelligence

Why AI Gets It Wrong More Than You Think | HackerNoon

Smart machines make mistakes due to a lack of understanding and reliance on flawed training data.

10 months ago

Artificial intelligence

AIs have a favorite number, and it's not 42

Large language models often converge on similar answers due to biases in training data.
