#training-data

[ follow ]
large-language-models
Medium
1 week ago
Data science

Deploying Large Language Models (LLMs) on Google Cloud Platform

Large language models (LLMs), like ChatGPT, are rapidly gaining popularity due to their conversational abilities and natural language understanding. [ more ]
MarTech
3 months ago
Artificial intelligence

How to protect against and benefit from generative AI hallucinations | MarTech

Marketers using large language models (LLMs) must be concerned about 'hallucinations' and how to prevent them.
LLMs can produce nonsensical or inaccurate outputs that are not based on training data and do not follow any identifiable pattern. [ more ]
morelarge-language-models
openai
The Verge
1 day ago
Marketing tech

Leaked OpenAI slide deck reveals how it's wooing publishers.

OpenAI offers incentives to publishers like financial compensation and priority placement for training data and licensing agreements. [ more ]
TechCrunch
1 week ago
Artificial intelligence

Watch: OpenAI's media deal rush continues with FT deal

OpenAI and FT deepen their content deal with potential FT.com links in ChaptGPT, highlighting the AI company's strategy to ingest training material and pay providers. [ more ]
www.nytimes.com
1 month ago
Artificial intelligence

Four Takeaways on the Race to Amass Data for A.I.

Data is essential for the success of artificial intelligence models like large language models.
Large language models are trained on massive amounts of data collected from various sources like websites, books, and articles. [ more ]
moreopenai
ai-companies
Fast Company
1 week ago
Data science

The Financial Times deal with OpenAI highlights an uneasy future for both media and tech

Media outlets like Financial Times are licensing journalistic content to tech firms like OpenAI for training data, offering hope amidst challenging times. [ more ]
The Verge
1 month ago
Data science

OpenAI transcribed over a million hours of YouTube videos to train GPT-4

AI companies facing challenges in obtaining high-quality training data.
Companies adopting methods in AI training that navigate copyright law ambiguities. [ more ]
The Verge
2 months ago
Artificial intelligence

Tumblr's owner is striking deals with OpenAI and Midjourney for training data, says report

Automattic in talks with AI companies to use data from Tumblr users' posts for training AI models.
Automattic plans to launch an opt-out setting for users to prevent data sharing with third parties, including AI companies. [ more ]
moreai-companies
generative-ai
Mashable
2 weeks ago
Graphic design

Adobe unveils AI features for Photoshop - but not everyone is happy about it

Generative AI features added to Photoshop ease image creation from text prompts for both rookies and professionals. [ more ]
CNET
2 weeks ago
Artificial intelligence

5-ish Things on AI: Fake James Bond Trailer Goes Viral, an Inside Look at Secretive Training Data

AI companies rely on vast amounts of training data from online sources like books, Wikipedia, and news to power large language models for chatbots. [ more ]
Iapp
4 weeks ago
Artificial intelligence

ICO seeks input on accuracy of generative AI

The U.K. Information Commissioner's Office is soliciting feedback on the impact of accurate training data on AI models to avoid misinformation. [ more ]
TechCrunch
1 month ago
Artificial intelligence

OpenAI built a voice cloning tool, but you can't use it... yet | TechCrunch

OpenAI debuts Voice Engine, allowing synthetic voice generation from 15-second samples, emphasizing responsible deployment.
The generative AI model behind Voice Engine powers other features like ChatGPT's voice capabilities and Spotify's podcast dubbing function. [ more ]
Medium
2 months ago
Artificial intelligence

Generative AI, Ethics, and Social Good

Generative AI methods are being adopted across industries for various data types.
One application of generative AI is accelerating drug development by suggesting new molecules or proteins based on learned patterns. [ more ]
WIRED
3 months ago
Artificial intelligence

An AI Executive Turns AI Crusader to Stand Up for Artists

Generative AI has an ethics problem
Fairly Trained offers a certification program for AI companies to ensure ethical use of training data [ more ]
moregenerative-ai
copyright-infringement
www.vox.com
2 months ago
Artificial intelligence

A poster's guide to who's selling your data to train AI

AI systems like ChatGPT use scraped public data to train, sometimes leading to lawsuits.
Companies like OpenAI face legal challenges for using copyrighted material without permission. [ more ]
www.nytimes.com
3 months ago
Artificial intelligence

We Asked A.I. to Create the Joker. It Generated a Copyrighted Image.

A.I. image generators can create images nearly identical to existing copyrighted materials.
The use of intellectual property in A.I. training data raises legal and ethical concerns. [ more ]
The Verge
3 months ago
Artificial intelligence

AI models that don't violate copyright are getting a new certification label

Groups are offering certification programs to AI companies to show they don't violate copyright.
Fairly Trained, founded by a former Stability AI VP, labels companies that prove they asked for permission to use copyrighted training data. [ more ]
Theregister
4 months ago
Artificial intelligence

OpenAI: Impossible to train AI models and avoid copyright

AI services like DALL-E 3 and Midjourney can recreate copyrighted scenes from films and video games based on their training data.
The study suggests that both Midjourney and OpenAI trained their AI models on copyrighted material, raising concerns about legal liability for copyright infringement. [ more ]
morecopyright-infringement
Theregister
1 day ago
Data science

Experts divided over training AI with more data from AI

AI model collapse is not inevitable, as argued by a group of academics. [ more ]
FlowingData
4 months ago
Data science

AI-based things in 2023

LLMs are easy to build with just a few hundred lines of Python
The quantity and quality of training data is the most important factor in the performance of LLMs [ more ]
Yahoo Finance
4 months ago
Data science

Why we need to fear the risk of AI model collapse

Generative AI has the potential for great benefits but also carries risks, including model collapse.
Model collapse occurs when generative AI becomes unstable or unreliable due to training on synthetic data instead of human-generated data. [ more ]
Fast Company
1 week ago
Artificial intelligence

The AI arms race may soon center on a competition for 'expert' data

The AI arms race is shifting towards acquiring specialized data for model training. [ more ]
Entrepreneur
4 weeks ago
Artificial intelligence

Adobe's Firefly AI Image Generator Partly Trained With AI: Report | Entrepreneur

Adobe's AI image generator Firefly included images from competitors in its training data, raising ethical concerns. [ more ]
Ars Technica
4 weeks ago
Artificial intelligence

US lawmaker proposes a public database of all AI training material

AI companies may soon be required to disclose copyrighted works used in training datasets to ensure creators are aware and can seek credit or compensation. [ more ]
AdExchanger
2 months ago
Artificial intelligence

When Generative AI Makes The Ad | AdExchanger

Generative AI startups are transforming ad creative with specialization and generalization in channels like video, audio, and social media.
Consider the importance of training data for AI models in creating specific types of content, like classic art versus stock images. [ more ]
Medium
2 months ago
Artificial intelligence

AI and designers: the ethical and legal implications

AI integration unlocks opportunities
Designers need to understand ethical and legal aspects
Generative AI uses training data and deep learning [ more ]
Singularity Hub
2 months ago
Artificial intelligence

Why the New York Times' AI Copyright Lawsuit Will Be Tricky to Defend

Lawsuits against AI companies over copyright issues increasing
Legal arguments around training data in AI lawsuits evolving
Novel argument about AI 'hallucinations' in NYT case [ more ]
www.nytimes.com
4 weeks ago
Privacy professionals

A.I.'s Data Wall, a Surprise Privacy Bill, and What Happened to the TikTok Ban?

Artificial intelligence companies facing limitations on available training data, new bipartisan national privacy law proposal, ByteDance focusing on new apps amid TikTok ban. [ more ]
The Atlantic
2 months ago
Artificial intelligence

The Holy Grail for AI Research

The current limitations of AI progress include a lack of training data and the slow process of human evaluation.
Researchers are exploring the use of AI models to improve other AI models, potentially leading to significant advancements. [ more ]
Bloomberg
3 months ago
Privacy technologies

Bloomberg

AI devices are reinforcing gender biases
This has implications for AI technology in areas such as healthcare and criminal justice [ more ]
WIRED
3 months ago
Artificial intelligence

Most Top News Sites Block AI Bots. Right-Wing Media Welcomes Them

AI models are fine-tuned using reinforcement learning from human feedback.
The use of broad training data helps AI models represent diverse cultures, industries, ideologies, and languages. [ more ]
Axios
3 months ago
Artificial intelligence

AI's next big fight: Whose values should it hold?

AI systems are embedded with values and biases, forcing creators to make choices about whose values the system will respect.
The data with which AI systems are trained and the efforts developers take to mitigate biases play a crucial role in shaping their points of view. [ more ]
www.cnbc.com
3 months ago
Artificial intelligence

Blockchain, the tech behind bitcoin, may have found its 'killer use case' by keeping AI in check

Using blockchain to prevent bias in AI data could be a killer use case for the technology.
Blockchain provides an immutable and tamper-proof ledger for training data, allowing developers to track and roll back AI models if biases or false information are detected. [ more ]
www.fastcompany.com
4 months ago
Artificial intelligence

The New York Times's OpenAI lawsuit could put a damper on AI's 2024 ambitions

The New York Times filed a lawsuit against OpenAI and Microsoft, alleging unauthorized use of its content to train AI models.
The lawsuit highlights the potential legal challenges and copyright concerns faced by AI companies using large language models. [ more ]
[ Load more ]