#language-models

from WIRED
1 day ago

Latam-GPT: The Free, Open Source, and Collaborative AI of Latin America

Latam-GPT is an open-source large language model developed in Latin America to promote technological independence and handle regional languages, dialects, and cultural contexts.
from Psychology Today
5 days ago

The Greatest Illusion on Earth

At its core (dare I say heart), AI is a machine of probability. Word by word, it predicts what is most likely to come next. This continuation is dressed up as conversation, but it isn't cognition. It is a statistical trick that feels more and more like thought. Training reinforces the trick through what's called a loss function. But this isn't a pursuit of truth. It measures how well a sequence of words matches the patterns of human language.
Artificial intelligence
from Futurism
2 weeks ago

New Research Finds That ChatGPT Secretly Has a Deep Anti-Human Bias

Leading large language models exhibit significant bias favoring AI-generated content over human content, raising concerns about future discrimination against humans.
#ai
from Fortune Asia
1 month ago
Artificial intelligence

AI chatbots struggle to function beyond English: 'They know a lot...but they miss the culture'

from InfoQ
2 months ago
Artificial intelligence

A Framework for Building Micro Metrics for LLM System Evaluation

Artificial intelligence
from Fast Company
2 months ago

How a travel and expense platform is breaking ground on a zero-hallucinations AI workforce

Navan's new AI platform addresses critical hallucinations in large language models, pushing for a reliable AI workforce capable of automating complex tasks.
from Ars Technica
3 weeks ago

Google Gemini struggles to write code, calls itself "a disgrace to my species"

Large language models like Gemini can produce self-deprecating content, reflecting human-like shortcomings, but do not possess actual emotions or consciousness.
#openai
from InfoQ
3 weeks ago
Artificial intelligence

OpenAI Releases gpt-oss-120b and gpt-oss-20b, Open-Weight Language Models for Local Deployment

from ZDNET
3 weeks ago
Artificial intelligence

OpenAI returns to its open-source roots with new open-weight AI models, and it's a big deal

#ai-development
from Hackernoon
1 year ago
Artificial intelligence

phi-3-mini: The 3.8B Powerhouse Reshaping LLM Performance on Your Phone | HackerNoon

Tech industry
from TechCrunch
3 weeks ago

Exclusive: The high costs and thin margins threatening AI coding startups

Windsurf's valuation attempts fell apart due to significant operational losses from AI coding assistant costs.
#artificial-intelligence
#technology
from Hackernoon
2 years ago
Tech industry

The HackerNoon Newsletter: Stop Believing the Agent Hype-The Numbers Don't Lie (7/23/2025) | HackerNoon

#multi-token-prediction
from Hackernoon
1 year ago
Artificial intelligence

Multi-Token Prediction: Mastering Algorithmic Reasoning with Enhanced Resource Use | HackerNoon

from Hackernoon
1 year ago
Artificial intelligence

Unlocking Generative Power: Multi-Token Prediction for Next-Gen LLMs | HackerNoon

Ruby on Rails
from Rubyflow
1 month ago

Run LLMs natively in Ruby with Rust + GPU support

Red Candle enables running large language models directly in Ruby via Rust, enhancing integration and performance.
from IT Pro
1 month ago

Microsoft is doubling down on multilingual large language models - and Europe stands to benefit the most

Microsoft plans to enhance multilingual LLMs in Europe by making multilingual data publicly accessible and providing grants for underrepresented languages.
from www.berkeleyside.org
1 month ago

Wire: Berkeley Hills neighborhood is fastest aging in Bay Area; Homeless Response Team audited

Berkeley's Thousand Oaks neighborhood is aging significantly, with a rising median age and many residents reaching retirement.
Public health
from Nature
1 month ago

Low-quality papers based on public health data are flooding the scientific literature

A surge in low-quality papers based on large public health databases has been linked to language models and paper mills.
from Fortune Asia
1 month ago

The world's best AI models operate in English. Other languages-even major ones like Cantonese-risk falling further behind

AI translation models struggle with languages that have limited online data, leading to mistranslations and inaccuracies.
#cybersecurity
from Futurism
1 month ago
Privacy professionals

McDonald's Idiotic AI Hiring System Just Leaked Personal Data About Millions of Job Applicants

#chatgpt
from InfoQ
1 month ago

LM Studio 0.3.17 Adds Model Context Protocol (MCP) Support for Tool-Integrated LLMs

LM Studio version 0.3.17 adds support for the Model Context Protocol (MCP), enhancing language models' access to external tools and data sources.
from Futurism
1 month ago

Bombshell Research Finds a Staggering Number of Scientific Papers Were AI-Generated

Researchers identified 454 overused terms from AI language models, revealing that 13.5 to 40 percent of biomedical article abstracts were likely generated or assisted by AI.
Science
from Hackernoon
1 year ago

phi-3-mini's Triumph: Redefining Performance on Academic LLM Benchmarks | HackerNoon

The results for phi-3-mini on standard open-source benchmarks measure the model's reasoning ability, comparing it to phi-2 and several other notable models.
Artificial intelligence
from Hackernoon
55 years ago

The Last Rank We Need? QDyLoRA's Vision for the Future of LLM Tuning | HackerNoon

QDyLoRA offers an efficient and effective technique for LoRA-based fine-tuning of LLMs on downstream tasks, eliminating the need to tune multiple models to find the optimal rank.
Artificial intelligence
from Ars Technica
2 months ago

Anthropic destroyed millions of print books to build its AI models

The AI industry's quest for high-quality training data has led companies like Anthropic to explore controversial practices in acquiring books for their models.
#generative-ai
#ai-evaluation
Artificial intelligence
from Futurism
2 months ago

The Tech Industry Said It Was "Impossible" to Create AI Based Entirely on Ethically-Sourced Data, So These Scientists Proved Them Wrong in Spectacular Fashion

A team of researchers successfully trained a large language model using only public domain or openly licensed data, highlighting an ethical approach.
Artificial intelligence
from Hackernoon
55 years ago

Standing on AI Giants: How InteraSSort Builds on Marketing and Tool Integration Research | HackerNoon

LLMs can enhance marketing strategies, particularly in assortment planning and customer engagement.
from Hackernoon
3 months ago

Lesson Principles: Defining Effective Praise in Tutoring | HackerNoon

Effective praise is vital for student motivation and involves specific, immediate feedback highlighting effort over mere outcome.
#google
from Futurism
3 months ago
Artificial intelligence

Google Humiliated as Its Idiot AI Overviews Caught Telling Users It's Still 2024

Google's AI Overviews mistakenly claimed it was still 2024, reflecting ongoing issues with accuracy in AI systems.
Marketing tech
from Andreessen Horowitz
3 months ago

How Generative Engine Optimization (GEO) Rewrites the Rules of Search | Andreessen Horowitz

SEO is being replaced by Generative Engine Optimization (GEO) driven by language models.
Visibility in search is shifting from page rank to being included directly in AI-generated answers.
from Hackernoon
3 months ago

Modified Intersection over Union (M-IoU) for Sequence Labeling Evaluation | HackerNoon

In sequence labeling tasks, traditional metrics like the F1 score are insufficient. Our study introduces a modified approach to better assess model performance in identifying praise.
Artificial intelligence
from Computerworld
3 months ago

How 'dark LLMs' produce harmful outputs, despite guardrails

"While commercial LLMs incorporate safety mechanisms to block harmful outputs, these safeguards are increasingly proving insufficient. A critical vulnerability lies in jailbreaking..."
Artificial intelligence
from Theregister
3 months ago

AI can't replace freelance coders yet, but the day is coming

AI models can perform freelance coding tasks but are less effective than human coders.
from Medium
3 months ago

AI Made Simple - What Every Conversation Designer Should Know (Series) - RAG Basics

As a conversation designer, it's important to understand some of the techniques used to optimize large language models (LLMs).
Artificial intelligence
from Defector
3 months ago

Chicago Sun-Times And Philadelphia Inquirer Publish Huge Summer Insert Of Pure, Uncut Chatbot Slop | Defector

The 'Best of Summer' inserts in the Chicago Sun-Times and Philadelphia Inquirer contained factual inaccuracies, illustrating the impact of AI-generated content on journalism.
Marketing tech
Artificial intelligence
from The Verge
3 months ago

Apple will reportedly open up its local AI models to third-party apps

Apple opens access to its AI models for developers via an SDK.
Focus is on smaller on-device models, not cloud access initially.
Limited features for developers include AI Writing Tools and Image Playground.
Major announcement expected at WWDC on June 9th.
from ZDNET
3 months ago

Meta delays 'Behemoth' AI model, handing OpenAI and Google even more of a head start

Meta was expected to unveil its 'Behemoth' model at LlamaCon, its generative AI developer conference, but development struggles and concerns about the model's capabilities have postponed the release.
Artificial intelligence
from InfoWorld
3 months ago

LiteLLM: An open-source gateway for unified LLM access

LiteLLM simplifies integration of multiple language models via a unified API, enhancing developer productivity.
Artificial intelligence
from Nature
3 months ago

AI language models develop social norms like groups of people

Large language models can develop social norms through interactive games, demonstrating collective behavior similar to humans.
#tokenization
Bootstrapping
from Hackernoon
8 months ago

How Many Glitch Tokens Hide in Popular LLMs? Revelations from Large-Scale Testing | HackerNoon

The study reveals that simple indicators can effectively detect under-trained tokens in language models, improving token prediction accuracy.
from 3 Quarks Daily
3 months ago

Whispers in Code: Grooming Large Language Models for Harm - 3 Quarks Daily

The rise of large language models changes how users interact with information, leading to reduced critical exploration.