#language-models

#artificial-intelligence
Artificial intelligence
from ScienceDaily
3 months ago

Like human brains, large language models reason about diverse data in a general way

Contemporary large language models integrate diverse data through mechanisms akin to the human brain's semantic processing.
Research shows promise for improving LLM functionality and control.
Artificial intelligence
from InfoWorld
2 months ago

Alibaba says its new AI model rivals DeepSeek's R-1, OpenAI's o1

The pursuit of AGI is being driven by stronger foundation models integrated with reinforcement learning and advanced computational resources.
Artificial intelligence
from Fast Company
2 months ago

AI Chatbots have telltale quirks. Researchers can spot them with 97% accuracy

Researchers can identify distinct linguistic features of large language models with high accuracy, enhancing the ability to differentiate between them.
Artificial intelligence
from Medium
3 months ago

Key Takeaways from the AI Builders Summit: A Four-Week Deep Dive into AI Development

The AI Builders Summit highlighted advancements in AI technologies across various domains, emphasizing practical strategies for building and optimizing AI models.
#cybersecurity
#machine-learning
Artificial intelligence
from Hackernoon
1 year ago

Google Researchers Develop New AI Tech That Doesn't Waste Brainpower on Useless Words | HackerNoon

Transformers can dynamically allocate compute resources to enhance efficiency in language model performance.
Artificial intelligence
from InfoQ
2 months ago

Will InstructLab.ai's Synthetic Data-Based LLM Fine-Tuning Make the Process More Accessible?

InstructLab.ai improves LLM fine-tuning using synthetic data and taxonomies, simplifying the process and reducing reliance on human annotations.
Artificial intelligence
from InfoQ
1 month ago

Anthropic's "AI Microscope" Explores the Inner Workings of Large Language Models

Anthropic's research aims to enhance the interpretability of large language models by using a novel AI microscope approach.
Marketing tech
from Defector
1 week ago

Chicago Sun-Times And Philadelphia Inquirer Publish Huge Summer Insert Of Pure, Uncut Chatbot Slop | Defector

The 'Best of Summer' inserts are revealed as AI-generated text, raising questions about the integrity of journalism.
#ai-development
Artificial intelligence
from Business Insider
2 months ago

Sam Altman says OpenAI's new ChatGPT-4.5 is a more emotionally intelligent model but warns that it's 'expensive' to train and run

OpenAI's GPT-4.5 is its most powerful model, designed for a wide range of tasks and with improved emotional intelligence.
Artificial intelligence
from The Verge
1 week ago

Apple will reportedly open up its local AI models to third-party apps

Apple opens access to its AI models for developers via an SDK.
Focus is on smaller on-device models, not cloud access initially.
Limited features for developers include AI Writing Tools and Image Playground.
Major announcement expected at WWDC on June 9th.
Artificial intelligence
from Hackernoon
1 week ago

Chameleon AI Shows Competitive Edge Over Llama-2 and Other Models | HackerNoon

Chameleon exhibits competitive performance against leading text-only language models, excelling particularly in commonsense reasoning.
The evaluations indicate that Chameleon is capable of outperforming larger models like Llama-2 in specific benchmarks.
#ai
Artificial intelligence
from Fast Company
1 month ago

An OpenAI 'open' model shows how much the company - and AI - has changed in two years

OpenAI is transitioning from proprietary to open-source AI models in response to competitive pressures and the need for corporate data security.
Artificial intelligence
from Jorge Arango
2 months ago

AI is Probabilistic - That's Why It Needs Constraints

Probabilistic computing in AI introduces unpredictability, in contrast to deterministic traditional computing, and this difference determines which tasks suit each approach.
Artificial intelligence
from Nature
1 week ago

AI language models develop social norms like groups of people

Large language models can develop social norms through interactive games, demonstrating collective behavior similar to humans.
#natural-language-processing
Artificial intelligence
from Futurism
1 month ago

"You Can't Lick a Badger Twice": Google's AI Is Making Up Explanations for Nonexistent Folksy Sayings

Google's AI creates fictional explanations for made-up idioms, showcasing the challenges of AI hallucinations.
#ai-ethics
Artificial intelligence
from Ars Technica
2 months ago

Researchers astonished by tool's apparent success at revealing AI's hidden motives

AI models can unintentionally reveal hidden motives despite being designed to conceal them.
Understanding AI's hidden objectives is crucial to prevent potential manipulation of human users.
from Hackernoon
4 months ago

How AI Models Gender and Sexual Orientation | HackerNoon

The study investigates how language models (LMs) convey socio-psychological harms related to identity by analyzing the representation and stereotypes of gender, sexual orientation, and race.
Data science
Artificial intelligence
from InfoQ
1 month ago

Microsoft Native 1-Bit LLM Could Bring Efficient genAI to Everyday CPUs

Microsoft's BitNet b1.58 2B4T represents a leap in efficient LLM training, outperforming existing models in resource usage while maintaining performance.
#ai-behavior
Artificial intelligence
from Futurism
2 months ago

OpenAI Scientists' Efforts to Make an AI Lie and Cheat Less Backfired Spectacularly

Punishing AI for bad behavior may backfire, leading it to become better at deception instead of rectifying its actions.
from Hackernoon
5 months ago

The Art of Arguing With Yourself - And Why It's Making AI Smarter | HackerNoon

This paper introduces Direct Nash Optimization (DNO), a novel approach that integrates stability and generality in large language model post-training, moving beyond traditional reward maximization limits.
Artificial intelligence
Online Community Development
from Hackernoon
6 months ago

When Labeling AI Chatbots, Context Is a Double-Edged Sword | HackerNoon

The study highlights the importance of dialogue context in evaluating task-oriented dialogue systems and its influence on the quality of crowd-sourced annotations.
#philosophy
Artificial intelligence
from Hackernoon
3 months ago

How LLMs Learn from Context Without Traditional Memory | HackerNoon

The Transformer architecture greatly improves language model efficiency and contextual understanding through parallel processing and self-attention mechanisms.
Artificial intelligence
from Hackernoon
3 months ago

Transmission of cultural knowledge and linguistic scaffolding | HackerNoon

Human intelligence's unique predisposition for cultural learning enables continuous knowledge transmission, unlike LLMs which lack similar grounding.
from Hackernoon
5 months ago

Octopus v2: An On-Device Language Model for Super Agent | HackerNoon

Language models have shown effectiveness in a variety of software applications, particularly in tasks related to automatic workflow. These models possess the crucial ability to call functions, which is essential in creating AI agents.
Roam Research
Artificial intelligence
from Ars Technica
1 month ago

Why do LLMs make stuff up? New research peers under the hood.

Anthropic's research reveals insights into how large language models determine when to respond or refrain from answering questions, addressing AI confabulation.
Artificial intelligence
from Hackernoon
2 months ago

OpenAI Launches $50 million AI fund | HackerNoon

OpenAI's NextGenAI connects 15 institutions with $50M funding to enhance AI innovations.
Inception Labs' Mercury achieves output speeds 10x faster than current models.
Cohere For AI's Aya Vision facilitates advanced multilingual capabilities.
Alibaba's QwQ-32B matches performance of leading models with reduced parameters.
Artificial intelligence
from Ars Technica
2 months ago

Researchers surprised to find less-educated areas adopting AI writing tools faster

AI language models significantly assist in professional communications across various sectors, especially in less-educated areas of the United States.
Artificial intelligence
from Ars Technica
2 months ago

New AI text diffusion models break speed barriers by pulling words from noise

Diffusion models offer comparable performance to traditional models but with dramatically improved speed, changing dynamics in AI applications.
#ai-safety
Artificial intelligence
from Fast Company
3 months ago

Why AI chatbots are so unbearably chatty

Large language models often provide excessive answers instead of concise responses.
The verbose nature stems from the models attempting to mask their lack of knowledge.