#long-context-256k

#ai-models
Apple
fromEngadget
8 hours ago

DeepSeek promises its new AI model has 'world-class' reasoning

DeepSeek launched V4 Pro and Flash AI models, featuring enhanced context length and capabilities, while facing bans due to security concerns.
#ai
Psychology
fromPsychology Today
3 days ago

More Us Than It: Why LLMs Are More Transference Than Machine

Countertransference awareness is essential in navigating interactions with AI, emphasizing the need for accountability and understanding of distortions in perception.
Artificial intelligence
fromAxios
1 day ago

OpenAI releases "Spud" GPT-5.5 model

GPT-5.5 enhances autonomous task handling and efficiency in various fields, marking a significant advancement in AI capabilities.
Artificial intelligence
fromGSMArena.com
2 hours ago

DeepSeek-V4 Preview launches with open weights and API access

DeepSeek launched its new AI model, DeepSeek-V4, featuring Expert and Instant versions with significant capabilities and open-weights for community use.
Graphic design
fromwww.businessinsider.com
2 days ago

OpenAI wants you to know how good its new image model is at faking real photos

OpenAI's ChatGPT Images 2.0 features advanced image generation capabilities, including internet crawling and multi-language support.
Graphic design
fromTechCrunch
3 days ago

ChatGPT's new Images 2.0 model is surprisingly good at generating text | TechCrunch

AI-generated imagery has significantly improved, creating realistic content that can be indistinguishable from human-made designs.
Data science
fromInfoWorld
10 hours ago

Why world models are AI's next frontier

World models learn the physical world, providing the common sense AI needs to achieve artificial general intelligence (AGI).
Gadgets
fromGSMArena.com
22 hours ago

Nothing introduces Essential Voice speech-to-text transcription and translation

Essential Voice is a speech-to-text engine that delivers clear, real-time text by eliminating filler words and supporting multiple languages.
fromNature
2 days ago

Evaluating large language models for accuracy incentivizes hallucinations - Nature

Next-word pretraining creates statistical pressure toward hallucination, even with idealized error-free data. Facts lacking repeated support in training data yield unavoidable errors, while recurring regularities do not.
Tech industry
fromwww.businessinsider.com
2 days ago

Google's new chips are a shot at Nvidia and a big hint at where AI goes next

Google unveiled its latest AI chips, TPU 8t for training and TPU 8i for inference, responding to industry shifts towards inference computing.
#openai
Artificial intelligence
fromFortune
1 day ago

GPT-5.5 is here, and AI model launches are starting to look like software updates | Fortune

OpenAI released GPT-5.5, emphasizing its rapid development and enhanced capabilities for enterprise users and consumers.
Privacy technologies
fromTNW | Artificial-Intelligence
3 days ago

OpenAI Codex Chronicle captures your Mac screen to build AI context, with cloud processing and no encryption

Chronicle captures screenshots for AI context, prioritizing cloud processing over local privacy, and requires a Pro subscription and Apple Silicon.
#deepseek
Artificial intelligence
fromTechCrunch
6 hours ago

DeepSeek previews new AI model that 'closes the gap' with frontier models | TechCrunch

DeepSeek launched V4 models, featuring 1 million token context windows and significant parameter counts, outperforming many peers in reasoning benchmarks.
DevOps
fromTechzine Global
1 week ago

Claude Opus 4.7 is no Mythos, and that's a good thing

Claude Opus 4.7 improves software engineering, vision, and agentic tasks, but is not the risky Mythos model Anthropic refrains from fully releasing.
Data science
fromTheregister
2 days ago

LLMs fuel new generation of natural language query systems

Text-to-SQL tools may simplify data queries but can misinterpret business users' intentions, raising caution for organizations.
Node JS
fromRaymondcamden
1 week ago

Summarizing Docs with Built-in AI

On-device summarization of various document types, including Office formats, is achievable using libraries like officeParser and Chrome's Summary API.
European startups
fromTNW | Launch
1 week ago

DeepL launches real-time voice-to-voice translation in 40+ languages

DeepL launched a voice translation suite for real-time spoken communication in various business settings, supporting over 40 languages.
Philosophy
fromJames Bennett
2 weeks ago

Let's talk about LLMs

The current technological landscape may represent a significant shift driven by large language models, but its ultimate impact remains uncertain.
JavaScript
fromInfoWorld
2 weeks ago

27 questions to ask when choosing an LLM

Model performance is crucial for hardware compatibility, speed, and rate limits in real-time applications.
Scala
fromInfoQ
3 weeks ago

Beyond RAG: Architecting Context-Aware AI Systems with Spring Boot

Context-Augmented Generation (CAG) enhances Retrieval-Augmented Generation (RAG) by managing runtime context for enterprise applications without requiring model retraining.
Python
fromTalkpython
3 weeks ago

Deep Agents: LangChain's SDK for Agents That Plan and Delegate

Deep Agents framework enables building advanced AI agents using Python functions and middleware, enhancing capabilities beyond standard LLMs.
Software development
fromInfoWorld
3 weeks ago

Meta shows structured prompts can make LLMs more reliable for code review

Code review is evolving towards machine-led verification, improving accuracy but introducing tradeoffs like increased latency and workflow overhead.
Online learning
fromwww.businessinsider.com
3 weeks ago

Inside the OpenAI project where freelancers train ChatGPT on everything from farming to commercial flying

Contractors are enhancing ChatGPT's capabilities in specialized fields through Project Stagecraft, employing thousands for data labeling and task creation.
Artificial intelligence
fromTechCrunch
1 day ago

OpenAI releases GPT-5.5, bringing company one step closer to an AI 'superapp' | TechCrunch

OpenAI released GPT-5.5, its most advanced AI model, enhancing capabilities and moving closer to a multi-purpose 'superapp' vision.
Mobile UX
fromTechCrunch
4 weeks ago

WhatsApp can now draft AI-generated responses based on your conversations | TechCrunch

WhatsApp introduces AI-powered features for suggested replies, message drafting, photo touch-ups, and space management, enhancing user experience and privacy.
Python
fromPyImageSearch
3 weeks ago

Autoregressive Model Limits and Multi-Token Prediction in DeepSeek-V3 - PyImageSearch

Multi-Token Prediction (MTP) in DeepSeek-V3 allows simultaneous token forecasting, enhancing training speed and contextual understanding.
#structured-data
Data science
fromAol
2 weeks ago

Demystifying structured data: How to speak an LLM's native language

Structured data is essential for LLMs to accurately interpret and rank online content, enhancing search visibility and user engagement.
fromTechCrunch
4 weeks ago

Cohere launches an open-source voice model specifically for transcription | TechCrunch

Cohere's Transcribe model is designed for tasks like note-taking and speech analysis, supporting 14 languages and optimized for consumer-grade GPUs, making it accessible for self-hosting.
European startups
Venture
fromComputerworld
1 month ago

OpenAI's desktop superapp: The end of ChatGPT as we know it?

The shift in enterprise technology is driven by internal fragmentation and competitive pressure, focusing on workflows rather than conversations.
Data science
fromInfoWorld
3 weeks ago

Why 'curate first, annotate smarter' is reshaping computer vision development

Strategic data selection and curation reduce annotation costs and enhance development productivity in computer vision teams.
Science
fromThe Cipher Brief
1 month ago

Why the U.S. Must Build the Ultimate Multi-Modal Foundation Model

Advanced AI models like AlphaEarth demonstrate pixel-level geospatial intelligence capabilities that must be integrated into U.S. national security frameworks to maintain technological leadership.
Artificial intelligence
fromMedium
3 days ago

Enterprise AI in Practice: 6 Must-Watch Sessions on Scaling Agentic Systems

Enterprise AI is transitioning from experimentation to execution, presenting challenges in governance, scaling, and measurable business impact.
Artificial intelligence
fromnews.bitcoin.com
4 days ago

Nvidia Releases Nemotron 3 Super, a 120B Open AI Model Built for Agentic Workloads

Nvidia launched Nemotron 3 Super, a 120 billion parameter model that significantly reduces AI compute costs and increases throughput.
Data science
fromInfoWorld
3 weeks ago

A GitHub tinkerer teaches Claude to talk less, and that may matter more than it seems

A markdown file can significantly reduce AI output token usage, enhancing efficiency without code changes.
Software development
fromMedium
1 month ago

Inside Dify AI: How RAG, Agents, and LLMOps Work Together in Production

Dify AI provides a unified platform for deploying production language model systems with built-in solutions for data freshness, observability, versioning, and safe deployment across multiple cloud environments.
Data science
fromTechzine Global
4 weeks ago

As AI hits scaling limits, Google smashes the context barrier

TurboQuant significantly reduces KV cache size, enhancing AI model performance and expanding context windows for complex workloads.
Software development
fromInfoQ
1 month ago

The Oil and Water Moment in AI Architecture

Software architecture is transitioning to AI architecture, requiring architects to manage the coexistence of deterministic systems with non-deterministic AI behavior while shifting from tool-centric to intent-centric thinking.
Artificial intelligence
fromTheregister
3 weeks ago

Microsoft shivs OpenAI with new AI models for speech, images

Microsoft launched public preview versions of machine learning models for speech recognition, speech synthesis, and image generation, competing directly with OpenAI.
Data science
fromInfoQ
1 month ago

Google Researchers Propose Bayesian Teaching Method for Large Language Models

Google researchers developed a training method enabling large language models to approximate Bayesian reasoning by learning from optimal Bayesian system predictions, improving belief updates during multi-step interactions.
Python
fromPyImageSearch
2 months ago

TF-IDF vs. Embeddings: From Keywords to Semantic Search - PyImageSearch

Vector databases and embeddings enable semantic search and retrieval-augmented generation by mapping text meaning into geometric vectors for similarity-based retrieval.
Artificial intelligence
fromMedium
1 month ago

Less Compute, More Impact: How Model Quantization Fuels the Next Wave of Agentic AI

Model quantization and architectural optimization can outperform larger models, challenging the belief that more GPUs equal greater intelligence.
Artificial intelligence
fromFast Company
1 month ago

OpenAI's new frontier models mark a huge change in how AI will be built

OpenAI released two frontier models in early March: GPT-5.3 optimized for fast responses and GPT-5.4 optimized for deep analytical work, representing a shift toward specialized AI models.
fromFast Company
2 months ago

Are LTMs the next LLMs? This new type of AI can do what large-language models can't

A major difference between LLMs and LTMs is the type of data they're able to synthesize and use. LLMs use unstructured data: think text, social media posts, emails, etc. LTMs, on the other hand, can extract information or insights from structured data, which could be contained in tables, for instance. Since many enterprises rely on structured data, often contained in spreadsheets, to run their operations, LTMs could have an immediate use case for many organizations.
Artificial intelligence
fromInfoQ
2 months ago

Building Embedding Models for Large-Scale Real-World Applications

What happens under the hood? How is the search engine able to take that simple query, look for images in the billions, trillions of images that are available online? How is it able to find this one or similar photos from all that? Usually, there is an embedding model that is doing this work behind the hood.
Artificial intelligence
Artificial intelligence
fromInfoQ
2 months ago

Building LLMs in Resource-Constrained Environments: A Hands-On Perspective

Prioritize small, resource-efficient models and iterative, human-in-the-loop data creation to build practical, improvable AI under infrastructure and data constraints.
fromFortune
1 month ago

We studied chatbots and language and saw a huge problem: They mean 80% when they say 'likely' but humans hear 65% | Fortune

By comparing how AI models and humans map these words to numerical percentages, we uncovered significant gaps between humans and large language models. While the models do tend to agree with humans on extremes like 'impossible,' they diverge sharply on hedge words like 'maybe.' For example, a model might use the word 'likely' to represent an 80% probability, while a human reader assumes it means closer to 65%.
Artificial intelligence
Artificial intelligence
fromInfoWorld
2 months ago

What is context engineering? And why it's the new AI architecture

Context engineering designs and manages the information, tools, and constraints an LLM receives, enabling scalable, high-signal inputs and improved model outcomes.
Artificial intelligence
fromTechCrunch
2 months ago

Cohere launches a family of open multilingual models | TechCrunch

Cohere launched Tiny Aya open-weight multilingual models supporting 70+ languages, runnable offline on everyday devices with a 3.35B-parameter base and regional variants.
fromTechCrunch
2 months ago

Tiny startup Arcee AI built a 400B open source LLM from scratch to best Meta's Llama | TechCrunch

But tiny 30-person startup Arcee AI disagrees. The company just released a truly and permanently open (Apache license) general-purpose, foundation model called Trinity, and Arcee claims that at 400B parameters, it is among the largest open-source foundation models ever trained and released by a U.S. company. Arcee says Trinity compares to Meta's Llama 4 Maverick 400B, and Z.ai GLM-4.5, a high-performing open-source model from China's Tsinghua University, according to benchmark tests conducted using base models (very little post training).
Artificial intelligence
Artificial intelligence
fromFortune
1 month ago

AI mastered language. The physical world is next | Fortune

Embodied AI advancement requires world modeling and physical understanding, constrained by scarcity of specific training data rather than compute or architecture limitations.
fromSearch Engine Roundtable
2 months ago

Google Expands AI Mode To 53 New Languages

Google has added 53 new languages to AI Mode, which means the AI Mode works in just under 100 languages. This was announced by Nick Fox from Google on X yesterday. Nick Fox said, "Shipping AI Mode to 53 new languages (spoken by more than a billion people globally!)"
Artificial intelligence
#continual-learning
Artificial intelligence
fromInfoWorld
2 months ago

Researchers propose a self-distillation fix for 'catastrophic forgetting' in LLMs

fromInfoQ
2 months ago

Open Responses Specification Enables Unified Agentic LLM Workflows

OpenAI has released Open Responses, an open specification to standardize agentic AI workflows and reduce API fragmentation. Supported by partners like Hugging Face and Vercel and local inference providers, the spec introduces unified standards for agentic loops, reasoning visibility, and internal versus external tool execution. It aims to enable developers to easily switch between proprietary models and open-source models without rewriting integration code.
Artificial intelligence
Artificial intelligence
fromTechzine Global
1 month ago

IBM integrates Deepgram speech AI into watsonx Orchestrate

IBM and Deepgram integrate advanced speech-to-text and text-to-speech capabilities into watsonx Orchestrate to enable organizations to build conversational AI agents and automate operations.
fromNature
2 months ago

Multimodal learning with next-token prediction for large multimodal models - Nature

Since AlexNet (ref. 5), deep learning has replaced heuristic hand-crafted features by unifying feature learning with deep neural networks. Later, Transformers (ref. 6) and GPT-3 (ref. 1) further advanced sequence learning at scale, unifying structured tasks such as natural language processing. However, multimodal learning, spanning modalities such as images, video and text, has remained fragmented, relying on separate diffusion-based generation or compositional vision-language pipelines with many hand-crafted designs.
Artificial intelligence