#single-shot

[ follow ]
#ai
fromTechCrunch
3 weeks ago
Silicon Valley

Startup Gimlet Labs is solving the AI inference bottleneck in a surprisingly elegant way | TechCrunch

Artificial intelligence
fromTheregister
1 day ago

Claude is getting worse, according to Claude

Anthropic's Claude is facing significant issues with service quality and reliability, leading to customer dissatisfaction and increased complaints.
Tech industry
fromComputerworld
2 weeks ago

HP will cram a 20-billion-parameter AI model into new AI PCs

HP is launching AI features in its Workforce Experience Platform to enhance remote device management and automate tasks on enterprise PCs.
Silicon Valley
fromTechCrunch
3 weeks ago

Startup Gimlet Labs is solving the AI inference bottleneck in a surprisingly elegant way | TechCrunch

Gimlet Labs raised $80 million to enhance AI inference efficiency across diverse hardware types.
fromPyImageSearch
1 week ago

Agentic AI Vision System: Object Segmentation with SAM 3 and Qwen - PyImageSearch

Agentic AI systems are designed to interpret user requests, select the appropriate models or tools, evaluate intermediate outputs, and refine their decisions over multiple steps. This iterative reasoning loop enhances the segmentation process significantly.
Python
#large-language-models
Data science
fromMedium
5 days ago

The Top 10 LLM Training Datasets for 2026

Large language models require extensive training data, and practitioners can utilize ten leading public datasets for effective training and fine-tuning.
Data science
fromMedium
5 days ago

The Top 10 LLM Training Datasets for 2026

Large language models require extensive training data, and practitioners can utilize ten leading public datasets for effective training and fine-tuning.
Podcast
fromFast Company
1 week ago

3 AI tools that make keeping up with the news easier

Huxe is a personalized audio app that generates custom podcasts based on user interests, calendar, and email.
Artificial intelligence
fromFuturism
2 days ago

OpenAI's Latest Thing It's Bragging About Is Actually Kind of Sad

The AI industry faces significant delays and cancellations in data center projects, impacting ambitious computing capacity goals.
Online learning
fromeLearning Industry
1 week ago

From Manual To Intelligent: How AI Automation Is Reshaping L&D Operations

AI automation can alleviate operational burdens on L&D teams, allowing them to focus on strategic tasks and improve learning quality.
Python
fromEfficientcoder
4 days ago

Build Your Own AI Meme Matcher: A Beginner's Guide to Computer Vision with Python

Computer Vision enables real-time facial recognition and meme matching using Object-Oriented Programming for clean and organized code.
#ai-agents
Data science
fromMedium
1 week ago

15 Datasets for Training and Evaluating AI Agents

Datasets for training and evaluating AI agents are essential for building reliable agentic systems and preventing execution failures.
fromTechCrunch
1 month ago
Artificial intelligence

Perplexity's new Computer is another bet that users need many AI models | TechCrunch

Perplexity launches Computer, an agentic tool for Max subscribers that unifies AI capabilities to execute complex workflows independently using 19 models and create subagents.
fromZDNET
2 months ago
Artificial intelligence

Is your AI agent up to the task? 3 ways to determine when to delegate

AI agents should be managed as an adjunct workforce, using management skills to decide which tasks to automate versus retain for humans.
Data science
fromMedium
1 week ago

15 Datasets for Training and Evaluating AI Agents

Datasets for training and evaluating AI agents are essential for building reliable agentic systems and preventing execution failures.
fromTechCrunch
1 month ago
Artificial intelligence

Perplexity's new Computer is another bet that users need many AI models | TechCrunch

fromZDNET
2 months ago
Artificial intelligence

Is your AI agent up to the task? 3 ways to determine when to delegate

Software development
fromArs Technica
1 week ago

Running local models on Macs gets faster with Ollama's MLX support

Ollama enhances local language model performance on Apple Silicon with MLX support and improved caching, catering to growing interest in local models.
Python
fromThe JetBrains Blog
1 week ago

How to Train Your First TensorFlow Model in PyCharm | The PyCharm Blog

TensorFlow is an open-source framework for building and deploying machine learning models using tensors and high-level libraries like Keras.
fromArs Technica
2 weeks ago

Google's TurboQuant AI-compression algorithm can reduce LLM memory usage by 6x

PolarQuant is doing most of the compression, but the second step cleans up the rough spots. Google proposes smoothing that out with a technique called Quantized Johnson-Lindenstrauss (QJL).
Roam Research
#data-annotation
Data science
fromInfoWorld
1 week ago

Why 'curate first, annotate smarter' is reshaping computer vision development

Strategic data selection and curation reduce annotation costs and enhance development productivity in computer vision teams.
fromForbes
1 month ago
Artificial intelligence

Beyond The Hype: The Messy Reality Of Training AI

Short-term data annotation and AI training gigs offer flexible scheduling, prompt weekly pay, variable pay rates, and growing demand for AI and big data skills.
Data science
fromInfoWorld
1 week ago

Why 'curate first, annotate smarter' is reshaping computer vision development

Strategic data selection and curation reduce annotation costs and enhance development productivity in computer vision teams.
Artificial intelligence
fromMedium
1 week ago

Hindsight: The Future of AI Agent Memory Beyond Vector Databases

Hindsight introduces a new AI memory system that enables learning from experiences rather than just recalling past information.
DevOps
fromInfoWorld
3 weeks ago

An architecture for engineering AI context

AI systems must intelligently manage context to ensure accuracy and reliability in real applications.
#ai-development
fromInfoWorld
2 weeks ago
Artificial intelligence

Final training of AI models is a fraction of their total cost

Developing AI models incurs significant costs, with most expenditures on scaling and research rather than final training runs.
Science
fromThe Cipher Brief
3 weeks ago

Why the U.S. Must Build the Ultimate Multi-Modal Foundation Model

Advanced AI models like AlphaEarth demonstrate pixel-level geospatial intelligence capabilities that must be integrated into U.S. national security frameworks to maintain technological leadership.
#deepseek-v3
Artificial intelligence
fromFortune
2 weeks ago

Is AI's visual understanding mostly a 'mirage'? New research suggests so. | Fortune

Anthropic faces significant cybersecurity risks following multiple sensitive data leaks related to its new AI model, Mythos.
Data science
fromFast Company
2 weeks ago

A top AI researcher explains the limitations of current models

Francois Chollet's ARC-AGI-3 benchmark reveals AI's limitations in navigating novel situations compared to human intelligence.
Python
fromBusiness Matters
2 weeks ago

Building AI-powered visual solutions: How Python forms the foundation for advanced Computer Vision use cases

Python is the preferred programming language for developing computer vision technologies due to its simplicity, flexibility, and extensive libraries.
Software development
fromInfoWorld
4 weeks ago

How to build an AI agent that actually works

Successful agents embed intelligence within structured workflows at specific decision points rather than operating autonomously, combining deterministic processes with reasoning models where judgment is needed.
#ai-agent-evaluation
fromInfoQ
4 weeks ago
Software development

Evaluating AI Agents in Practice: Benchmarks, Frameworks, and Lessons Learned

Software development
fromInfoQ
4 weeks ago

Evaluating AI Agents in Practice: Benchmarks, Frameworks, and Lessons Learned

AI agents require system-level evaluation across multiple turns measuring task success, tool reliability, and real-world behavior rather than single-turn NLP benchmarks like BLEU and ROUGE scores.
Artificial intelligence
fromInfoWorld
3 weeks ago

Why AI evals are the new necessity for building effective AI agents

User trust in AI agents depends on interaction-layer evaluation measuring reliability and predictability, not just model performance benchmarks.
Data science
fromInfoWorld
3 weeks ago

The 'toggle-away' efficiencies: Cutting AI costs inside the training loop

Simple optimizations can significantly reduce AI training costs and carbon emissions without needing the latest GPUs.
Tech industry
fromWIRED
1 month ago

Meta Developed Four New Chips to Power Its AI and Recommendation Systems

Meta developed four new AI chips (MTIA 300, 400, 450, 500) for powering generative AI and content ranking, with one in production and three shipping between 2027.
#ai-efficiency
Artificial intelligence
fromMedium
3 weeks ago

Less Compute, More Impact: How Model Quantization Fuels the Next Wave of Agentic AI

Model quantization and architectural optimization can outperform larger models, challenging the belief that more GPUs equal greater intelligence.
Online learning
fromeLearning Industry
1 month ago

How Do AI-Driven Learning Platforms Enhance Workforce Performance?

AI-driven learning platforms improve employee productivity and business outcomes by automating personalized learning paths aligned with performance goals.
#circle-to-search
Artificial intelligence
fromInfoWorld
1 month ago

Nvidia launches Nemotron 3 Super to power enterprise AI agents

Nemotron 3 Super's hybrid architecture combining Mamba and Transformer technologies enables enterprises to run complex AI agents more efficiently with lower costs and faster execution on existing infrastructure.
#gemini-31-pro
fromInfoWorld
1 month ago

Neoclouds run AI cheaper and better

By neoclouds, I'm referring to GPU-centric, purpose-built cloud services that focus primarily on AI training and inference rather than on the sprawling catalog of general-purpose services that hyperscalers offer. In many cases, these platforms deliver better price-performance for AI workloads because they're engineered for specific goals: keeping expensive accelerators highly utilized, minimizing platform overhead, and providing a clean path from model development to deployment.
Artificial intelligence
Python
fromPyImageSearch
1 month ago

SAM 3 for Video: Concept-Aware Segmentation and Object Tracking - PyImageSearch

SAM3 extends beyond static image segmentation to video by maintaining streaming memory and tracking state, enabling unified detection, segmentation, and tracking across frames while preserving object identity over time.
fromYanko Design - Modern Industrial Design News
1 month ago

Nvidia wants robots to learn before executing tasks by watching 44,000 hours of human video - Yanko Design

The robotics industry, for now, faces the biggest challenge in teaching robots to operate in the messy real world. The unstructured environment means robots need massive amounts of data to learn. Gathering and structuring that data is the costliest thing in robotics and perhaps the biggest impediment, slowing the entire development process.
Artificial intelligence
Python
fromPyImageSearch
2 months ago

Grounded SAM 2: From Open-Set Detection to Segmentation and Tracking - PyImageSearch

Grounded SAM 2 extends Grounding DINO by adding pixel-level segmentation and video-aware tracking to convert language-driven detections into precise, persistent object masks.
Artificial intelligence
fromwww.socialmediatoday.com
1 month ago

Google introduces next iteration of AI image generation model

Google launched Nano Banana 2, a unified AI image generation model combining previous capabilities with advanced world knowledge, real-time web search integration, and enhanced control features for faster, more accurate visual creation.
Artificial intelligence
fromTheregister
1 month ago

AI models get better at math but still get low marks

Current LLMs struggle with mathematical accuracy, with even top performers scoring C-grade equivalent on practical math benchmarks, though recent versions show modest improvements.
Artificial intelligence
fromHackernoon
2 months ago

This "Flash" AI Model Is Fast and Dangerous at Math-Here's What It Can Do | HackerNoon

GLM-4.7-Flash is a 30-billion-parameter mixture-of-experts model offering strong performance for lightweight deployment.
fromNature
2 months ago

Multimodal learning with next-token prediction for large multimodal models - Nature

Since AlexNet5, deep learning has replaced heuristic hand-crafted features by unifying feature learning with deep neural networks. Later, Transformers6 and GPT-3 (ref. 1) further advanced sequence learning at scale, unifying structured tasks such as natural language processing. However, multimodal learning, spanning modalities such as images, video and text, has remained fragmented, relying on separate diffusion-based generation or compositional vision-language pipelines with many hand-crafted designs.
Artificial intelligence
Artificial intelligence
fromAxios
2 months ago

Models that improve on their own are AI's next big thing

Recursive self-improvement lets AI models keep learning after training, accelerating progress while increasing risks, reducing visibility, and complicating safety and governance.
Artificial intelligence
fromInfoQ
2 months ago

Foundation Models for Ranking: Challenges, Successes, and Lessons Learned

Large-scale search and recommendation systems use two-stage retrieval and ranking pipelines to efficiently serve personalized results for hundreds of millions of users and items.
Artificial intelligence
fromTechCrunch
1 month ago

Running AI models is turning into a memory game | TechCrunch

Rising DRAM prices and sophisticated prompt-caching orchestration make memory management a critical cost and performance factor for large-scale AI deployments.
fromInfoQ
2 months ago

Building Embedding Models for Large-Scale Real-World Applications

What happens under the hood? How is the search engine able to take that simple query, look for images in the billions, trillions of images that are available online? How is it able to find this one or similar photos from all that? Usually, there is an embedding model that is doing this work behind the hood.
Artificial intelligence
Artificial intelligence
fromTechzine Global
2 months ago

OpenAI seeks faster alternatives to Nvidia chips

OpenAI seeks alternative inference chips with larger on-chip SRAM to improve response speed for coding and AI-to-AI communication, aiming for about 10% of future inference capacity.
Artificial intelligence
from24/7 Wall St.
1 month ago

NVIDIA Cements Its Role as the Backbone of AI Infrastructure

NVIDIA's networking revenue grew 162% year-over-year to $8.2 billion, nearly tripling GPU growth, signaling a shift from chip seller to integrated infrastructure provider selling complete AI data center systems.
fromCointelegraph
2 months ago

What Role Is Left for Decentralized GPU Networks in AI?

What we are beginning to see is that many open-source and other models are becoming compact enough and sufficiently optimized to run very efficiently on consumer GPUs,
Artificial intelligence
fromInfoWorld
2 months ago

Researchers propose a self-distillation fix for 'catastrophic forgetting' in LLMs

"To enable the next generation of foundation models, we must solve the problem of continual learning: enabling AI systems to keep learning and improving over time, similar to how humans accumulate knowledge and refine skills throughout their lives," the researchers noted. Reinforcement learning offers a way to train on data generated by the model's own policy, which reduces forgetting. However, it typically requires explicit reward functions, which are not easy in every situation.
Artificial intelligence
Artificial intelligence
fromArs Technica
2 months ago

OpenAI sidesteps Nvidia with unusually fast coding model on plate-sized chips

Cerebras' Wafer Scale Engine enables high token throughput while OpenAI diversifies hardware beyond Nvidia amid fast-paced coding model competition.
Artificial intelligence
fromInfoQ
2 months ago

Why Most Machine Learning Projects Fail to Reach Production

Most ML projects fail to reach production because of problem choice, data/labeling issues, model-to-product gaps, offline-online mismatches, and non-technical blockers.
Artificial intelligence
fromInfoQ
2 months ago

Building LLMs in Resource-Constrained Environments: A Hands-On Perspective

Prioritize small, resource-efficient models and iterative, human-in-the-loop data creation to build practical, improvable AI under infrastructure and data constraints.
fromTechCrunch
2 months ago

Quadric rides the shift from cloud AI to on-device inference - and it's paying off | TechCrunch

The company, which is based in San Francisco and has an office in Pune, India, is targeting up to $35 million this year as it builds a royalty-driven on-device AI business. That growth has buoyed the company, which now has post-money valuation of between $270 million and $300 million, up from around $100 million in its 2022 Series B, Kheterpal said.
Artificial intelligence
Artificial intelligence
fromTESLARATI
2 months ago

Elon Musk's xAI brings 1GW Colossus 2 AI training cluster online

xAI's Colossus 2 is the world's first gigawatt-scale AI training cluster and will expand to 1.5 GW in April, targeting roughly 2 GW.
Artificial intelligence
fromInfoWorld
2 months ago

What is context engineering? And why it's the new AI architecture

Context engineering designs and manages the information, tools, and constraints an LLM receives, enabling scalable, high-signal inputs and improved model outcomes.
Artificial intelligence
fromeLearning Industry
2 months ago

Artificial Intelligence In Transportation Training And Education

AI enables individualized transportation training by evaluating trainee performance, tailoring instruction, simulating real scenarios, and measuring performance for targeted improvement.
fromenglish.elpais.com
2 months ago

How does artificial intelligence think? The big surprise is that it intuits'

Each of these achievements would have been a remarkable breakthrough on its own. Solving them all with a single technique is like discovering a master key that unlocks every door at once. Why now? Three pieces converged: algorithms, computing power, and massive amounts of data. We can even put faces to them, because behind each element is a person who took a gamble.
Artificial intelligence
Artificial intelligence
fromInfoQ
2 months ago

Autonomous Big Data Optimization: Multi-Agent Reinforcement Learning to Achieve Self-Tuning Apache Spark

A Q-learning agent autonomously learns and generalizes optimal Spark configurations by discretizing dataset features and combining with Adaptive Query Execution for superior performance.
Artificial intelligence
fromMail Online
1 month ago

Can you tell the difference between real and AI-generated people?

People are overconfident in their ability to distinguish AI-generated faces from real ones and perform only slightly better than chance.
Artificial intelligence
fromInfoWorld
2 months ago

Edge AI: The future of AI inference is smarter local compute

Edge AI shifts computation from cloud to devices, enabling low-latency, cost-efficient, and privacy-preserving AI inference while facing performance and ecosystem challenges.
Artificial intelligence
fromFast Company
1 month ago

AI's biggest problem isn't intelligence. It's implementation

AI adoption is uneven, yielding clear efficiency gains in some functions yet producing limited measurable profit impacts across most large companies.
Artificial intelligence
fromTheregister
2 months ago

Robotics is forcing a fundamental rethink of AI compute

Physical AI requires purpose-built infrastructure for large-scale simulation, data collection, training, and deployment because cloud limitations hinder reliable scaling.
[ Load more ]