#text-recognition

Data science
fromTheregister
23 hours ago

LLMs fuel new generation of natural language query systems

Text-to-SQL tools may simplify data queries but can misinterpret business users' intentions, raising caution for organizations.
US politics
fromFuturism
14 hours ago

Leak Shows ICE Planning to Use Facial Recognition Glasses to Identify Targets in Real Time

ICE plans to use facial recognition glasses for real-time data collection on Americans, raising concerns about privacy and surveillance.
Photography
fromAxios
1 day ago

Hands-on with ChatGPT's powerful new image engine

ChatGPT Images 2.0 offers personalized image creation with various aspect ratios and modes, enhancing user experience for both free and paid subscribers.
Privacy professionals
fromEngadget
1 day ago

AI company deletes the 3 million OKCupid photos it used for facial recognition training

Clarifai deleted 3 million profile photos from OkCupid after a settlement with the FTC for violating privacy policies.
#google-photos
Photography
fromTechRepublic
1 day ago

Google Photos Rolls Out New AI-Powered Portrait Editing Features

Google Photos introduces AI-powered touch-up tools for easy and subtle portrait enhancements.
Mobile UX
fromThe Verge
2 days ago

Google Photos adds subtle touch-up tools for faces

Google Photos introduces new touch-up tools for subtle enhancements to faces in photos.
Photography
fromPetaPixel
2 days ago

Google Photos Can Now Smooth Skin and Whiten Teeth Via New 'Touch-Up' Menu

Google Photos introduces a new feature for subtle photo enhancements, allowing users to improve skin texture and whiten teeth easily.
#ai
Python
fromPycon
2 weeks ago

Python and the Future of AI: Agents, Inference, and Edge AI

AI tools are increasingly integrated into development, with a dedicated track at PyCon US focusing on their future and practical applications.
Artificial intelligence
fromWIRED
1 day ago

OpenAI Beefs Up ChatGPT's Image Generation Model

OpenAI launched ChatGPT Images 2.0, an advanced image generation model capable of producing multiple images from a single prompt and supporting various languages.
Graphic design
fromThe Verge
1 day ago

OpenAI's updated image generator can now pull information from the web

OpenAI's ChatGPT Images 2.0 introduces advanced image generation with web search capabilities and improved detail preservation.
Data science
fromFast Company
2 days ago

Your AI can't read an invoice. That should worry you more than whether it can pass a math exam

Advanced AI excels in structured reasoning tasks but struggles with messy, real-world data extraction like invoices.
Typography
fromMedium
3 weeks ago

AI is rewriting the rules. Language is following.

The word 'delve' has surged in usage due to AI's influence on language and communication patterns.
#openai
Privacy technologies
fromTNW | Artificial-Intelligence
2 days ago

OpenAI Codex Chronicle captures your Mac screen to build AI context, with cloud processing and no encryption

Chronicle captures screenshots for AI context, prioritizing cloud processing over local privacy, and requires a Pro subscription and Apple Silicon.
Software development
fromThe Verge
6 days ago

OpenAI's big Codex update is a direct shot at Anthropic's Claude Code

OpenAI updates Codex to enhance its capabilities, including desktop app operation, image generation, and memory features for improved user experience.
UX design
fromMedium
2 days ago

The web trained AI to deceive. Now designers have to untrain it.

LLMs replicate UX dark patterns from the web, leading to deceptive design practices in generated content.
Digital life
fromTechCrunch
2 days ago

It's not just one thing - it's another thing | TechCrunch

The phrase 'It's not just this - it's that' has dramatically increased in corporate communications, indicating a trend in AI-generated writing.
Node JS
fromRaymondcamden
6 days ago

Summarizing Docs with Built-in AI

On-device summarization of various document types, including Office formats, is achievable using libraries like officeParser and Chrome's Summary API.
DevOps
fromTechzine Global
6 days ago

Claude Opus 4.7 is no Mythos, and that's a good thing

Claude Opus 4.7 improves software engineering, vision, and agentic tasks, but is not the risky Mythos model Anthropic refrains from fully releasing.
Apple
fromEngadget
6 days ago

Perplexity brings its Personal Computer AI assistant to Mac

Perplexity has launched Personal Computer for Mac, software that orchestrates multiple models to manage tasks and workflows.
Online learning
fromeLearning
1 week ago

The Role of AI in Modern Technical Training Programs Across Industries - eLearning

Technical training is essential for applying skills in real work situations, enhanced by AI for better design and delivery.
#google
Digital life
fromArs Technica
6 days ago

Gemini can now create personalized AI images by digging around in Google Photos

Google's new feature allows image generation from Google Photos, but it doesn't retain data for training AI.
Mobile UX
fromTNW | Artificial-Intelligence
2 weeks ago

Google quietly releases free offline AI dictation app for iPhone | TNW

Google AI Edge Eloquent is a free, offline voice dictation app that transcribes speech in real time and polishes text without internet access.
Artificial intelligence
fromTechRepublic
1 day ago

Google AI Overviews: Analysis Suggests 600 Million Inaccurate Daily Answers

Google's AI Overview feature generates hundreds of millions of incorrect answers daily, with a significant portion of accurate responses being ungrounded.
fromPyImageSearch
2 weeks ago

Agentic AI Vision System: Object Segmentation with SAM 3 and Qwen - PyImageSearch

Agentic AI systems are designed to interpret user requests, select the appropriate models or tools, evaluate intermediate outputs, and refine their decisions over multiple steps. This iterative reasoning loop enhances the segmentation process significantly.
Python
Social media marketing
fromTechCrunch
2 weeks ago

X is rolling out automatic translation and photo editing powered by Grok | TechCrunch

X introduces automatic translation and a new photo editor powered by Grok models to enhance user experience.
Node JS
fromRaymondcamden
1 week ago

Testing OCR with Chrome Built-in AI

Chrome's built-in AI can perform OCR on images, enabling text extraction and bounding box identification.
Podcast
fromFast Company
2 weeks ago

3 AI tools that make keeping up with the news easier

Huxe is a personalized audio app that generates custom podcasts based on user interests, calendar, and email.
Software development
fromZDNET
6 days ago

OpenAI's Codex Desktop can run your computer now - and has its own browser

Codex Desktop evolves from coding to broader productivity workflows while still targeting developers.
#meta
Artificial intelligence
fromFortune
1 day ago

Meta will start tracking employees' screens and keystrokes to train AI tools | Fortune

Meta is implementing tracking software on employee computers to gather data for AI training.
Photography
fromThe Verge
5 days ago

This charming gadget writes bad AI poetry

The Poetry Camera generates AI poems from photos instead of images, combining playful design with a frustrating user experience.
Software development
fromInfoWorld
1 week ago

Mastering the dull reality of sexy AI

The gap in enterprise AI lies in building effective systems for retrieval, evaluation, memory, and governance, not just access to models.
Marketing tech
fromWashington City Paper
3 weeks ago

Top 6 AI Detector Tools for Editors, Educators, and Content Teams

AI detection is essential for maintaining content integrity as patterns of AI-generated content become more prevalent and indistinguishable from human writing.
Digital life
fromComputerworld
1 week ago

Google's new AI app is a glimpse of the future

Offline AI tools like Google's AI Edge Eloquent provide essential functionality for users with limited connectivity.
#ai-tools
Business intelligence
fromeLearning Industry
3 weeks ago

How Many AI Tools Are There? A Data-Backed Look At The Expanding AI Landscape

The AI tools ecosystem is rapidly expanding, with thousands of tools available across various categories, creating both opportunities and complexities for businesses.
fromFast Company
4 weeks ago
Artificial intelligence

5 AI projects every solo business owner should try

AI tools provide solopreneurs with dedicated project workspaces to enhance business efficiency and decision-making.
Careers
fromFast Company
3 weeks ago

Using AI to find a job? Here are the do's and don'ts

Job seekers face challenges in a low-hire market, but AI can enhance applications and help personalize approaches to potential employers.
#artificial-intelligence
Digital life
fromFast Company
3 weeks ago

The future of AI is already in your hands

AI must integrate into smartphones as a core system, emphasizing judgment over mere capability to build user trust.
Productivity
fromFast Company
4 weeks ago

Writer wants to be the go-to AI tool kit for the enterprise

Writer offers AI tools for enterprises, enabling non-engineers to automate tasks without IT support.
Python
fromBusiness Matters
4 weeks ago

Building AI-powered visual solutions: How Python forms the foundation for advanced Computer Vision use cases

Python is the preferred programming language for developing computer vision technologies due to its simplicity, flexibility, and extensive libraries.
Data science
fromInfoWorld
3 weeks ago

Why 'curate first, annotate smarter' is reshaping computer vision development

Strategic data selection and curation reduce annotation costs and enhance development productivity in computer vision teams.
Artificial intelligence
fromInfoQ
3 days ago

Designing Memory for AI Agents: Inside Linkedin's Cognitive Memory Agent

LinkedIn's Cognitive Memory Agent enables context-aware AI systems that retain knowledge across interactions, enhancing personalization and continuity.
#ai-agents
Business intelligence
fromZDNET
1 month ago

4 tips for building better AI agents that your business can trust

AI agents are transforming professional roles, requiring companies to adopt and integrate these technologies effectively.
Artificial intelligence
fromArs Technica
1 month ago

Perplexity's "Personal Computer" brings its AI agents to the, uh, Personal Computer

Perplexity launches Personal Computer, a desktop agent tool enabling AI to access local files and apps to complete user-defined objectives through natural language descriptions.
Mobile UX
fromTechCrunch
3 weeks ago

WhatsApp can now draft AI-generated responses based on your conversations | TechCrunch

WhatsApp introduces AI-powered features for suggested replies, message drafting, photo touch-ups, and space management, enhancing user experience and privacy.
Artificial intelligence
fromTechCrunch
6 days ago

OpenAI takes aim at Anthropic with beefed-up Codex that gives it more power over your desktop | TechCrunch

OpenAI's Codex has been revamped with new features, including background operation capabilities, to compete with Anthropic's Claude Code.
Artificial intelligence
fromTechCrunch
6 days ago

Physical Intelligence, a hot robotics startup, says its new robot brain can figure out tasks it was never taught | TechCrunch

Physical Intelligence's π0.7 model enables robots to perform unfamiliar tasks through compositional generalization, marking a significant advancement in robotic AI capabilities.
Business intelligence
fromComputerWeekly.com
1 month ago

AI tools offer 'near-real-time' analysis of data from seized mobile phones and computers | Computer Weekly

Cellebrite's AI-powered Guardian Investigate platform enables police to rapidly analyze mobile device data, discover connections between datasets, track phone locations over time, and construct event timelines for major crime investigations.
Apple
fromFast Company
1 month ago

Photoshop's new AI assistant makes it easier than ever to edit images

Adobe launches an AI assistant for Photoshop Web and Mobile that enables intuitive photo editing through prompts, voice commands, and touch navigation, with results integrable into full Adobe creative workflows.
fromTNW | Insider
1 month ago

Dominate AI search in 2026

Buyers no longer open ten tabs, skim through blog posts, and slowly form an opinion over weeks. Instead, they ask a single question to an AI system and receive a shortlist in return, usually two or three companies that feel familiar, credible, and safe enough to justify internally. That shortlist often becomes the entire market in the buyer's mind.
Marketing
Web development
fromRaymondcamden
2 months ago

Interrogate Your PDFs with Chrome AI

Create an in-browser PDF question-and-answer system using PDF.js and Chrome's on-device Prompt API with client-side parsing and feature detection.
Artificial intelligence
fromTheregister
2 weeks ago

Microsoft shivs OpenAI with new AI models for speech, images

Microsoft launched public preview versions of machine learning models for speech recognition, speech synthesis, and image generation, competing directly with OpenAI.
fromFuturism
1 month ago

Startup Generates Caring Letters to Your Friends Using AI, Handwrites Them Using Robot Pen

In an age where we are all drowning in electronic communication, handwritten notes really stand out. The company's website brags that its robo-scrawl is virtually indistinguishable from human writing, produced with unmatched speed, quality, and realism through a large language model that generates content and a proprietary robot that inks it onto stationery.
Writing
E-Commerce
fromNewsday
2 months ago

Online shopping could be AI's next victim

Autonomous AI chatbots will increasingly select and purchase goods, requiring retailers to optimize discovery and profitability for bot-driven commerce.
Mobile UX
fromEngadget
1 month ago

Nothing updates its AI app with semantic search and a new way to track events

Nothing's updated Essential Space app now recognizes events from images and supports semantic search, making it easier to organize and find screenshots, voice recordings, and other digital content on 2025 and 2026 Nothing phones.
Psychology
fromPsychology Today
1 month ago

Conversational AI and Emotional Intelligence

Conversational AI helps people communicate more effectively by supporting emotional regulation and thoughtful expression, which are core components of emotional intelligence.
Software development
fromBusiness Matters
1 month ago

AI Document Processing Software for UK SMEs

UK small business owners waste 120 hours annually on document admin; AI processing software eliminates errors, reduces costs, and frees staff for revenue-generating work.
Digital life
fromwww.theguardian.com
2 months ago

Tell us: have you ever used AI to navigate everyday life and social relationships?

People use chatbots to handle social interactions and major life decisions, such as drafting sensitive messages or seeking relationship and job advice; secure, anonymous submissions are invited.
Mobile UX
fromEngadget
1 month ago

Google's Circle to Search can now identify multiple objects in an image

Google's updated Circle to Search now identifies multiple objects simultaneously using Gemini 3, enabling comprehensive shopping searches and complex relationship analysis between image elements.
fromFortune
1 month ago

We studied chatbots and language and saw a huge problem: They mean 80% when they say 'likely' but humans hear 65% | Fortune

By comparing how AI models and humans map these words to numerical percentages, we uncovered significant gaps between humans and large language models. While the models do tend to agree with humans on extremes like 'impossible,' they diverge sharply on hedge words like 'maybe.' For example, a model might use the word 'likely' to represent an 80% probability, while a human reader assumes it means closer to 65%.
Artificial intelligence
Artificial intelligence
fromMedium
2 months ago

Extracting AI-Ready Data From Organizational Documents

Poor document extraction corrupts retrieval; preserving document structure at ingestion produces reliable embeddings and trustworthy RAG outputs.
fromThe Verge
2 months ago

ChatGPT's deep research tool adds a built-in document viewer so you can read its reports

OpenAI is updating ChatGPT's deep research tool with a full-screen viewer that you can use to scroll through and navigate to specific areas of its AI-generated reports. As shown in a video shared by OpenAI, the built-in viewer allows you to open ChatGPT's reports in a window separate from your chat, while showing a table of contents on the left side of the screen, and a list of sources on the right.
Artificial intelligence
fromBusiness Insider
2 months ago

Google is blurring the line between search and chatbot

Google Search's AI makeover continues. The company said that, starting today, mobile users will be able to ask follow-up questions to AI Overviews, Google's AI-generated search summaries. Doing so will launch users into a back-and-forth with AI Mode, its more conversational take on search that already lives in a separate tab on the search page. After Google's AI Overviews awkwardly stumbled out the gate in 2024 (pizza glue, anyone?) they've gradually become a staple of the Search experience.
Artificial intelligence
fromEntrepreneur
1 month ago

This AI Assistant Runs Entirely on Your Computer With No Monthly Fees

It's no secret that businesses are increasingly concerned about artificial intelligence (AI) privacy and escalating subscription costs. Many entrepreneurs find themselves locked into expensive monthly AI services while worrying about where their sensitive business data ends up. Pansophy is an AI desktop assistant that offers a different approach entirely, and a lifetime subscription is available now for only $59.97 (reg. $199).
Artificial intelligence