#voice-translate

#openai
#ai
UX design
fromYannglt
1 week ago

AI and the Rosetta Stone

AI enhances the translation between design and engineering, increasing speed while challenging the preservation of meaning.
Artificial intelligence
fromEngadget
3 days ago

Microsoft's research assistant can now use multiple AI models simultaneously

The upgraded Researcher tool combines ChatGPT and Claude models for improved research quality in Microsoft 365 Copilot.
Artificial intelligence
fromFuturism
6 days ago

Alarming Study Finds That Most People Just Do What ChatGPT Tells Them, Even If It's Totally Wrong

AI chatbots often provide incorrect answers, yet users frequently trust their outputs, demonstrating a phenomenon called 'cognitive surrender'.
Typography
fromMedium
1 day ago

AI is rewriting the rules. Language is following.

The word 'delve' has surged in usage due to AI's influence on language and communication patterns.
European startups
fromTechCrunch
1 week ago

Mistral releases a new open-source model for speech generation | TechCrunch

Mistral launched Voxtral TTS, an open-source text-to-speech model for voice AI assistants and enterprise applications, supporting nine languages.
Software development
fromZDNET
23 hours ago

I built two apps with just my voice and a mouse - are IDEs already obsolete?

AI coding transforms development by replacing traditional editing and debugging with instructive guidance.
Marketing tech
fromThe Verge
23 hours ago

Microsoft's new 'superintelligence' game plan is all about business

Microsoft's Mustafa Suleyman focuses on achieving superintelligence to enhance business productivity through AI advancements.
fromThe Conversation
1 day ago

AI's fluency in other languages hides a Western worldview that can mislead users, a scholar of Indonesian society explains

The response was in Indonesian but shaped by values that centered individual autonomy over the consensus-building, social harmony and collective family dynamics that tend to matter more in Indonesian social life.
Philosophy
Gadgets
fromTechCrunch
3 days ago

Speechify's Windows app uses local models for transcription and dictation | TechCrunch

Speechify launched a Windows app for dictation and reading aloud, processing voice entirely on-device for enhanced user experience.
#ai-agents
Python
fromTalkpython
1 day ago

Deep Agents: LangChain's SDK for Agents That Plan and Delegate

Deep Agents framework enables building advanced AI agents using Python functions and middleware, enhancing capabilities beyond standard LLMs.
fromMedium
4 days ago
Software development

A human approach to Agentic AI. One person. One text file. Five agents.

A soft-agent team of AI assists in book creation and management without requiring coding skills.
Data science
fromInfoWorld
3 days ago

A GitHub tinkerer teaches Claude to talk less, and that may matter more than it seems

A markdown file can significantly reduce AI output token usage, enhancing efficiency without code changes.
Online learning
fromwww.businessinsider.com
2 days ago

Inside the OpenAI project where freelancers train ChatGPT on everything from farming to commercial flying

Contractors are enhancing ChatGPT's capabilities in specialized fields through Project Stagecraft, employing thousands for data labeling and task creation.
fromwww.npr.org
4 days ago

China's chatbot industry is fiercely competing for customers. Cue the freebies

The competitive landscape among AI apps in China is fierce. Companies have been dumping money into the market to try to win customers and show them how AI is useful in everyday life, in particular, for buying stuff.
US news
fromTechCrunch
1 week ago

Cohere launches an open-source voice model specifically for transcription | TechCrunch

Cohere's Transcribe model is designed for tasks like note-taking and speech analysis, supporting 14 languages and optimized for consumer-grade GPUs, making it accessible for self-hosting.
European startups
#google
Mobile UX
fromThe Verge
1 week ago

Google's 'live' AI search assistant can handle conversations in dozens more languages

Google expands Search Live, allowing voice and camera searches in over 200 countries with a new AI model for improved responses.
Mobile UX
fromTechCrunch
1 week ago

Google Translate's real-time headphone translations feature expands to iOS and more countries | TechCrunch

Google's Live Translate feature expands to iOS and more countries, enabling real-time translations in headphones for over 70 languages.
Artificial intelligence
fromTheregister
17 hours ago

Microsoft shivs OpenAI with new AI models for speech, images

Microsoft launched public preview versions of machine learning models for speech recognition, speech synthesis, and image generation, competing directly with OpenAI.
Apple
fromTechRepublic
1 day ago

Apple Prepares Siri for Multi-Step AI Requests in iOS 27

Apple is developing a Siri upgrade to handle multiple requests in one command, enhancing its functionality in iOS 27.
fromwww.theguardian.com
3 days ago

Don't stop at Duolingo, set realistic goals, balance skills: how to start learning a new language

Learning a new language not only makes you look cool, it also allows you to familiarize yourself with another culture, connect with new people and enjoy a wider variety of art and media.
Online learning
Digital life
fromFast Company
1 week ago

Is AI killing the human voice in writing?

Predictive language technologies challenge individual expression by influencing how writers generate and complete their thoughts.
#voice-ai
fromTechCrunch
1 month ago
Artificial intelligence

ElevenLabs CEO: Voice is the next interface for AI | TechCrunch

Voice is becoming the primary AI interface, enabling hands-free, agentic interactions across devices by combining expressive speech with large language model reasoning.
Apple
fromThe Verge
2 days ago

You can now use ChatGPT with Apple's CarPlay

ChatGPT is now available on CarPlay for voice-based interactions with iOS 26.4 and the latest app version.
fromwww.scientificamerican.com
2 weeks ago

Can you solve these language puzzles? Test your skills with these problems from North America's biggest linguistics competition

Computational linguistics is a two-way street: You're either using a computer to do things with human language or communicate or translate or teach a foreign language, or you're using computational techniques to learn something about human languages. Her work documenting and preserving endangered languages uses a little bit of both.
Education
Mindfulness
fromPsychology Today
2 weeks ago

How Saying "Please" to AI Changes the Way We Think About It

Using polite language with AI creates perceived relationships that reduce objectivity and increase unhealthy reliance on its responses.
Science
fromThe Cipher Brief
2 weeks ago

Why the U.S. Must Build the Ultimate Multi-Modal Foundation Model

Advanced AI models like AlphaEarth demonstrate pixel-level geospatial intelligence capabilities that must be integrated into U.S. national security frameworks to maintain technological leadership.
Marketing tech
fromForbes
4 days ago

Google Search Live Goes Global, Giving Users Real-Time Search With Voice And Video

Google Search Live expands globally, enabling real-time, multimodal conversations using voice and camera in over 200 countries.
Mobile UX
fromTechCrunch
1 week ago

WhatsApp can now draft AI-generated responses based on your conversations | TechCrunch

WhatsApp introduces AI-powered features for suggested replies, message drafting, photo touch-ups, and space management, enhancing user experience and privacy.
Relationships
fromBusiness Matters
3 weeks ago

Real-time video translation for families: How to end awkward multilingual calls

Real-time video translation removes language barriers in family calls, enabling natural conversations and preserving emotional connection across multilingual households.
Roam Research
fromTechCrunch
2 weeks ago

These AI notetaking devices can help you record and transcribe your meetings | TechCrunch

Physical AI notetakers provide versatile recording and transcription options for meetings, offering features like live translation and unlimited transcription without subscriptions.
Media industry
fromMashable
2 weeks ago

AI translation tool turns English into 'LinkedIn'

Kagi, a premium search service, offers a free AI-based tool that translates standard English into LinkedIn's characteristic self-promotional jargon and corporate speak.
Deliverability
fromFast Company
3 weeks ago

How to communicate like a human in the age of AI

AI-generated communication lacks personal distinctiveness and authenticity, reducing trustworthiness despite appearing professional, while minimal AI editing preserves human voice and credibility.
Apple
fromWIRED
1 week ago

How to Use Apple's Live Translation on Your AirPods

Live Translation feature in AirPods enables instant language translation through Apple devices, enhancing communication during calls and travel.
Medicine
fromwww.bbc.com
4 weeks ago

'My new AI voice keeps my personality alive'

AI technology enables a motor neurone disease patient to communicate using a reconstructed version of her own voice, restoring personal identity and family connection.
Psychology
fromPsychology Today
1 month ago

Conversational AI and Emotional Intelligence

Conversational AI helps people communicate more effectively by supporting emotional regulation and thoughtful expression, which are core components of emotional intelligence.
Science
fromMail Online
1 month ago

AI is being taught UK regional slang - so, how many terms do YOU know?

UK researchers are training AI systems to understand regional slang and accents so automated council phone lines can better serve local callers across different dialects.
Artificial intelligence
fromInfoWorld
3 weeks ago

How developers can bring voice AI into telephony applications

Voice AI agents require complex infrastructure beyond LLMs to integrate with legacy telephony systems, demanding flexible architecture designed for component switching and evolution.
fromZDNET
3 weeks ago

I wrote off ChatGPT's voice mode, then found 7 ways it's genuinely useful

Talking to ChatGPT feels more collaborative than typing. It shines for brainstorming, prep, and translation. Usage limits can interrupt productivity mid-session. Voice Mode runs on mobile devices, as well as in your browser. On mobile, there are two ChatGPT widgets available for the lock screen. One widget opens the app, and one launches ChatGPT Voice.
Artificial intelligence
Artificial intelligence
fromFortune
3 weeks ago

AI mastered language. The physical world is next | Fortune

Embodied AI advancement requires world modeling and physical understanding, constrained by scarcity of specific training data rather than compute or architecture limitations.
fromwww.socialmediatoday.com
2 months ago

Meta Adds More Languages to AI Translations for Reels

As explained by Meta: AI-powered translations for Reels are starting to roll out in more languages, including Bengali, Tamil, Telugu, Marathi, and Kannada, on Instagram. These new additions build on our existing language support for English, Hindi, Portuguese, and Spanish. The addition of more of the languages spoken in India is significant, because India is now the biggest single market for both Facebook and Instagram usage, beating out the U.S. by a significant margin.
Tech industry
Pets
fromMail Online
1 month ago

Want your dog to understand everything you're saying?

A company offers a collar that converts human speech into AI-generated dog barks that elicit responses, while experts doubt it enables true conversational exchange.
#t-mobile
Artificial intelligence
fromTechCrunch
1 month ago

Claude Code rolls out a voice mode capability | TechCrunch

Anthropic launches Voice Mode for Claude Code, enabling developers to interact with the AI coding assistant through spoken commands, starting with 5% of users.
Artificial intelligence
fromTechzine Global
1 month ago

Claude, surging in popularity, can now copy rival chatbots' memories

Anthropic introduced a memory import tool enabling users to transfer conversation history and preferences from competing chatbots like ChatGPT and Gemini directly into Claude.
fromPsychology Today
2 months ago

Don't Get Lost in Translation

Led Zeppelin warned us about the perils of misunderstood communications in relationships. Failing to translate what we are trying to say or do so that someone else gets it is the root of so many problems. But translation is a fantastic find when it goes right. Here are some things I've learned about translating meaning from a lifetime of speaking numerous languages, practicing a wide array of martial arts, and communicating science.
Philosophy
UX design
fromMedium
2 months ago

Beyond conversations: natural language as interaction influencer

Natural language interfaces shift responsibility from users learning system structure to systems understanding user intent and executing compressed workflows.
Gadgets
fromSpyglass
2 months ago

"Hello, Computer."

AI-driven advances are creating an inflection point that may finally enable practical, mainstream voice computing after years of partial progress and false starts.
Education
fromSilicon Canals
1 month ago

7 words highly intelligent people use in conversation that average people mispronounce - Silicon Canals

Correct pronunciation of commonly mispronounced words often reflects extensive reading, attention to language, and habitual auditory correction rather than showing off.
Science
fromNature
2 months ago

ArXiv says submissions must be in English: are AI translators up for the job?

arXiv requires all submissions to be in English or include a full English translation starting 11 February.
fromMedium
2 months ago

Beyond chat: 8 core user intents driving AI interaction

The majority of AI products remain tethered to a single, monolithic UI pattern: the chat box. While conversational interfaces are effective for exploration and managing ambiguity, they frequently become suboptimal when applied to structured professional workflows. To move beyond "bolted-on" chat, product teams must shift from asking where AI can be added to identifying the specific user intent and the interface best suited to deliver it.
UX design
Artificial intelligence
fromPsychology Today
1 month ago

An AI Voice Is Not a Mind

AI systems select and perform contextually appropriate personas rather than expressing unified selves with genuine beliefs, creating fluency that mimics mind without possessing interiority or conviction.
Marketing tech
fromThe Drum
1 month ago

Getting the first word in voice search

Voice search usage is growing, creating brand opportunities while requiring optimisation for accuracy, shopping trust, and adaptation to screenless interactions.
fromEngadget
2 months ago

Subtle's 'Voicebuds' use AI to transcribe your words below a whisper, or in very loud spaces

There's a good chance you spend more time talking to your phone's virtual assistant, or dictating text with your voice, instead of actually calling people these days. But, as convenient as voice input can be, you don't want to be the obnoxious person shouting commands to Siri in a quiet library. And you probably won't have much luck dictating an email in a room with toddlers screaming and Peppa Pig blaring on the TV. (Ask me how I know.)
Gadgets
fromFortune
1 month ago

We studied chatbots and language and saw a huge problem: They mean 80% when they say 'likely' but humans hear 65% | Fortune

By comparing how AI models and humans map these words to numerical percentages, we uncovered significant gaps between humans and large language models. While the models do tend to agree with humans on extremes like 'impossible,' they diverge sharply on hedge words like 'maybe.' For example, a model might use the word 'likely' to represent an 80% probability, while a human reader assumes it means closer to 65%.
Artificial intelligence
Apple
fromThe Verge
2 months ago

Apple's second biggest acquisition ever is an AI company that listens to 'silent speech'

Apple acquired AI audio startup Q.ai to integrate imaging and audio machine-learning for nonverbal/micro-movement recognition, enabling whispered-speech interfaces across AirPods, Vision Pro, iPhone, and Macs.
Gadgets
fromFast Company
1 month ago

Kagi's new app is like Google Translate, plus privacy

Kagi Translate is a free mobile translation app that mirrors Google Translate features while prioritizing user privacy with no ads, trackers, or data monetization.
Artificial intelligence
fromTechCrunch
1 month ago

Cohere launches a family of open multilingual models | TechCrunch

Cohere launched Tiny Aya open-weight multilingual models supporting 70+ languages, runnable offline on everyday devices with a 3.35B-parameter base and regional variants.
fromWIRED
1 month ago

A New Mistral AI Model's Ultra-Fast Translation Gives Big AI Labs a Run for Their Money

On Wednesday, the Paris-based AI lab released two new speech-to-text models: Voxtral Mini Transcribe V2 and Voxtral Realtime. The former is built to transcribe audio files in large batches and the latter for nearly real-time transcription, within 200 milliseconds; both can translate between 13 languages. Voxtral Realtime is freely available under an open source license.
Artificial intelligence
Artificial intelligence
fromBusiness Matters
2 months ago

Free AI Dubbing Tool with Audiobook Support - Convert Text to Speech Instantly

AI audiobook generators and dubbing engines let anyone convert text or video into realistic, human-like audio quickly, affordably, and across languages.
Artificial intelligence
fromInfoQ
2 months ago

Google Introduces TranslateGemma Open Models for Multilingual Translation

TranslateGemma is an open suite of 4B, 12B, and 27B translation models delivering efficient machine translation across 55 languages for diverse hardware.
fromInfoQ
2 months ago

Hugging Face Releases FineTranslations, a Trillion-Token Multilingual Parallel Text Dataset

The dataset was created by translating non-English content from the FineWeb2 corpus into English using Gemma3 27B, with the full data generation pipeline designed to be reproducible and publicly documented. The dataset is primarily intended to improve machine translation, particularly in the English→X direction, where performance remains weaker for many lower-resource languages. By starting from text originally written in non-English languages and translating it into English, FineTranslations provides large-scale parallel data suitable for fine-tuning existing translation models.
Artificial intelligence
fromwww.wired.com
2 months ago

Stop Using Your Keyboard and Start Using This Simple, Free Speech-to-Text App

If old sci-fi shows are anything to go by, we're all using our computers wrong. We're still typing with our fingers, like cave people, instead of talking out loud the way the future was supposed to be. Have you ever seen Picard touch a keyboard? Of course not. And it's odd because our computers are all capable of turning speech into text by default. The problem? It just doesn't work very well. Or, at least, it didn't.
Artificial intelligence
fromwww.dw.com
1 month ago

Moltbook explained: Where AI bots meet to 'discuss' humans

The new talk of the town is one where humans have no place: a site called Moltbook that describes itself as a "social network for AI agents." The Reddit-styled site, launched in late January by US-based entrepreneur Matt Schlicht, is one where thousands of AI assistants talk to each other and discuss topics ranging from the technical to the philosophical.
Artificial intelligence
fromFast Company
1 month ago

Are LTMs the next LLMs? This new type of AI can do what large-language models can't

A major difference between LLMs and LTMs is the type of data they're able to synthesize and use. LLMs use unstructured data (think text, social media posts, emails, etc.). LTMs, on the other hand, can extract information or insights from structured data, which could be contained in tables, for instance. Since many enterprises rely on structured data, often contained in spreadsheets, to run their operations, LTMs could have an immediate use case for many organizations.
Artificial intelligence
fromTNW | Artificial-Intelligence
1 month ago

Stop talking to AI, let them talk to each other: The A2A protocol

Have you ever asked Alexa to remind you to send a WhatsApp message at a determined hour? And then you just wonder, 'Why can't Alexa just send the message herself?' Or the incredible frustration when you use an app to plan a trip, only to have to jump to your calendar/booking website/tour/bank account instead of your AI assistant doing it all? Well, exactly this gap between AI automation and human action is what the agent-to-agent (A2A) protocol aims to address. With the introduction of AI Agents, the next step of evolution seemed to be communication. But when communication between machines and humans is already here, what's left?
Artificial intelligence