#audio--voice-processing
#audio--voice-processing

[ follow ]

Podcast

Google Testing Audio Overviews In The Wild

Google is testing Audio Overviews in search results, generating conversational audio for queries using Gemini models.

Mobile UX

Google quietly releases free offline AI dictation app for iPhone | TNW

Google AI Edge Eloquent is a free, offline voice dictation app that transcribes speech in real time and polishes text without internet access.

Podcast

fromSearch Engine Roundtable

1 hour ago

Google Testing Audio Overviews In The Wild

Google is testing Audio Overviews in search results, generating conversational audio for queries using Gemini models.

Mobile UX

fromTNW | Artificial-Intelligence

2 weeks ago

Google quietly releases free offline AI dictation app for iPhone | TNW

Google AI Edge Eloquent is a free, offline voice dictation app that transcribes speech in real time and polishes text without internet access.

Mouth Coding

Mouth coding enables collaborative website creation through conversation, transforming ideas into tangible digital products in real-time.

Gadgets

fromGSMArena.com

15 hours ago

Nothing introduces Essential Voice speech-to-text transcription and translation

Essential Voice is a speech-to-text engine that delivers clear, real-time text by eliminating filler words and supporting multiple languages.

Artificial intelligence

fromMail Online

2 days ago

Take the test to see if you can distinguish real and AI VOICES

AI voice clones can recreate human voices with greater clarity and intelligibility than the original speakers.

Productivity

fromBusiness Matters

1 day ago

5 Best AI Note Takers for Sales Calls in 2026

Sales reps lose a full day weekly to post-call admin, impacting deal closure; AI note takers enhance focus and streamline CRM updates.

Mobile UX

fromGSMArena.com

18 hours ago

Google confirms: revamped Siri will be powered by Gemini

Apple's Siri will be revamped using Google's Gemini AI models, expected to launch at the Worldwide Developers Conference in June.

Medicine

fromHarvard Gazette

1 day ago

Hearing breakthrough holds up - Harvard Gazette

Gene therapy for inherited deafness shows significant and lasting improvements in hearing and speech recognition, especially in younger patients.

Data science

fromTheregister

2 days ago

LLMs fuel new generation of natural language query systems

Text-to-SQL tools may simplify data queries but can misinterpret business users' intentions, raising caution for organizations.

fromEngadget

2 days ago

Anker's 'Thus' chip brings AI to its headphones and other products

Anker calls Thus the 'first Compute-in-Memory (CIM) AI audio chip with neural networks.' The chip integrates computing power directly into NOR flash memory cells, providing faster read speeds than NAND memory.

Wearables

Artificial intelligence

fromPsychology Today

15 hours ago

The Many Ways Chatbot Tools Can Manipulate Us

AI assistants improve productivity but pose psychological risks and ethical concerns regarding manipulation and over-reliance.

UX design

fromYanko Design - Modern Industrial Design News

1 week ago

Your Voice Wearable and Robot Hear the Words Mute People Can't Say - Yanko Design

Your Voice transforms attempted speech into audible communication for those with speech impairments, enhancing accessibility and immediacy.

Wearables

fromCGMagazine

1 week ago

BEACN Creates A Voice-First Headset, Releasing Spring 2026

BEACN launched a premium wireless headset focused on delivering high-quality voice and sound for online communication.

ElevenLabs releases a new AI-powered music generation app | TechCrunch

ElevenLabs launched ElevenMusic, an iOS app for creating and discovering AI-generated music, aiming to expand beyond voice models and compete in the music space.

Django

fromThe Verge

3 weeks ago

Suno leans into customization with v5.5

Suno v5.5 enhances user control with new features: Voices, My Taste, and Custom Models for personalized AI music creation.

fromYanko Design - Modern Industrial Design News

2 months ago

Music

These 5 AI Modules Listen When You Hum, Tap, or Strum, Not Type - Yanko Design

Music production

fromTechCrunch

3 weeks ago

ElevenLabs releases a new AI-powered music generation app | TechCrunch

ElevenLabs launched ElevenMusic, an iOS app for creating and discovering AI-generated music, aiming to expand beyond voice models and compete in the music space.

Django

fromThe Verge

3 weeks ago

Suno leans into customization with v5.5

Suno v5.5 enhances user control with new features: Voices, My Taste, and Custom Models for personalized AI music creation.

fromYanko Design - Modern Industrial Design News

2 months ago

Music

These 5 AI Modules Listen When You Hum, Tap, or Strum, Not Type - Yanko Design

more#ai-music

#ai

Roam Research

fromdesignboom | architecture & design magazine

3 weeks ago

radio time machine AI audio system recreates past sounds for cognitive health in elderly care

AI-powered Radio Time Machine enhances well-being in elderly care by generating nostalgic audio content to stimulate memories and communication.

European startups

fromTechCrunch

4 weeks ago

Mistral releases a new open-source model for speech generation | TechCrunch

Mistral launched Voxtral TTS, an open-source text-to-speech model for voice AI assistants and enterprise applications, supporting nine languages.

Roam Research

fromdesignboom | architecture & design magazine

3 weeks ago

radio time machine AI audio system recreates past sounds for cognitive health in elderly care

AI-powered Radio Time Machine enhances well-being in elderly care by generating nostalgic audio content to stimulate memories and communication.

European startups

fromTechCrunch

4 weeks ago

Mistral releases a new open-source model for speech generation | TechCrunch

Mistral launched Voxtral TTS, an open-source text-to-speech model for voice AI assistants and enterprise applications, supporting nine languages.

Google quietly releases an offline-first AI dictation app on iOS | TechCrunch

Google released an offline-first dictation app called Google AI Edge Eloquent for iOS, featuring advanced speech recognition and text editing capabilities.

fromTechCrunch

4 weeks ago

Cohere launches an open-source voice model specifically for transcription | TechCrunch

Cohere's Transcribe model is designed for tasks like note-taking and speech analysis, supporting 14 languages and optimized for consumer-grade GPUs, making it accessible for self-hosting.

European startups

fromWIRED

3 weeks ago

Meet the Man Making Music With His Brain Implant

Galen Buckwalter, a 69-year-old research psychologist and quadriplegic, participated in a brain implant study to contribute to science that aids those with paralysis. The six chips in his brain decode movement intention, allowing him to operate a computer and feel sensations in his fingers again.

Music production

Gadgets

fromTechCrunch

3 weeks ago

Speechify's Windows app uses local models for transcription and dictation | TechCrunch

Speechify launched a Windows app for dictation and reading aloud, processing voice entirely on-device for enhanced user experience.

Artificial intelligence

fromFast Company

3 weeks ago

AI is teaching us to speak like bots and its a problem

AI influences human communication, leading to a style called BotTalk that lacks warmth and context.

Music production

fromThe Verge

4 weeks ago

Google Lyria 3 Pro makes longer AI songs

Google's Lyria 3 music-making AI now creates tracks up to three minutes long with enhanced features for user control and integration with other Google products.

Silicon Valley

fromFast Company

1 month ago

The tech that restored Eric Dane's voice shows how AI can be used for good, says Rebecca Gayheart Dane

ElevenLabs uses AI technology to restore voices for people who have lost them, having helped approximately 7,000 people worldwide and aiming to provide free access to one million people.

#ai-notetakers

fromTechCrunch

1 month ago

Roam Research

These AI notetaking devices can help you record and transcribe your meetings | TechCrunch

fromWIRED

2 months ago

Gadgets

The Best AI Notetakers to Record Your Meetings, Interviews, or Classes

Roam Research

fromTechCrunch

1 month ago

These AI notetaking devices can help you record and transcribe your meetings | TechCrunch

Physical AI notetakers provide versatile recording and transcription options for meetings, offering features like live translation and unlimited transcription without subscriptions.

fromWIRED

2 months ago

Gadgets

The Best AI Notetakers to Record Your Meetings, Interviews, or Classes

Google's Lyria 3 Pro can now generate AI music (slop) up to 3 minutes in length

Google's Lyria 3 Pro generates full three-minute songs with enhanced customization and understanding of musical composition.

fromEntrepreneur

1 month ago

Music production

AI Is Changing Music Production - But It Can't Fill Creative Gaps

fromTNW | Music

2 months ago

Artificial intelligence

Google's new music tool, Lyria 3 is here

Music production

fromEngadget

4 weeks ago

Google's Lyria 3 Pro can now generate AI music (slop) up to 3 minutes in length

Google's Lyria 3 Pro generates full three-minute songs with enhanced customization and understanding of musical composition.

fromEntrepreneur

1 month ago

Music production

AI Is Changing Music Production - But It Can't Fill Creative Gaps

fromTNW | Music

2 months ago

Artificial intelligence

Google's new music tool, Lyria 3 is here

more#ai-music-generation

Podcast

fromTechCrunch

1 month ago

Rebel Audio is a new AI podcasting tool aimed at first-time creators | TechCrunch

Rebel Audio launches an all-in-one podcasting platform to eliminate barriers for first-time creators by consolidating recording, editing, publishing, and promotion into a single tool.

Video games

fromEngadget

1 month ago

Arc Raiders replaced some of its AI-generated voice lines with professional actors

Embark Studios replaced some AI-generated voice lines in Arc Raiders with human voice actors after player backlash, acknowledging that professional actors deliver superior quality compared to AI.

Music production

fromTechCrunch

4 weeks ago

Google launches Lyria 3 Pro music generation model | TechCrunch

Google released Lyria 3 Pro, allowing users to create longer music tracks with enhanced customization and control compared to Lyria 3.

Media industry

fromTechCrunch

1 month ago

Substack launches a built-in recording studio | TechCrunch

Substack launches Recording Studio, enabling creators to pre-record and publish videos with built-in editing, clip generation, and thumbnail creation tools.

fromThe Verge

1 month ago

The secret story of the vocoder, the military tech that changed music forever

The vocoder was never supposed to be a revolution in music. Its development began a century ago, when an engineer at Bell Labs was looking for a simpler way to send phone calls across copper telephone lines.

Music production

Software development

fromeLearning

2 months ago

Captivate Text to speech vioces downloads - eLearning

Request instructions and links to download and import additional Adobe Captivate voices beyond the existing ones.

fromDEV Community

1 month ago

I Built a 100% Private, On-Device AI Audio Stem Splitter (No Servers!)

If you've ever used tools like PhonicMind or LALAL.AI, you know the drill: Upload your MP3. Wait in a queue. Pay for "credits" or high-quality downloads. Your file sits on someone else's server. For musicians, producers, or just karaoke fans, this is slow and privacy-invasive.

Music production

Marketing tech

fromThe Drum

2 months ago

Getting the first word in voice search

Voice search usage is growing, creating brand opportunities while requiring optimisation for accuracy, shopping trust, and adaptation to screenless interactions.

Science

fromwww.scientificamerican.com

2 months ago

Speech sounds are a blurhere's how your brain sorts them out

High-gamma brain-wave power drops about 100 milliseconds after word boundaries, marking word endings and tracking native-language fluency.

Apple

fromTechRepublic

2 months ago

Apple Unveils Steps to Make Siri Sound Human - TechRepublic

A method reduces text-to-speech latency to make Siri and other voice-driven products sound more responsive while preserving intelligibility and accuracy.

#audio-advertising

fromThe Drum

2 months ago

Marketing

The Audio Impact: Messaging that works

fromExchangewire

2 months ago

Marketing tech

Audion Launches First AI Agent Built to Deliver Tangible Outcomes in Digital Audio Advertising

fromThe Drum

2 months ago

Marketing

The Audio Impact: Messaging that works

fromExchangewire

2 months ago

Marketing tech

Audion Launches First AI Agent Built to Deliver Tangible Outcomes in Digital Audio Advertising

more#audio-advertising

fromeLearning Industry

2 months ago

How To Remove Background Noise From Video: Best Practices For Professional Content

When professionals talk about how to remove background noise from video, they are really talking about improving the audio track of a video so the speaker's voice is clearer, more consistent, and easier to understand. Background noise refers to any unwanted sound that competes with the main voice, like air conditioning hum, office chatter, keyboard typing, traffic, or the low hiss created by recording equipment and compression. In video production, background noise removal is about reducing distractions so the listener can focus on the message.

Film

Books

fromThe Walrus

2 months ago

Speakerphone | The Walrus

Prayer as keeping an open line fosters mutual, attentive silence and faint shared speech amid everyday noises and distance.

Education

fromSilicon Canals

2 months ago

7 words highly intelligent people use in conversation that average people mispronounce - Silicon Canals

Correct pronunciation of commonly mispronounced words often reflects extensive reading, attention to language, and habitual auditory correction rather than showing off.

Venture

from24/7 Wall St.

1 month ago

SoundHound AI Stuns With 80% EPS Beat and Voice AI Expansion Keeps Accelerating

SoundHound AI narrowed quarterly losses to 2 cents per share, beating estimates by 79.65% and moving closer to breakeven while revenue grew 85% year-over-year to $84.7 million.

Medicine

fromwww.bbc.com

1 month ago

'My new AI voice keeps my personality alive'

AI technology enables a motor neurone disease patient to communicate using a reconstructed version of her own voice, restoring personal identity and family connection.

Artificial intelligence

fromInfoWorld

1 month ago

How developers can bring voice AI into telephony applications

Voice AI agents require complex infrastructure beyond LLMs to integrate with legacy telephony systems, demanding flexible architecture designed for component switching and evolution.

#apple-acquisition

fromThe Verge

2 months ago

Apple

Apple's second biggest acquisition ever is an AI company that listens to 'silent speech'

fromGSMArena.com

2 months ago

Apple

Apple buys secretive audio AI startup Q.ai

fromThe Verge

2 months ago

Apple

Apple's second biggest acquisition ever is an AI company that listens to 'silent speech'

fromGSMArena.com

2 months ago

Apple

Apple buys secretive audio AI startup Q.ai

more#apple-acquisition

Data science

fromNature

2 months ago

Science finds its song

Scientists are translating research data into music, fostering interdisciplinary collaboration, revealing patterns, and increasing accessibility through data-driven music events.

Music

fromThe Atlantic

2 months ago

Is AI Ruining Music?

Streaming economics, algorithmic recommendations, and generative AI commodify music, reduce artist revenue, and threaten creative control and discovery.

Medicine

fromwww.bbc.com

2 months ago

The new treatment giving people their voices back

Platelet-rich plasma (PRP) injections into scarred vocal cords can promote regeneration, improve voice projection, and offer a potentially cheaper, longer-lasting treatment for vocal damage.

fromZDNET

1 month ago

I wrote off ChatGPT's voice mode, then found 7 ways it's genuinely useful

Talking to ChatGPT feels more collaborative than typing. It shines for brainstorming, prep, and translation. Usage limits can interrupt productivity mid-session. Voice Mode runs on mobile devices, as well as in your browser. On mobile, there are two ChatGPT widgets available for the lock screen. One widget opens the app, and one launches ChatGPT Voice.

Artificial intelligence

Marketing tech

fromSocial Media Examiner

1 month ago

Clone Your Knowledge: Getting AI to Truly Sound Like You : Social Media Examiner

Create a Leadership Lexicon by systematically collecting your expertise, communication style, and decision-making processes to train AI tools that replicate your unique voice and methodology at scale.

fromYanko Design - Modern Industrial Design News

2 months ago

Teenage Engineering-inspired Music Sampler Uses AI In The Nerdiest Way Possible - Yanko Design

Junho Park's graduation concept borrows all the right cues from TE's playbook, that modular control layout, the single bold color, the mix of knobs and buttons that practically beg to be touched, but redirects them toward a gap in the market. Where Teenage Engineering designs for people who already understand synthesis and sampling, the T.M-4 targets people who have ideas but no vocabulary to express them.

Gadgets

Artificial intelligence

fromPsychology Today

1 month ago

An AI Voice Is Not a Mind

AI systems select and perform contextually appropriate personas rather than expressing unified selves with genuine beliefs, creating fluency that mimics mind without possessing interiority or conviction.

Gadgets

fromTechCrunch

2 months ago

These AI notetaking devices can help you record and transcribe your meetings | TechCrunch

Physical AI notetakers record and transcribe in-person conversations, providing AI-generated summaries, action items, translations, and varied pricing or subscription options.

fromTechCrunch

2 months ago

ElevenLabs CEO: Voice is the next interface for AI | TechCrunch

ElevenLabs co-founder and CEO Mati Staniszewski says voice is becoming the next major interface for AI - the way people will increasingly interact with machines as models move beyond text and screens. Speaking at Web Summit in Doha, Staniszewski told TechCrunch voice models like those developed by ElevenLabs have recently moved beyond simply mimicking human speech - including emotion and intonation - to working in tandem with the reasoning capabilities of large language models.

Artificial intelligence

Gadgets

fromZDNET

2 months ago

3 easy ways to upgrade your headphones today - for free

Update headphone firmware and adjust EQ before replacing otherwise functional headphones to improve sound, connectivity, noise cancellation, and microphone performance.

Artificial intelligence

fromwww.aljazeera.com

1 month ago

ElevenLabs CEO says voice AI will change everything. Can it be controlled?

Voice AI technology enables beneficial applications like speech restoration and accessibility while simultaneously creating risks for fraud, disinformation, and unauthorized voice cloning that raise fundamental questions about voice ownership and control.

Music production

fromwww.scientificamerican.com

1 month ago

Experimental composer Holly Herndon built an AI voice clone that anyone can use

Holly Herndon uses machine learning and AI models to create protocol art, where the creative act occurs in designing rule sets and datasets rather than in final media generation, making collective creativity visible.

Artificial intelligence

fromTechCrunch

1 month ago

Claude Code rolls out a voice mode capability | TechCrunch

Anthropic launches Voice Mode for Claude Code, enabling developers to interact with the AI coding assistant through spoken commands, starting with 5% of users.

Artificial intelligence

fromBusiness Matters

2 months ago

AI voice company ElevenLabs valued at $11bn after $500m funding round

ElevenLabs raised $500 million, valuing the company at $11 billion and accelerating expansion in AI voice, multilingual dubbing, music generation, and enterprise adoption.

Artificial intelligence

fromWIRED

1 month ago

Huxe Will Give You a Personalized, Daily Audio Summary Powered by AI

Huxe is an AI-powered podcast app that generates personalized daily audio briefs from your email and calendar to streamline morning productivity routines.

[ Load more ]

#audio--voice-processing#audio--voice-processing

Google Testing Audio Overviews In The Wild

Google quietly releases free offline AI dictation app for iPhone | TNW

Google Testing Audio Overviews In The Wild

Google quietly releases free offline AI dictation app for iPhone | TNW

Mouth Coding

Nothing introduces Essential Voice speech-to-text transcription and translation

Take the test to see if you can distinguish real and AI VOICES

5 Best AI Note Takers for Sales Calls in 2026

Google confirms: revamped Siri will be powered by Gemini

Hearing breakthrough holds up - Harvard Gazette

LLMs fuel new generation of natural language query systems

Anker's 'Thus' chip brings AI to its headphones and other products

The Many Ways Chatbot Tools Can Manipulate Us

Your Voice Wearable and Robot Hear the Words Mute People Can't Say - Yanko Design

BEACN Creates A Voice-First Headset, Releasing Spring 2026

ElevenLabs releases a new AI-powered music generation app | TechCrunch

Suno leans into customization with v5.5

These 5 AI Modules Listen When You Hum, Tap, or Strum, Not Type - Yanko Design

ElevenLabs releases a new AI-powered music generation app | TechCrunch

Suno leans into customization with v5.5

These 5 AI Modules Listen When You Hum, Tap, or Strum, Not Type - Yanko Design

radio time machine AI audio system recreates past sounds for cognitive health in elderly care

Mistral releases a new open-source model for speech generation | TechCrunch

radio time machine AI audio system recreates past sounds for cognitive health in elderly care

Mistral releases a new open-source model for speech generation | TechCrunch

Google quietly releases an offline-first AI dictation app on iOS | TechCrunch

Cohere launches an open-source voice model specifically for transcription | TechCrunch

Meet the Man Making Music With His Brain Implant

Speechify's Windows app uses local models for transcription and dictation | TechCrunch

AI is teaching us to speak like bots and its a problem

Google Lyria 3 Pro makes longer AI songs

The tech that restored Eric Dane's voice shows how AI can be used for good, says Rebecca Gayheart Dane

These AI notetaking devices can help you record and transcribe your meetings | TechCrunch

The Best AI Notetakers to Record Your Meetings, Interviews, or Classes

These AI notetaking devices can help you record and transcribe your meetings | TechCrunch

The Best AI Notetakers to Record Your Meetings, Interviews, or Classes

Google's Lyria 3 Pro can now generate AI music (slop) up to 3 minutes in length

AI Is Changing Music Production - But It Can't Fill Creative Gaps

Google's new music tool, Lyria 3 is here

Google's Lyria 3 Pro can now generate AI music (slop) up to 3 minutes in length

AI Is Changing Music Production - But It Can't Fill Creative Gaps

Google's new music tool, Lyria 3 is here

Rebel Audio is a new AI podcasting tool aimed at first-time creators | TechCrunch

Arc Raiders replaced some of its AI-generated voice lines with professional actors

Google launches Lyria 3 Pro music generation model | TechCrunch

Substack launches a built-in recording studio | TechCrunch

The secret story of the vocoder, the military tech that changed music forever

Captivate Text to speech vioces downloads - eLearning

I Built a 100% Private, On-Device AI Audio Stem Splitter (No Servers!)

Getting the first word in voice search

Speech sounds are a blurhere's how your brain sorts them out

Apple Unveils Steps to Make Siri Sound Human - TechRepublic

The Audio Impact: Messaging that works

Audion Launches First AI Agent Built to Deliver Tangible Outcomes in Digital Audio Advertising

The Audio Impact: Messaging that works

Audion Launches First AI Agent Built to Deliver Tangible Outcomes in Digital Audio Advertising

How To Remove Background Noise From Video: Best Practices For Professional Content

Speakerphone | The Walrus

7 words highly intelligent people use in conversation that average people mispronounce - Silicon Canals

SoundHound AI Stuns With 80% EPS Beat and Voice AI Expansion Keeps Accelerating

'My new AI voice keeps my personality alive'

How developers can bring voice AI into telephony applications

Apple's second biggest acquisition ever is an AI company that listens to 'silent speech'

Apple buys secretive audio AI startup Q.ai

Apple's second biggest acquisition ever is an AI company that listens to 'silent speech'

Apple buys secretive audio AI startup Q.ai

Science finds its song

Is AI Ruining Music?

The new treatment giving people their voices back

I wrote off ChatGPT's voice mode, then found 7 ways it's genuinely useful

Clone Your Knowledge: Getting AI to Truly Sound Like You : Social Media Examiner

Teenage Engineering-inspired Music Sampler Uses AI In The Nerdiest Way Possible - Yanko Design

An AI Voice Is Not a Mind

These AI notetaking devices can help you record and transcribe your meetings | TechCrunch

ElevenLabs CEO: Voice is the next interface for AI | TechCrunch

3 easy ways to upgrade your headphones today - for free

ElevenLabs CEO says voice AI will change everything. Can it be controlled?

Experimental composer Holly Herndon built an AI voice clone that anyone can use

Claude Code rolls out a voice mode capability | TechCrunch

#audio--voice-processing
#audio--voice-processing