#audio--voice-processing

[ follow ]
#google
fromSearch Engine Roundtable
1 hour ago
Podcast

Google Testing Audio Overviews In The Wild

Google is testing Audio Overviews in search results, generating conversational audio for queries using Gemini models.
fromTNW | Artificial-Intelligence
2 weeks ago
Mobile UX

Google quietly releases free offline AI dictation app for iPhone | TNW

Google AI Edge Eloquent is a free, offline voice dictation app that transcribes speech in real time and polishes text without internet access.
Mobile UX
fromTNW | Artificial-Intelligence
2 weeks ago

Google quietly releases free offline AI dictation app for iPhone | TNW

Google AI Edge Eloquent is a free, offline voice dictation app that transcribes speech in real time and polishes text without internet access.
Gadgets
fromGSMArena.com
15 hours ago

Nothing introduces Essential Voice speech-to-text transcription and translation

Essential Voice is a speech-to-text engine that delivers clear, real-time text by eliminating filler words and supporting multiple languages.
Productivity
fromBusiness Matters
1 day ago

5 Best AI Note Takers for Sales Calls in 2026

Sales reps lose a full day weekly to post-call admin, impacting deal closure; AI note takers enhance focus and streamline CRM updates.
Mobile UX
fromGSMArena.com
18 hours ago

Google confirms: revamped Siri will be powered by Gemini

Apple's Siri will be revamped using Google's Gemini AI models, expected to launch at the Worldwide Developers Conference in June.
Medicine
fromHarvard Gazette
1 day ago

Hearing breakthrough holds up - Harvard Gazette

Gene therapy for inherited deafness shows significant and lasting improvements in hearing and speech recognition, especially in younger patients.
Data science
fromTheregister
2 days ago

LLMs fuel new generation of natural language query systems

Text-to-SQL tools may simplify data queries but can misinterpret business users' intentions, raising caution for organizations.
fromEngadget
2 days ago

Anker's 'Thus' chip brings AI to its headphones and other products

Anker calls Thus the 'first Compute-in-Memory (CIM) AI audio chip with neural networks.' The chip integrates computing power directly into NOR flash memory cells, providing faster read speeds than NAND memory.
Wearables
Wearables
fromCGMagazine
1 week ago

BEACN Creates A Voice-First Headset, Releasing Spring 2026

BEACN launched a premium wireless headset focused on delivering high-quality voice and sound for online communication.
#ai-music
Music production
fromTechCrunch
3 weeks ago

ElevenLabs releases a new AI-powered music generation app | TechCrunch

ElevenLabs launched ElevenMusic, an iOS app for creating and discovering AI-generated music, aiming to expand beyond voice models and compete in the music space.
Django
fromThe Verge
3 weeks ago

Suno leans into customization with v5.5

Suno v5.5 enhances user control with new features: Voices, My Taste, and Custom Models for personalized AI music creation.
Music production
fromTechCrunch
3 weeks ago

ElevenLabs releases a new AI-powered music generation app | TechCrunch

ElevenLabs launched ElevenMusic, an iOS app for creating and discovering AI-generated music, aiming to expand beyond voice models and compete in the music space.
Django
fromThe Verge
3 weeks ago

Suno leans into customization with v5.5

Suno v5.5 enhances user control with new features: Voices, My Taste, and Custom Models for personalized AI music creation.
#ai
European startups
fromTechCrunch
4 weeks ago

Mistral releases a new open-source model for speech generation | TechCrunch

Mistral launched Voxtral TTS, an open-source text-to-speech model for voice AI assistants and enterprise applications, supporting nine languages.
European startups
fromTechCrunch
4 weeks ago

Mistral releases a new open-source model for speech generation | TechCrunch

Mistral launched Voxtral TTS, an open-source text-to-speech model for voice AI assistants and enterprise applications, supporting nine languages.
Mobile UX
fromTechCrunch
2 weeks ago

Google quietly releases an offline-first AI dictation app on iOS | TechCrunch

Google released an offline-first dictation app called Google AI Edge Eloquent for iOS, featuring advanced speech recognition and text editing capabilities.
fromTechCrunch
4 weeks ago

Cohere launches an open-source voice model specifically for transcription | TechCrunch

Cohere's Transcribe model is designed for tasks like note-taking and speech analysis, supporting 14 languages and optimized for consumer-grade GPUs, making it accessible for self-hosting.
European startups
fromWIRED
3 weeks ago

Meet the Man Making Music With His Brain Implant

Galen Buckwalter, a 69-year-old research psychologist and quadriplegic, participated in a brain implant study to contribute to science that aids those with paralysis. The six chips in his brain decode movement intention, allowing him to operate a computer and feel sensations in his fingers again.
Music production
Gadgets
fromTechCrunch
3 weeks ago

Speechify's Windows app uses local models for transcription and dictation | TechCrunch

Speechify launched a Windows app for dictation and reading aloud, processing voice entirely on-device for enhanced user experience.
Music production
fromThe Verge
4 weeks ago

Google Lyria 3 Pro makes longer AI songs

Google's Lyria 3 music-making AI now creates tracks up to three minutes long with enhanced features for user control and integration with other Google products.
Silicon Valley
fromFast Company
1 month ago

The tech that restored Eric Dane's voice shows how AI can be used for good, says Rebecca Gayheart Dane

ElevenLabs uses AI technology to restore voices for people who have lost them, having helped approximately 7,000 people worldwide and aiming to provide free access to one million people.
#ai-notetakers
fromTechCrunch
1 month ago
Roam Research

These AI notetaking devices can help you record and transcribe your meetings | TechCrunch

Roam Research
fromTechCrunch
1 month ago

These AI notetaking devices can help you record and transcribe your meetings | TechCrunch

Physical AI notetakers provide versatile recording and transcription options for meetings, offering features like live translation and unlimited transcription without subscriptions.
#ai-music-generation
Music production
fromEngadget
4 weeks ago

Google's Lyria 3 Pro can now generate AI music (slop) up to 3 minutes in length

Google's Lyria 3 Pro generates full three-minute songs with enhanced customization and understanding of musical composition.
Music production
fromEngadget
4 weeks ago

Google's Lyria 3 Pro can now generate AI music (slop) up to 3 minutes in length

Google's Lyria 3 Pro generates full three-minute songs with enhanced customization and understanding of musical composition.
Podcast
fromTechCrunch
1 month ago

Rebel Audio is a new AI podcasting tool aimed at first-time creators | TechCrunch

Rebel Audio launches an all-in-one podcasting platform to eliminate barriers for first-time creators by consolidating recording, editing, publishing, and promotion into a single tool.
Video games
fromEngadget
1 month ago

Arc Raiders replaced some of its AI-generated voice lines with professional actors

Embark Studios replaced some AI-generated voice lines in Arc Raiders with human voice actors after player backlash, acknowledging that professional actors deliver superior quality compared to AI.
Music production
fromTechCrunch
4 weeks ago

Google launches Lyria 3 Pro music generation model | TechCrunch

Google released Lyria 3 Pro, allowing users to create longer music tracks with enhanced customization and control compared to Lyria 3.
Media industry
fromTechCrunch
1 month ago

Substack launches a built-in recording studio | TechCrunch

Substack launches Recording Studio, enabling creators to pre-record and publish videos with built-in editing, clip generation, and thumbnail creation tools.
fromThe Verge
1 month ago

The secret story of the vocoder, the military tech that changed music forever

The vocoder was never supposed to be a revolution in music. Its development began a century ago, when an engineer at Bell Labs was looking for a simpler way to send phone calls across copper telephone lines.
Music production
fromDEV Community
1 month ago

I Built a 100% Private, On-Device AI Audio Stem Splitter (No Servers!)

If you've ever used tools like PhonicMind or LALAL.AI, you know the drill: Upload your MP3. Wait in a queue. Pay for "credits" or high-quality downloads. Your file sits on someone else's server. For musicians, producers, or just karaoke fans, this is slow and privacy-invasive.
Music production
Marketing tech
fromThe Drum
2 months ago

Getting the first word in voice search

Voice search usage is growing, creating brand opportunities while requiring optimisation for accuracy, shopping trust, and adaptation to screenless interactions.
Science
fromwww.scientificamerican.com
2 months ago

Speech sounds are a blurhere's how your brain sorts them out

High-gamma brain-wave power drops about 100 milliseconds after word boundaries, marking word endings and tracking native-language fluency.
Apple
fromTechRepublic
2 months ago

Apple Unveils Steps to Make Siri Sound Human - TechRepublic

A method reduces text-to-speech latency to make Siri and other voice-driven products sound more responsive while preserving intelligibility and accuracy.
#audio-advertising
fromeLearning Industry
2 months ago

How To Remove Background Noise From Video: Best Practices For Professional Content

When professionals talk about how to remove background noise from video, they are really talking about improving the audio track of a video so the speaker's voice is clearer, more consistent, and easier to understand. Background noise refers to any unwanted sound that competes with the main voice, like air conditioning hum, office chatter, keyboard typing, traffic, or the low hiss created by recording equipment and compression. In video production, background noise removal is about reducing distractions so the listener can focus on the message.
Film
Books
fromThe Walrus
2 months ago

Speakerphone | The Walrus

Prayer as keeping an open line fosters mutual, attentive silence and faint shared speech amid everyday noises and distance.
Education
fromSilicon Canals
2 months ago

7 words highly intelligent people use in conversation that average people mispronounce - Silicon Canals

Correct pronunciation of commonly mispronounced words often reflects extensive reading, attention to language, and habitual auditory correction rather than showing off.
Venture
from24/7 Wall St.
1 month ago

SoundHound AI Stuns With 80% EPS Beat and Voice AI Expansion Keeps Accelerating

SoundHound AI narrowed quarterly losses to 2 cents per share, beating estimates by 79.65% and moving closer to breakeven while revenue grew 85% year-over-year to $84.7 million.
Medicine
fromwww.bbc.com
1 month ago

'My new AI voice keeps my personality alive'

AI technology enables a motor neurone disease patient to communicate using a reconstructed version of her own voice, restoring personal identity and family connection.
Artificial intelligence
fromInfoWorld
1 month ago

How developers can bring voice AI into telephony applications

Voice AI agents require complex infrastructure beyond LLMs to integrate with legacy telephony systems, demanding flexible architecture designed for component switching and evolution.
#apple-acquisition
Data science
fromNature
2 months ago

Science finds its song

Scientists are translating research data into music, fostering interdisciplinary collaboration, revealing patterns, and increasing accessibility through data-driven music events.
Music
fromThe Atlantic
2 months ago

Is AI Ruining Music?

Streaming economics, algorithmic recommendations, and generative AI commodify music, reduce artist revenue, and threaten creative control and discovery.
Medicine
fromwww.bbc.com
2 months ago

The new treatment giving people their voices back

Platelet-rich plasma (PRP) injections into scarred vocal cords can promote regeneration, improve voice projection, and offer a potentially cheaper, longer-lasting treatment for vocal damage.
fromZDNET
1 month ago

I wrote off ChatGPT's voice mode, then found 7 ways it's genuinely useful

Talking to ChatGPT feels more collaborative than typing. It shines for brainstorming, prep, and translation. Usage limits can interrupt productivity mid-session. Voice Mode runs on mobile devices, as well as in your browser. On mobile, there are two ChatGPT widgets available for the lock screen. One widget opens the app, and one launches ChatGPT Voice.
Artificial intelligence
Marketing tech
fromSocial Media Examiner
1 month ago

Clone Your Knowledge: Getting AI to Truly Sound Like You : Social Media Examiner

Create a Leadership Lexicon by systematically collecting your expertise, communication style, and decision-making processes to train AI tools that replicate your unique voice and methodology at scale.
fromYanko Design - Modern Industrial Design News
2 months ago

Teenage Engineering-inspired Music Sampler Uses AI In The Nerdiest Way Possible - Yanko Design

Junho Park's graduation concept borrows all the right cues from TE's playbook, that modular control layout, the single bold color, the mix of knobs and buttons that practically beg to be touched, but redirects them toward a gap in the market. Where Teenage Engineering designs for people who already understand synthesis and sampling, the T.M-4 targets people who have ideas but no vocabulary to express them.
Gadgets
Artificial intelligence
fromPsychology Today
1 month ago

An AI Voice Is Not a Mind

AI systems select and perform contextually appropriate personas rather than expressing unified selves with genuine beliefs, creating fluency that mimics mind without possessing interiority or conviction.
Gadgets
fromTechCrunch
2 months ago

These AI notetaking devices can help you record and transcribe your meetings | TechCrunch

Physical AI notetakers record and transcribe in-person conversations, providing AI-generated summaries, action items, translations, and varied pricing or subscription options.
fromTechCrunch
2 months ago

ElevenLabs CEO: Voice is the next interface for AI | TechCrunch

ElevenLabs co-founder and CEO Mati Staniszewski says voice is becoming the next major interface for AI - the way people will increasingly interact with machines as models move beyond text and screens. Speaking at Web Summit in Doha, Staniszewski told TechCrunch voice models like those developed by ElevenLabs have recently moved beyond simply mimicking human speech - including emotion and intonation - to working in tandem with the reasoning capabilities of large language models.
Artificial intelligence
Gadgets
fromZDNET
2 months ago

3 easy ways to upgrade your headphones today - for free

Update headphone firmware and adjust EQ before replacing otherwise functional headphones to improve sound, connectivity, noise cancellation, and microphone performance.
Artificial intelligence
fromwww.aljazeera.com
1 month ago

ElevenLabs CEO says voice AI will change everything. Can it be controlled?

Voice AI technology enables beneficial applications like speech restoration and accessibility while simultaneously creating risks for fraud, disinformation, and unauthorized voice cloning that raise fundamental questions about voice ownership and control.
Music production
fromwww.scientificamerican.com
1 month ago

Experimental composer Holly Herndon built an AI voice clone that anyone can use

Holly Herndon uses machine learning and AI models to create protocol art, where the creative act occurs in designing rule sets and datasets rather than in final media generation, making collective creativity visible.
Artificial intelligence
fromTechCrunch
1 month ago

Claude Code rolls out a voice mode capability | TechCrunch

Anthropic launches Voice Mode for Claude Code, enabling developers to interact with the AI coding assistant through spoken commands, starting with 5% of users.
Artificial intelligence
fromBusiness Matters
2 months ago

AI voice company ElevenLabs valued at $11bn after $500m funding round

ElevenLabs raised $500 million, valuing the company at $11 billion and accelerating expansion in AI voice, multilingual dubbing, music generation, and enterprise adoption.
Artificial intelligence
fromWIRED
1 month ago

Huxe Will Give You a Personalized, Daily Audio Summary Powered by AI

Huxe is an AI-powered podcast app that generates personalized daily audio briefs from your email and calendar to streamline morning productivity routines.
[ Load more ]