#xt-whisper

[ follow ]
Philosophy
fromJames Bennett
13 hours ago

Let's talk about LLMs

The current technological landscape may represent a significant shift driven by large language models, but its ultimate impact remains uncertain.
Data science
fromMedium
4 hours ago

The Top 10 LLM Training Datasets for 2026

Large language models require extensive training data, and practitioners can utilize ten leading public datasets for effective training and fine-tuning.
Typography
fromOK Magazine
1 day ago

AI Writing Tools: How They Work, Where They Help, and What to Watch For

AI writing tools have become essential for various professionals, enhancing productivity and creativity in content creation.
Angular
fromMedium
1 day ago

Build an AI app for chat and messaging

Building an AI chat app requires a structured approach from architecture to production using Hope AI and BitCloud.
Social media marketing
fromTechCrunch
1 day ago

X is rolling out automatic translation and photo editing powered by Grok | TechCrunch

X introduces automatic translation and a new photo editor powered by Grok models to enhance user experience.
Mobile UX
fromTNW | Artificial-Intelligence
2 days ago

Google quietly releases free offline AI dictation app for iPhone | TNW

Google AI Edge Eloquent is a free, offline voice dictation app that transcribes speech in real time and polishes text without internet access.
European startups
fromTechCrunch
1 day ago

I can't help rooting for tiny open source AI model maker Arcee | TechCrunch

Arcee has released Trinity Large Thinking, a 400B-parameter open-source LLM aimed at providing a competitive alternative to Chinese models.
#openai
fromDefector
6 days ago
Media industry

Tech Media Propaganda Operation Makes It Official, Goes In-House At OpenAI | Defector

Artificial intelligence
fromThe Verge
1 day ago

The vibes are off at OpenAI

OpenAI faces instability despite significant funding and brand recognition, with recent controversies and project discontinuations raising questions about its future.
Media industry
fromIntelligencer
2 days ago

AI's 'Big Tobacco' Moment Is Coming

OpenAI is shifting focus from broad strategies to targeted investments, exemplified by its acquisition of TBPN, a video podcast platform.
Media industry
fromDefector
6 days ago

Tech Media Propaganda Operation Makes It Official, Goes In-House At OpenAI | Defector

OpenAI acquired the Technology Business Programming Network for hundreds of millions, raising concerns about media independence despite its existing alignment with tech elites.
Artificial intelligence
fromThe Verge
1 day ago

The vibes are off at OpenAI

OpenAI faces instability despite significant funding and brand recognition, with recent controversies and project discontinuations raising questions about its future.
#ai
Python
fromPycon
3 days ago

Python and the Future of AI: Agents, Inference, and Edge AI

AI tools are increasingly integrated into development, with a dedicated track at PyCon US focusing on their future and practical applications.
Python
fromPycon
3 days ago

Python and the Future of AI: Agents, Inference, and Edge AI

AI tools are increasingly integrated into development, with a dedicated track at PyCon US focusing on their future and practical applications.
Remote teams
fromwww.businessinsider.com
3 days ago

I'm a Chinese CEO who jumped on the OpenClaw hype and built AI employees. We had to create a human-only Slack channel to escape them.

AI employees can handle repetitive tasks, improving workplace efficiency and allowing humans to focus on creative work.
Artificial intelligence
fromSemafor
7 hours ago

Meta races to catch up to AI giants with new model

Meta launched Muse Spark, its first AI model, aiming to compete with leading tech companies despite concerns about being behind rivals.
Data science
fromInfoQ
3 days ago

Context Engineering with Adi Polak

Context engineering moves beyond prompt engineering to enhance AI systems by adapting language and practices for better model interaction.
fromThe Verge
4 days ago

How the Amazon Echo learned to talk - and listen

Jeff Bezos had been vocal about his desire for a voice computer, believing it would simplify interactions with technology and enhance the shopping experience on Amazon.
Podcast
Digital life
fromTechRepublic
6 days ago

Google Vids Just Got a Major AI Upgrade - Here's What's New

Google Vids enables intuitive video creation using AI, allowing users to direct avatars and publish content quickly with simple text prompts.
Gadgets
fromTechCrunch
1 week ago

Speechify's Windows app uses local models for transcription and dictation | TechCrunch

Speechify launched a Windows app for dictation and reading aloud, processing voice entirely on-device for enhanced user experience.
Mobile UX
fromTechCrunch
3 days ago

Google quietly releases an offline-first AI dictation app on iOS | TechCrunch

Google released an offline-first dictation app called Google AI Edge Eloquent for iOS, featuring advanced speech recognition and text editing capabilities.
fromwww.socialmediatoday.com
2 days ago

X expands AI translations and adds in-stream photo editing

X's new AI-powered auto-translate option will enable users worldwide to read posts from other regions, enhancing accessibility and engagement across diverse languages.
Social media marketing
Apple
fromThe Verge
1 week ago

You can now use ChatGPT with Apple's CarPlay

ChatGPT is now available on CarPlay for voice-based interactions with iOS 26.4 and the latest app version.
#whatsapp
Mobile UX
fromTechCrunch
2 weeks ago

WhatsApp can now draft AI-generated responses based on your conversations | TechCrunch

WhatsApp introduces AI-powered features for suggested replies, message drafting, photo touch-ups, and space management, enhancing user experience and privacy.
Mobile UX
fromTechCrunch
2 weeks ago

WhatsApp can now draft AI-generated responses based on your conversations | TechCrunch

WhatsApp introduces AI-powered features for suggested replies, message drafting, photo touch-ups, and space management, enhancing user experience and privacy.
Productivity
fromFast Company
2 weeks ago

Writer wants to be the go-to AI tool kit for the enterprise

Writer offers AI tools for enterprises, enabling non-engineers to automate tasks without IT support.
fromTechCrunch
2 weeks ago

Cohere launches an open-source voice model specifically for transcription | TechCrunch

Cohere's Transcribe model is designed for tasks like note-taking and speech analysis, supporting 14 languages and optimized for consumer-grade GPUs, making it accessible for self-hosting.
European startups
Data science
fromInfoWorld
1 week ago

A GitHub tinkerer teaches Claude to talk less, and that may matter more than it seems

A markdown file can significantly reduce AI output token usage, enhancing efficiency without code changes.
Roam Research
fromTechCrunch
2 weeks ago

These AI notetaking devices can help you record and transcribe your meetings | TechCrunch

Physical AI notetakers provide versatile recording and transcription options for meetings, offering features like live translation and unlimited transcription without subscriptions.
Gadgets
fromTheregister
2 weeks ago

HP stuffs OpenAI LLM into new laptops in bid for small biz

HP IQ is a new AI collaboration tool from HP designed to enhance productivity in business laptops.
fromTechzine Global
2 days ago

Meta is developing open-source versions of its next frontier AI models

Meta is working on two proprietary frontier models: Avocado, a large language model, and Mango, a multimedia file generator. The open-source variants are expected to be made available at a later date.
Artificial intelligence
fromDEV Community
3 weeks ago

I Built a 100% Private, On-Device AI Audio Stem Splitter (No Servers!)

If you've ever used tools like PhonicMind or LALAL.AI, you know the drill: Upload your MP3. Wait in a queue. Pay for "credits" or high-quality downloads. Your file sits on someone else's server. For musicians, producers, or just karaoke fans, this is slow and privacy-invasive.
Music production
Apple
fromThe Verge
1 week ago

Apple will reportedly allow other AI chatbots to plug into Siri

iOS 27 will allow users to choose AI chatbots to link with Siri.
Django
fromEngadget
3 weeks ago

OpenAI reportedly plans to add Sora video generation to ChatGPT

OpenAI plans to integrate its Sora video generation model into ChatGPT to revive user interest after the standalone app's popularity declined, potentially increasing ChatGPT's active users while managing significant inference costs.
Privacy technologies
fromEngadget
4 weeks ago

Alexa+ can now swear, thanks to a new personality style

Amazon introduced a 'sassy' personality option for Alexa+ that uses censored profanity, playful sarcasm, and witty comebacks while maintaining safety guardrails against harmful content.
#anthropic
Artificial intelligence
fromComputerworld
3 days ago

Anthropic cuts OpenClaw access from Claude subscriptions, offers credits to ease transition

OpenClaw's implementation was delayed by a week due to discussions with Anthropic regarding access cuts and feature copying.
#ai-models
Artificial intelligence
fromTNW | Apps
6 days ago

Microsoft launches three in-house AI models in direct challenge to OpenAI

Microsoft has launched three in-house AI models that compete directly with OpenAI, marking a significant shift in its AI strategy.
Artificial intelligence
fromTNW | Apps
6 days ago

Microsoft launches three in-house AI models in direct challenge to OpenAI

Microsoft has launched three in-house AI models that compete directly with OpenAI, marking a significant shift in its AI strategy.
Medicine
fromwww.bbc.com
1 month ago

'My new AI voice keeps my personality alive'

AI technology enables a motor neurone disease patient to communicate using a reconstructed version of her own voice, restoring personal identity and family connection.
Artificial intelligence
fromTheregister
6 days ago

Microsoft shivs OpenAI with new AI models for speech, images

Microsoft launched public preview versions of machine learning models for speech recognition, speech synthesis, and image generation, competing directly with OpenAI.
Artificial intelligence
fromEngadget
5 days ago

It's no longer free to use Claude through third-party tools like OpenClaw

Anthropic will charge third-party apps for using Claude AI, requiring a usage bundle or API key starting April 4.
Music production
fromwww.scientificamerican.com
1 month ago

Experimental composer Holly Herndon built an AI voice clone that anyone can use

Holly Herndon uses machine learning and AI models to create protocol art, where the creative act occurs in designing rule sets and datasets rather than in final media generation, making collective creativity visible.
Artificial intelligence
fromwww.bbc.com
3 weeks ago

Amazon's Alexa has had an AI upgrade. Now she's got more to say

Amazon is launching Alexa+, an AI-powered upgrade to Echo smart speakers in the UK that makes the digital assistant more conversational, proactive, and capable of following conversation threads.
Marketing tech
fromThe Drum
2 months ago

Getting the first word in voice search

Voice search usage is growing, creating brand opportunities while requiring optimisation for accuracy, shopping trust, and adaptation to screenless interactions.
Artificial intelligence
fromInfoWorld
1 month ago

How developers can bring voice AI into telephony applications

Voice AI agents require complex infrastructure beyond LLMs to integrate with legacy telephony systems, demanding flexible architecture designed for component switching and evolution.
Apple
fromThe Verge
2 months ago

Apple's second biggest acquisition ever is an AI company that listens to 'silent speech'

Apple acquired AI audio startup Q.ai to integrate imaging and audio machine-learning for nonverbal/micro-movement recognition, enabling whispered-speech interfaces across AirPods, Vision Pro, iPhone, and Macs.
Gadgets
fromSpyglass
2 months ago

"Hello, Computer."

AI-driven advances are creating an inflection point that may finally enable practical, mainstream voice computing after years of partial progress and false starts.
fromGSMArena.com
1 month ago

OpenAI is allegedly making a pricey smart speaker with a built-in camera

Ever since OpenAI acquired ex-Apple chief designer Jony Ive's design startup, rumors have been rampant about what kind of product the collaboration will yield. Clearly, it would have to be some sort of hardware piece - a first for OpenAI. At first, insiders hinted at an AI pen, then the clues shifted things towards something simpler - AI earbuds. Now insiders familiar with the matter claim that there are three
Mobile UX
#apple
Artificial intelligence
fromTechCrunch
1 month ago

Claude Code rolls out a voice mode capability | TechCrunch

Anthropic launches Voice Mode for Claude Code, enabling developers to interact with the AI coding assistant through spoken commands, starting with 5% of users.
Mobile UX
fromGSMArena.com
2 months ago

Amazon's Alexa+ AI chatbot is now available to everyone in the US, with a catch

Alexa+ is available to all US users; Prime members receive free unlimited access, non-Prime users can pay $19.99/month or use a limited free chat.
Artificial intelligence
fromPCMAG
1 month ago

Cut the BS: GPT-5.3 Model Promises to Fix ChatGPT's Preachy Tone

OpenAI released GPT-5.3 Instant to address ChatGPT's overly preachy tone by reducing moralizing preambles and unnecessary proclamations for more natural conversation.
Apple
fromTechRepublic
2 months ago

Apple Unveils Steps to Make Siri Sound Human - TechRepublic

A method reduces text-to-speech latency to make Siri and other voice-driven products sound more responsive while preserving intelligibility and accuracy.
Apple
fromComputerworld
2 months ago

Apple's Siri future is hybrid, integrated - and already here

Apple will power a much smarter Siri via a hybrid model using on-device processing and Google's Gemini through Private Cloud Compute while enforcing privacy controls.
Apple
fromTNW | Apple
2 months ago

Apple buys Q.ai to help devices read our faces

Apple bought Israeli startup Q.ai for nearly $2 billion to add silent-speech detection and micro-movement AI to future wearables and human-computer interaction.
Artificial intelligence
fromPsychology Today
1 month ago

An AI Voice Is Not a Mind

AI systems select and perform contextually appropriate personas rather than expressing unified selves with genuine beliefs, creating fluency that mimics mind without possessing interiority or conviction.
Artificial intelligence
fromWIRED
1 month ago

Huxe Will Give You a Personalized, Daily Audio Summary Powered by AI

Huxe is an AI-powered podcast app that generates personalized daily audio briefs from your email and calendar to streamline morning productivity routines.
Artificial intelligence
fromwww.aljazeera.com
1 month ago

ElevenLabs CEO says voice AI will change everything. Can it be controlled?

Voice AI technology enables beneficial applications like speech restoration and accessibility while simultaneously creating risks for fraud, disinformation, and unauthorized voice cloning that raise fundamental questions about voice ownership and control.
Artificial intelligence
fromTechzine Global
1 month ago

IBM integrates Deepgram speech AI into watsonx Orchestrate

IBM and Deepgram integrate advanced speech-to-text and text-to-speech capabilities into watsonx Orchestrate to enable organizations to build conversational AI agents and automate operations.
fromTechCrunch
2 months ago

ElevenLabs CEO: Voice is the next interface for AI | TechCrunch

ElevenLabs co-founder and CEO Mati Staniszewski says voice is becoming the next major interface for AI - the way people will increasingly interact with machines as models move beyond text and screens. Speaking at Web Summit in Doha, Staniszewski told TechCrunch voice models like those developed by ElevenLabs have recently moved beyond simply mimicking human speech - including emotion and intonation - to working in tandem with the reasoning capabilities of large language models.
Artificial intelligence
Artificial intelligence
fromTechCrunch
2 months ago

Google reportedly snags up team behind AI voice startup Hume AI | TechCrunch

Google DeepMind acquired Hume AI's CEO and key engineers to strengthen Gemini's voice capabilities while Hume continues licensing its voice-emotion technology to other firms.
Artificial intelligence
fromFortune
2 months ago

Hey Alexa-Amazon may be teaming up with OpenAI. Here's why that matters | Fortune

Meta is massively expanding its Hyperion AI data center while Amazon and OpenAI pursue a potential multibillion-dollar alliance to integrate OpenAI models into Alexa.
Artificial intelligence
fromTechCrunch
1 month ago

Cohere launches a family of open multilingual models | TechCrunch

Cohere launched Tiny Aya open-weight multilingual models supporting 70+ languages, runnable offline on everyday devices with a 3.35B-parameter base and regional variants.
Artificial intelligence
fromBusiness Matters
2 months ago

Free AI Dubbing Tool with Audiobook Support - Convert Text to Speech Instantly

AI audiobook generators and dubbing engines let anyone convert text or video into realistic, human-like audio quickly, affordably, and across languages.
Artificial intelligence
fromBusiness Matters
2 months ago

AI voice company ElevenLabs valued at $11bn after $500m funding round

ElevenLabs raised $500 million, valuing the company at $11 billion and accelerating expansion in AI voice, multilingual dubbing, music generation, and enterprise adoption.
fromFast Company
2 months ago

Are LTMs the next LLMs? This new type of AI can do what large-language models can't

A major difference between LLMs and LTMs is the type of data they're able to synthesize and use. LLMs use unstructured data-think text, social media posts, emails, etc. LTMs, on the other hand, can extract information or insights from structured data, which could be contained in tables, for instance. Since many enterprises rely on structured data, often contained in spreadsheets, to run their operations, LTMs could have an immediate use case for many organizations.
Artificial intelligence
fromRehumanize
1 month ago

Free AI Humanizer: Humanize AI Text & Bypass AI Detectors

AI Text Humanizer Protects Your Original Intent and Meaning Maintain your core perspective while restructuring sentence patterns. Humanizer ai accurately identifies and locks in technical terms, factual data, and key arguments, ensuring the rewritten draft is simply more readable without any semantic drift. You get a qualitative leap in flow and tone, allowing you to humanize ai text while keeping your original message perfectly intact.
Artificial intelligence
fromTechzine Global
1 month ago

Qwen3.5 aims to position Alibaba alongside GPT and Claude

Qwen3.5 is available via Hugging Face and is released under an open-source license. With this, Alibaba is explicitly targeting developers and research institutions that want to work with the model themselves. The system can process very long prompts, up to 260,000 tokens, and can be scaled further with additional optimizations. This makes it suitable for complex applications such as extensive document analysis and code generation.
Artificial intelligence
[ Load more ]