#audio-processing

[ follow ]
fromZDNET
2 days ago

Why this Bose portable speaker is my top recommendation for most people

I also couldn't help but notice this speaker's lack of features. It allows two devices to connect simultaneously, has a three-band EQ, and includes a play/pause button. That's it.
Gadgets
fromSitePoint Forums | Web Development & Design Community
5 days ago

Developing speech to text transcription on python

import whisper import torch import sys def transcribe_audio(audio_path, language='el'): """ Transcribes the given audio file to text in the specified language using Whisper. Args: audio_path (str): Path to the audio file. language (str): Language code for transcription (default is 'el' for Greek). Returns: str: The transcribed text. """ device = 'cuda' if torch.cuda.is_available() else 'cpu' print(f'Using device: {device}') model = whisper.load_model('small', device=device) print(f'Transcribing {audio_path} in language: {language}...') result = model.transcribe(audio_path, language=language, task='transcribe') return result['text']
Python
fromHackernoon
6 months ago

Hosting Your Own AI with Two-Way Voice Chat Is Easier Than You Think! | HackerNoon

The integration of LLMs with voice capabilities provides a transformative mechanism for personalized customer interactions, enhancing engagement and satisfaction through dynamic, two-way voice communication.
JavaScript
[ Load more ]