#audio-processing tag

Use TorchAudio to Prepare Audio Data for Deep Learning - Real Python

A waveform is the visual representation of sound as it travels through air over time, capturing the pressure changes in audio.

Music

fromZDNET

1 month ago

Why this Bose portable speaker is my top recommendation for most people

I also couldn't help but notice this speaker's lack of features. It allows two devices to connect simultaneously, has a three-band EQ, and includes a play/pause button. That's it.

Gadgets

fromSitePoint Forums | Web Development & Design Community

1 month ago

Developing speech to text transcription on python

import whisper import torch import sys def transcribe_audio(audio_path, language='el'): """ Transcribes the given audio file to text in the specified language using Whisper. Args: audio_path (str): Path to the audio file. language (str): Language code for transcription (default is 'el' for Greek). Returns: str: The transcribed text. """ device = 'cuda' if torch.cuda.is_available() else 'cpu' print(f'Using device: {device}') model = whisper.load_model('small', device=device) print(f'Transcribing {audio_path} in language: {language}...') result = model.transcribe(audio_path, language=language, task='transcribe') return result['text']

Python

fromHackernoon

7 months ago

Hosting Your Own AI with Two-Way Voice Chat Is Easier Than You Think! | HackerNoon

The integration of LLMs with voice capabilities provides a transformative mechanism for personalized customer interactions, enhancing engagement and satisfaction through dynamic, two-way voice communication.

JavaScript

#audio-processing#audio-processing

Use TorchAudio to Prepare Audio Data for Deep Learning - Real Python

Why this Bose portable speaker is my top recommendation for most people

Developing speech to text transcription on python

Hosting Your Own AI with Two-Way Voice Chat Is Easier Than You Think! | HackerNoon

#audio-processing
#audio-processing