#speech-recognition

[ follow ]
#natural-language-processing
Artificial intelligence
fromTechCrunch
3 months ago

MLCommons and Hugging Face team up to release massive speech data set for AI research | TechCrunch

MLCommons and Hugging Face released a large public domain voice recording dataset for AI research, promoting global speech technology development.
fromHackernoon
8 months ago
Artificial intelligence

Datasets and Evaluation Define the Robustness of Speech Language Models | HackerNoon

The article discusses the methods and datasets used for training and evaluating speech-language models (SLMs) against adversarial attacks.
fromHackernoon
8 months ago
Data science

AccentFold: Enhancing Accent Recognition - AccentFold | HackerNoon

AccentFold enhances speech recognition for diverse African accents, improving model accuracy for various dialects.
fromHackernoon
1 month ago
Scala

Why Our Tiny Training Set Beat Giants in Cross-Lingual Speech Retrieval | HackerNoon

The proposed DE model excels at speech-to-text (S2T) retrieval, outperforming existing models despite limited training data.
Artificial intelligence
fromTechCrunch
3 months ago

MLCommons and Hugging Face team up to release massive speech data set for AI research | TechCrunch

MLCommons and Hugging Face released a large public domain voice recording dataset for AI research, promoting global speech technology development.
fromHackernoon
8 months ago
Artificial intelligence

Datasets and Evaluation Define the Robustness of Speech Language Models | HackerNoon

The article discusses the methods and datasets used for training and evaluating speech-language models (SLMs) against adversarial attacks.
fromHackernoon
8 months ago
Data science

AccentFold: Enhancing Accent Recognition - AccentFold | HackerNoon

AccentFold enhances speech recognition for diverse African accents, improving model accuracy for various dialects.
fromHackernoon
1 month ago
Scala

Why Our Tiny Training Set Beat Giants in Cross-Lingual Speech Retrieval | HackerNoon

The proposed DE model excels at speech-to-text (S2T) retrieval, outperforming existing models despite limited training data.
more#natural-language-processing
#openai
fromHackernoon
3 years ago
JavaScript

Building a Voice Transcription and Translation App with OpenAI Whisper and Streamlit | HackerNoon

Using Streamlit and OpenAI's Whisper, users can easily record and transcribe speech to text, enhancing interactive web app functionalities.
fromHackernoon
3 years ago
JavaScript

Building a Voice Transcription and Translation App with OpenAI Whisper and Streamlit | HackerNoon

Using Streamlit and OpenAI's Whisper, users can easily record and transcribe speech to text, enhancing interactive web app functionalities.
more#openai
Apple
fromwww.mercurynews.com
2 months ago

Apple to fix iPhone dictation glitch that suggests replacing the word racist' with Trump'

Apple is fixing a dictation bug that suggests 'Trump' when words with 'R' consonants are spoken.
#machine-learning
fromHackernoon
8 months ago
Artificial intelligence

SpeechVerse Unites Audio Encoder and LLM for Superior Spoken QA | HackerNoon

The SpeechVerse architecture combines an audio encoder with language models to enhance audio input processing.
fromHackernoon
3 months ago
Miscellaneous

Ablation Study Reveals the Role of Semantic & Acoustic Prompts in SEAMLESSEXPRESSIVELM's Performance | HackerNoon

Chain-of-thought prompting enhances model performance by improving semantic preservation during translation.
fromHackernoon
8 months ago
Artificial intelligence

SpeechVerse Unites Audio Encoder and LLM for Superior Spoken QA | HackerNoon

The SpeechVerse architecture combines an audio encoder with language models to enhance audio input processing.
fromHackernoon
3 months ago
Miscellaneous

Ablation Study Reveals the Role of Semantic & Acoustic Prompts in SEAMLESSEXPRESSIVELM's Performance | HackerNoon

Chain-of-thought prompting enhances model performance by improving semantic preservation during translation.
more#machine-learning
#language-learning
fromHackernoon
8 months ago
JavaScript

How to Create a Pronunciation Assessment App (Part 1) | HackerNoon

The tutorial focuses on creating a pronunciation app for German using JavaScript and APIs.
fromZDNET
7 months ago
Online learning

Learn a new language with Babbel, now 69% off

Babbel simplifies language learning with short lessons and a focus on conversation, making it feasible for busy individuals.
fromZDNET
6 months ago
Online learning

Save 69% on a Babbel subscription to learn a new language. Here's how

Babbel offers an accessible and effective way to learn a language through short lessons and practical conversation skills.
fromHackernoon
8 months ago
JavaScript

How to Create a Pronunciation Assessment App (Part 1) | HackerNoon

The tutorial focuses on creating a pronunciation app for German using JavaScript and APIs.
fromZDNET
7 months ago
Online learning

Learn a new language with Babbel, now 69% off

Babbel simplifies language learning with short lessons and a focus on conversation, making it feasible for busy individuals.
fromZDNET
6 months ago
Online learning

Save 69% on a Babbel subscription to learn a new language. Here's how

Babbel offers an accessible and effective way to learn a language through short lessons and practical conversation skills.
more#language-learning
JavaScript
fromCodeProject
8 months ago

Complete Voice Interaction with ChatGPT

The project effectively combines speech recognition and TTS to facilitate uninterrupted interaction with ChatGPT, enhancing user experience.
fromHackernoon
8 months ago
Artificial intelligence

AccentFold: Enhancing Accent Recognition - Conclusion, Limitations, and References | HackerNoon

AccentFold enhances speech recognition for African accented speech by utilizing accent embeddings based on linguistic relationships, showing a 3.5% WER improvement.
[ Load more ]