Spectrogram to Audio Python

Performance Analysis of CNN-Based Spectrogram with Multiple Audio Feature Types for English Digit Recognition

Abstract: Audio feature selection and neural network architecture play crucial roles in speech recognition performance. This paper presents a comparative analysis of Artificial Neural Networks (ANNs) ...

Edex Live on MSN

Listening to the forest: An AI innovator’s mission to protect humans and wildlife

By Atharva Agrawal. Growing up in the Tiger Capital of India, Nagpur, a city surrounded by some of the country’s most eminent wildlife sanctuaries, including Pench National Park, Tadoba-Andhari, Kanha ...

IEEE

Wavefake-Based Audio Deepfake Detection Using Spectrograms and Convolutional Neural Networks

Abstract: The rapid advancement of audio deepfake technologies, which enable the synthesis of highly realistic speech, presents serious challenges to digital media integrity and public trust. In ...

GitHub

DCASE2025_TASK3_Stereo_PSELD_Mamba

This repo contains code for our proposed method for DCASE 2025 Task 3: stereo sound event localization and detection based on PSELDnet pretraining and BiMamba sequence modeling [1]. For more information, ...

GitHub

MalcolmStran/whisper-subtitle-translator

A complete video subtitle translation pipeline with a modern web interface that uses OpenAI Whisper for speech-to-text transcription and Google Translate for multi-language subtitle generation.
