A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Updated Jun 4, 2024 - Python
This API utilizes a pre-trained model for emotion recognition from audio files. It accepts audio files as input, processes them using the pre-trained model, and returns the predicted emotion along with the confidence score. The API leverages the FastAPI framework for easy development and deployment.
Classifying images of various animal species using a CNN
Official implementation of SpottingDiffusion: A CNN-based method of detecting AI-generated images.
Large-scale pretrained models for goal-directed dialog
A repository containing a dataset and a pre-trained Snips model for the Automotive Grade Linux NLU intent engine.
PyTorch implementation of "FullSubNet: A Full-Band and Sub-Band Fusion Model for Real-Time Single-Channel Speech Enhancement."
Using OpenCV with the pretrained YOLOv3 model for real-time object detection. (faster)
A simulation of controlling a 2D car with hand gestures. The project is inspired and heavily influenced by a tutorial from "cj-mills".
TensorFlow.js PWA for Diagnosing Pneumonia in Frontal Chest X-ray Images using Convolutional Neural Network and Pretrained Model 🧠
BLIP Image Captioning + GPT-2 Happy Model: Generate joyful responses to image captions using state-of-the-art NLP and computer vision. Pretrained models and data preprocessing included for seamless integration. Explore the intersection of deep learning, sentiment analysis, and language generation.
Mining Discourse Markers for Unsupervised Sentence Representation Learning
Forest Fire Detection By Convolutional Neural Network
Code and released pre-trained model for our ACL 2022 paper: "DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response Generation"
Recognition of Quebec road signs using transfer learning with Python.
PyTorch implementation of a collection of scalable Video Transformer Benchmarks.
Official PyTorch implementation of ReXNet (Rank eXpansion Network) with pretrained models
Image Synthesis + Corgis = <3
A PyTorch implementation of the 'FaceNet' paper for training a facial recognition model with Triplet Loss using the glint360k dataset. A pre-trained model using Triplet Loss is available for download.
Real-time hand pose estimation and gesture classification using TensorRT
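The emotion recognition API listed above is described as returning a predicted emotion along with a confidence score. A minimal sketch of that final step, assuming the model emits one raw score per class, might look like the following. The label set and the `predict_emotion` helper are hypothetical and are not taken from the listed project, and the FastAPI wiring and audio preprocessing are left out.

```python
import math

# Hypothetical label set; the listed project does not state its emotion classes.
EMOTIONS = ["angry", "happy", "neutral", "sad"]


def predict_emotion(logits, labels=EMOTIONS):
    """Convert raw per-class scores into a predicted label and a confidence."""
    if len(logits) != len(labels):
        raise ValueError("expected one score per label")
    # Numerically stable softmax: subtract the maximum before exponentiating.
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    probs = [e / total for e in exps]
    # The class with the highest probability is the prediction.
    best = max(range(len(probs)), key=probs.__getitem__)
    return {"emotion": labels[best], "confidence": probs[best]}


# Assumed example scores standing in for a real model's output.
result = predict_emotion([0.2, 2.5, 0.1, -1.0])
print(result["emotion"], round(result["confidence"], 3))
```

In a service like the one described, a dictionary of this shape could be returned directly as the JSON response body.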