I am Anuj Diwan (अनुज दिवाण) , a Computer Science Ph.D. student at the University of Texas at Austin. My research interests broadly lie in Speech Recognition, Natural Language Processing, and Machine Learning.
Visit my website to know more!
Contact GitHub support about this user’s behavior. Learn more about reporting abuse.
Report abuseCode for 'Unit-based Speech-to-Speech Translation Without Parallel Data'
Code for 'Why is Winoground Hard? Investigating Failures in Visuolinguistic Compositionality', EMNLP 2022
Code for 'When to Use Efficient Self Attention? Profiling Text, Speech and Image Transformer Variants', ACL 2023
Course Project for Automatic Speech Recognition(CS 753), Autumn 2019, CSE, IIT Bombay. Implemented music translation from one instrument to another using two approaches, a WaveNet autoencoder heavi…
Cuda
Pytorch implementation of https://arxiv.org/pdf/1908.05838.pdf along with a user-friendly web demo and some new experiments.
Python