Skip to content

Download the narration.wav file by cloning this repo to your local computer. Once you have cloned the repo, either place the narration.wav file in a location where your code will be executed, or update the path in the code to match the location of the file on "SpeechToTextCsharp" project.

License

Notifications You must be signed in to change notification settings

PranamBhat/Convert-Speech-From-Audio-File-To-Text

 
 

Repository files navigation

AI-102-Process-Speech

A code repository for the Process and Translate Speech with Azure Cognitive Speech Services learning path. The content of this repository is related to the modules that form part of the learning paths and content for the Worldwide Learning AI-102 Azure AI Engineer track.

The repo contains folders, and code files where necessary, to support the speech-to-text, text-to-speech, and speech translation services that are part of the Speech SDK. The instructions and supporting content for the labs are found on the Microsoft Learn platform and linked to the AI Engineer role.

The content and labs are in various stages of development and release so you may not see all of the content available at the same time this repo is made public. Therefore, the folders and code files will not appear complete or functional, without the supporting content in place. Please do not log issues or pull requests unless the supporting content is live on the Microsoft Learn site.

Microsoft Learn Labs

The sample code in this repository is for use in hands-on exercises in Microsoft Learn modules.

Setup

The exercises are designed to be completed in Visual Studio Online. To complete the labs, you'll need the following:

  • A Microsoft Azure subscription. If you don't already have one, you can sign up for a free trial at https://azure.microsoft.com.
  • A Visual Studio Online environment. This provides a hosted instance of Visual Studio Code, in which you'll be able to run the notebooks for the lab exercises. To set up this environment:
    1. Browse to https://online.visualstudio.com
    2. Click Get Started.
    3. Sign in using the Microsoft account associated with your Azure subscription.
    4. Click Create environment. If you don't already have a Visual Studio Online plan, create one. This is used to track resource utilization by your Visual Studio Online environments. Then create an environment with the following settings:
    5. Wait for the environment to be created. This will open a browser-based instance of Visual Studio Code.
    6. Wait for a minute or so while the environment is set up for you. It might look like nothing is happening, but in the background we are installing some extensions that you will use in the labs. You'll see the following things happen:
      • The files in this repo will appear in the pane on the left.
      • After a few minutes (during which there's no apparent activity, but in the background we're setting up the environment for you), a new file named REFRESH NOW will appear in the pane on the left. This is your indication that everything has been installed.
    7. After the REFRESH NOW file has appeared and the color scheme has changed, refresh the web page to ensure all of the extensions are loaded and you're ready to start.
    8. Note the .ipynb files in the Explorer pane - these contain the lab exercises.

Tip: you can change the color scheme in Visual Studio Online if you prefer - just click the icon at the bottom left and select a new Color Theme.

Contributing

At this time, we are not accepting contributions to this repository. If you encounter an issue with the exercises, please report it.

About

Download the narration.wav file by cloning this repo to your local computer. Once you have cloned the repo, either place the narration.wav file in a location where your code will be executed, or update the path in the code to match the location of the file on "SpeechToTextCsharp" project.

Topics

Resources

License

Code of conduct

Security policy

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 82.4%
  • Shell 17.6%