Skip to content

(DEPRICATED)Create Scheduler using Data Factory v2 to run (DEPRICATED) Azure Machine Learning Experimentation

License

Notifications You must be signed in to change notification settings

dciborow/AzureML-Scheduler-on-ADF

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

34 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Schedule ML Experimention on Data Factory v2

This sample shows how to leverage ADF to run an experiment.

Prerequisites

Create Azure Machine Learning Preview accounts and install Azure Machine Learning Workbench using these instructions.

Create Compute Targets

First, create compute targets for running the parameter sweep: DSVM and optionally, HDInsight Spark Cluster. Select File, Open Command Prompt and enter following commands to create the compute targets.

Create a Ubuntu baesd DSVM using these instructions. Attach it as compute target, and then prepare it by using:

$ az ml computetarget attach --name <dsvm> --address <dsvm-ip> --username <sshusername> --password <sshpwd> --type remotedocker
$ az ml experiment prepare -c <dsvm>

OR

Create HDInsight Spark Cluster using these instructions. Attach it as compute target, and then prepare it by using:

$ az ml computetarget attach --name <myhdi> --address <myhdi-ssh.azurehdinsight.net> --username <sshusername> --password <sshpwd> --type cluster
$ az ml experiment prepare -c <myhdi>

Run the Model from Local Machine

DSVM running a pyspark job.

$ az ml experiment submit -c <dsvm> <pyspark.py>

HDInsight Spark running a pyspark job.

$ az ml experiment submit -c <myhdi> <pyspark.py>

Deploy ADFv2 and other required resources

The creation process creates a new service princple account when run locally. This new account must be used to login to the az ml client from within the workbench if the AMLW secret store is used to store passwords.

  1. Download and place IaC folder within your AML Workspace project directory.
  2. Download and place batch task within your AML Workspace project directory.
  3. Select File, Open Powershell and enter the following command to create the deployment.
    1. Make VSTS Git Access Key
    2. DSVM Compute Target
    3. Path in AML Workbench project to code
.\IaC\CreateDeployment.ps1 `
    -gitPassword "<vstsPat>" `    
    -dsvm "<dsvm>" `
    -pythonPath "<pyspark.py>"

On Batch Windows DSVM Image

RDP into Node using username rdpuser, and provided password.

Update Azure PowerShell to >5.X

  1. microsoft web platform installer is on the desktop of DSVM, use this to update Powershell

Install AMLW - MSI must be run from admin cmd.

  1. Download AML Workbench from here, https://aka.ms/azureml-wb-msi
  2. Windows Key
  3. Type cmd
  4. Right click, select Run as Admin
  5. Change to D drive with d:
  6. cd User{username}\Downloads
  7. AmlWorkbenchSetup.msi
  8. Log into AML Workbench, this is required to create starter folders, remote experiment submission will fail if this is not complete.

Contributing

This project welcomes contributions and suggestions. Most contributions require you to agree to a Contributor License Agreement (CLA) declaring that you have the right to, and actually do, grant us the rights to use your contribution. For details, visit https://cla.microsoft.com.

When you submit a pull request, a CLA-bot will automatically determine whether you need to provide a CLA and decorate the PR appropriately (e.g., label, comment). Simply follow the instructions provided by the bot. You will only need to do this once across all repos using our CLA.

This project has adopted the Microsoft Open Source Code of Conduct. For more information see the Code of Conduct FAQ or contact opencode@microsoft.com with any additional questions or comments.

Disclaimer

©2017 Microsoft Corporation. All rights reserved. This information is provided "as-is" and may change without notice. Microsoft makes no warranties, express or implied, with respect to the information provided here.

About

(DEPRICATED)Create Scheduler using Data Factory v2 to run (DEPRICATED) Azure Machine Learning Experimentation

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published