Train a simple neural net model

Although neural networks are widely known for use in deep learning and for modeling complex problems such as image recognition, they are easily adapted to regression problems. Any class of statistical models can be termed a neural network if it uses adaptive weights and can approximate non-linear functions of its inputs. Neural network regression is therefore suited to problems where a more traditional regression model cannot fit a solution.

Neural network regression is a supervised learning method, and therefore requires a tagged dataset, which includes a label column. Because a regression model predicts a numerical value, the label column must be a numerical data type.
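
The lab itself needs no code, but the label requirement above can be illustrated outside the designer. The following is a minimal sketch, assuming the data is available as CSV text; the helper name `label_is_numeric` and the sample feature columns are hypothetical, and only the `totalAmount` label name comes from this lab.

```python
import csv
import io


def label_is_numeric(csv_text, label_column):
    """Return True if every value in label_column parses as a float."""
    reader = csv.DictReader(io.StringIO(csv_text))
    if label_column not in (reader.fieldnames or []):
        raise KeyError(f"label column {label_column!r} not found")
    for row in reader:
        try:
            float(row[label_column])
        except (TypeError, ValueError):
            # Missing or non-numeric label: not usable for regression as-is.
            return False
    return True


# Hypothetical sample rows; only the totalAmount name is taken from the lab.
good_sample = "passengerCount,tripDistance,totalAmount\n1,2.5,12.3\n2,0.8,6.0\n"
bad_sample = "passengerCount,tripDistance,totalAmount\n1,2.5,twelve\n"

print(label_is_numeric(good_sample, "totalAmount"))
print(label_is_numeric(bad_sample, "totalAmount"))
```

A check like this is only a sanity test on the label column; the designer performs its own validation when the pipeline runs.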

Lab Overview

In this lab we will be using a subset of NYC Taxi & Limousine Commission - green taxi trip records available from Azure Open Datasets. The data is enriched with holiday and weather data. Based on the enriched dataset, we will configure the prebuilt Neural Network Regression module to create a regression model using a customizable neural network algorithm. We will train the model by providing the model and the NYC taxi dataset as an input to Train Model. The trained model can then be used to predict NYC taxi fares. We will do all of this from the Azure Machine Learning designer without writing a single line of code.

Exercise 1: Register Dataset with Azure Machine Learning studio

Task 1: Upload Dataset

In Azure portal, open the available machine learning workspace.
Select Launch now under the Try the new Azure Machine Learning studio message.
When you first launch the studio, you may need to set the directory and subscription. If so, you will be prompted to choose them.

For the directory, select Udacity and for the subscription, select Azure Sponsorship. For the machine learning workspace, you may see multiple options listed. Select any of these (it doesn't matter which) and then click Get started.
From the studio, select Datasets, + Create dataset, From web files. This will open the Create dataset from web files dialog on the right.

In the Web URL field provide the following URL for the training data file:

https://introtomlsampledata.blob.core.windows.net/data/nyc-taxi/nyc-taxi-sample-data.csv

Provide nyc-taxi-sample-data as the Name, leave the remaining values at their defaults and select Next.

Task 2: Preview Dataset

On the Settings and preview panel, set the column headers drop down to All files have same headers.
Scroll the data preview to the right to observe the target column: totalAmount. After you are done reviewing the data, select Next.

Task 3: Select Columns

Select columns from the dataset to include as part of your training data. Leave the default selections and select Next.

Task 4: Create Dataset

Confirm the dataset details and select Create.

Exercise 2: Create New Training Pipeline

Task 1: Open Pipeline Authoring Editor

From the studio, select Designer, +. This will open a visual pipeline authoring editor.

Task 2: Setup Compute Target

In the settings panel on the right, select Select compute target.
In the Set up compute target editor, select the available compute, and then select Save.

Note: If you are facing difficulties in accessing pop-up windows or buttons in the user interface, please refer to the Help section in the lab environment.

Task 3: Add Dataset

Select the Datasets section in the left navigation. Next, select My Datasets, nyc-taxi-sample-data, and drag and drop the selected dataset onto the canvas.

Task 4: Split Dataset

Select the Data Transformation section in the left navigation. Follow the steps outlined below:
1. Select the Split Data prebuilt module
2. Drag and drop the selected module onto the canvas
3. Set Fraction of rows in the first output dataset to 0.7
4. Connect the Dataset to the Split Data module

Note that you can submit the pipeline at any point to peek at the outputs and activities. Running the pipeline also generates metadata that is available for downstream activities, such as selecting column names from a list in selection dialogs.
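
For readers curious about what the 0.7 fraction means, the split configured in this task can be sketched in a few lines of Python. This is an illustration only, assuming a randomized row split; the function name `split_rows` and the `seed` parameter are hypothetical and are not part of the Split Data module.

```python
import random


def split_rows(rows, fraction=0.7, seed=0):
    """Randomly split rows into two lists; the first holds `fraction` of them."""
    shuffled = list(rows)
    random.Random(seed).shuffle(shuffled)  # fixed seed keeps the split repeatable
    cut = int(round(len(shuffled) * fraction))
    return shuffled[:cut], shuffled[cut:]


rows = list(range(10))  # stand-in for ten dataset rows
train_rows, test_rows = split_rows(rows, fraction=0.7)
print(len(train_rows), len(test_rows))  # 7 3
```

The first list plays the role of the first output of Split Data and the second list the role of the second output.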

Task 5: Initialize Regression Model

Select the Machine Learning Algorithms section in the left navigation. Follow the steps outlined below:
1. Select the Neural Network Regression prebuilt module, in the Regression category
2. Drag and drop the selected module onto the canvas
3. Set Create trainer mode to Single Parameter. This option indicates how you want the model to be trained.
4. Set Hidden layer specification to Fully connected case
5. Set Learning rate to 0.01
Note: The number of nodes in the input layer is determined by the number of features in the training data. Because a regression model predicts a single numerical value, the output layer can have only one node.
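
To make the note above concrete, here is a minimal sketch of a fully connected network with one hidden layer and a single output node, trained by stochastic gradient descent with a learning rate of 0.01. It is not the module's implementation: the tanh activation, the four hidden nodes, the squared-error loss, the toy data, and all function names are assumptions chosen for illustration.

```python
import math
import random


def init_network(n_features, n_hidden, seed=0):
    """Input width follows the feature count; the output is a single node."""
    rng = random.Random(seed)
    return {
        "w1": [[rng.uniform(-0.5, 0.5) for _ in range(n_features)]
               for _ in range(n_hidden)],
        "b1": [0.0] * n_hidden,
        "w2": [rng.uniform(-0.5, 0.5) for _ in range(n_hidden)],
        "b2": 0.0,
    }


def forward(net, x):
    """Return the hidden activations and the single numeric prediction."""
    hidden = [math.tanh(sum(w * xi for w, xi in zip(row, x)) + b)
              for row, b in zip(net["w1"], net["b1"])]
    output = sum(w * h for w, h in zip(net["w2"], hidden)) + net["b2"]
    return hidden, output


def sgd_step(net, x, y, learning_rate=0.01):
    """One gradient step on the loss 0.5 * (prediction - y) ** 2."""
    hidden, output = forward(net, x)
    error = output - y
    for j, h in enumerate(hidden):
        # Gradient w.r.t. the hidden pre-activation, using w2 before its update.
        grad_hidden = error * net["w2"][j] * (1.0 - h * h)
        net["w2"][j] -= learning_rate * error * h
        for i, xi in enumerate(x):
            net["w1"][j][i] -= learning_rate * grad_hidden * xi
        net["b1"][j] -= learning_rate * grad_hidden
    net["b2"] -= learning_rate * error


def mean_loss(net, data):
    return sum(0.5 * (forward(net, x)[1] - y) ** 2 for x, y in data) / len(data)


# Toy data with two features; the target is a made-up linear function.
data = [((x0, x1), 2.0 * x0 - x1)
        for x0 in (0.0, 0.5, 1.0) for x1 in (0.0, 0.5, 1.0)]
net = init_network(n_features=2, n_hidden=4)

loss_before = mean_loss(net, data)
for _ in range(200):
    for x, y in data:
        sgd_step(net, x, y, learning_rate=0.01)
loss_after = mean_loss(net, data)
print(loss_after < loss_before)
```

Changing the number of features changes only the width of `w1`; the output stays a single number, which is the point the note makes.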

Task 6: Setup Train Model Module

Select the Model Training section in the left navigation. Follow the steps outlined below:
1. Select the Train Model prebuilt module
2. Drag and drop the selected module onto the canvas
3. Connect the Neural Network Regression module to the first input of the Train Model module
4. Connect the first output of the Split Data module to the second input of the Train Model module
5. Select the Edit column link to open the Label column editor
The Label column editor allows you to specify your label, or target, column. Type in the label column name totalAmount and then select Save.

Task 7: Setup Score Model Module

Select the Model Scoring & Evaluation section in the left navigation. Follow the steps outlined below:
1. Select the Score Model prebuilt module
2. Drag and drop the selected module onto the canvas
3. Connect the Train Model module to the first input of the Score Model module
4. Connect the second output of the Split Data module to the second input of the Score Model module

Note that the Split Data module feeds data to both model training and model scoring. The first output (0.7 fraction) connects to the Train Model module and the second output (0.3 fraction) connects to the Score Model module.

Task 8: Setup Evaluate Model Module

Select the Model Scoring & Evaluation section in the left navigation. Follow the steps outlined below:
1. Select the Evaluate Model prebuilt module
2. Drag and drop the selected module onto the canvas
3. Connect the Score Model module to the first input of the Evaluate Model module

Exercise 3: Submit Training Pipeline

Task 1: Create Experiment and Submit Pipeline

Select Submit to open the Setup pipeline run editor.

Please note that the button name in the UI has changed from Run to Submit.
In the Setup pipeline run editor, select Experiment, Create new and provide New experiment name: neural-network-regression, and then select Submit.
Wait for the pipeline run to complete. It will take around 8 minutes.
While you wait for the model training to complete, you can learn more about the training algorithm used in this lab by selecting the Neural Network Regression module.

Exercise 4: Visualize Training Results

Task 1: Visualize the Model Predictions

Select Score Model, Outputs, Visualize to open the Score Model result visualization dialog, or right-click the Score Model module and select Visualize Scored Dataset.
Observe the predicted values under the column Scored Labels. You can compare the predicted values (Scored Labels) with the actual values (totalAmount).

Task 2: Visualize the Evaluation Results

Select Evaluate Model, Outputs, Visualize to open the Evaluate Model result visualization dialog, or right-click the Evaluate Model module and select Visualize Evaluation Results.
Evaluate the model performance by reviewing the evaluation metrics, such as Mean Absolute Error and Root Mean Squared Error.
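
As a reference for reading those two metrics, the sketch below computes them from actual and predicted values using their standard definitions. The function names and the sample numbers are hypothetical; they stand in for the totalAmount and Scored Labels columns and are not output from this lab.

```python
import math


def mean_absolute_error(actual, predicted):
    """Average absolute difference between actual and predicted values."""
    return sum(abs(a - p) for a, p in zip(actual, predicted)) / len(actual)


def root_mean_squared_error(actual, predicted):
    """Square root of the average squared difference."""
    squared = sum((a - p) ** 2 for a, p in zip(actual, predicted))
    return math.sqrt(squared / len(actual))


actual = [10.0, 12.5, 8.0, 20.0]     # hypothetical totalAmount values
predicted = [11.0, 12.0, 9.5, 18.0]  # hypothetical Scored Labels values

mae = mean_absolute_error(actual, predicted)
rmse = root_mean_squared_error(actual, predicted)
print(mae)             # 1.25
print(round(rmse, 4))  # 1.3693
```

Both metrics are in the same units as the label, so here they read as fare amounts. Root Mean Squared Error penalizes large individual errors more heavily than Mean Absolute Error, which is why it is the larger of the two in this example.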

Next Steps

Congratulations! You have trained a simple neural net model using the prebuilt Neural Network Regression module in the AML visual designer. You can continue to experiment in the environment but are free to close the lab environment tab and return to the Udacity portal to continue with the lesson.
