java-ann-mlp

This package provides a Java implementation of a multi-layer perceptron (MLP) neural network with the back-propagation learning algorithm.


Features

  • Regression + Classification
  • Stochastic / Mini-batch / Steepest-descent weight update approaches
  • Back-propagation / Resilient Back-propagation (rprop) / rmsprop
  • Adaptive learning rate for individual weights
  • Weight limiting via weight penalties (L2-regularization) / weight constraints
  • Both numerical and categorical inputs

Install

Add the following dependency to your POM file:

<dependency>
  <groupId>com.github.chen0040</groupId>
  <artifactId>java-ann-mlp</artifactId>
  <version>1.0.6</version>
</dependency>
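
If you use Gradle rather than Maven, the equivalent dependency declaration is:

implementation 'com.github.chen0040:java-ann-mlp:1.0.6'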

Usage

MLP Regression

An MLP regression model is trained by calling:

mlpRegression.fit(trainingData);

Prediction with the MLP regression model is done by calling:

double predicted = mlpRegression.transform(dataRow);

The sample code below shows how to use MLP regression to predict the (-1, +1) numerical output of the heart-scale sample from libsvm.

The training data is loaded from a data frame connected to the "heart_scale" libsvm file (please refer to here for more examples of how to create a data frame).

InputStream inputStream = new FileInputStream("heart_scale");

DataFrame dataFrame = DataQuery.libsvm().from(inputStream).build();

System.out.println(dataFrame.head(10));

TupleTwo<DataFrame, DataFrame> miniFrames = dataFrame.shuffle().split(0.9);

DataFrame trainingData = miniFrames._1();
DataFrame crossValidationData = miniFrames._2();

MLPRegression regression = new MLPRegression();
regression.setHiddenLayers(8);
regression.setEpoches(1000);
regression.fit(trainingData);

for(int i = 0; i < crossValidationData.rowCount(); ++i){
 double predicted = regression.transform(crossValidationData.row(i));
 double actual = crossValidationData.row(i).target();
 logger.info("predicted: {}\texpected: {}", predicted, actual);
}
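
To summarize the fit with a single number, the same loop can accumulate the squared error; a minimal sketch using only the calls shown above:

double sumSquaredError = 0;
for(int i = 0; i < crossValidationData.rowCount(); ++i){
 double predicted = regression.transform(crossValidationData.row(i));
 double error = predicted - crossValidationData.row(i).target();
 sumSquaredError += error * error;
}
logger.info("MSE: {}", sumSquaredError / crossValidationData.rowCount());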

MLP Multi-Class Classifier

An MLP classifier is trained by calling:

mlpClassifier.fit(trainingData);

Classification with the MLP is done by calling:

String predicted = mlpClassifier.classify(dataRow);

The sample code below shows how to use the MLP classifier to predict the labels of the heart-scale sample from libsvm.

The training data is loaded from a data frame connected to the "heart_scale" libsvm file (please refer to here for more examples of how to create a data frame).

As the heart-scale data frame has a (-1, +1) numerical output, the code first converts the (-1, +1) values into a categorical output label named "category-label".

InputStream inputStream = new FileInputStream("heart_scale");

DataFrame dataFrame = DataQuery.libsvm().from(inputStream).build();

dataFrame.unlock();
for(int i=0; i < dataFrame.rowCount(); ++i){
 DataRow row = dataFrame.row(i);
 row.setCategoricalTargetCell("category-label", "" + row.target());
}
dataFrame.lock();

MLPClassifier mlpClassifier = new MLPClassifier();
mlpClassifier.setHiddenLayers(6); // one hidden layer with 6 neurons; for two or more hidden layers, call mlpClassifier.setHiddenLayers(layer1NeuronCount, layer2NeuronCount, ...);
mlpClassifier.setEpoches(1000);
mlpClassifier.setLearningRate(0.2);
mlpClassifier.fit(dataFrame);

int correctnessCount = 0;
for(int i = 0; i < dataFrame.rowCount(); ++i){
 DataRow row = dataFrame.row(i);
 String predicted_label = mlpClassifier.classify(row);
 correctnessCount += (predicted_label.equals(row.categoricalTarget()) ? 1 : 0);

 if(i < 10) {
    System.out.println(row);
    System.out.println("predicted: " + predicted_label + "\tactual: " + row.categoricalTarget());
 }
}

System.out.println("Prediction Accuracy: "+(correctnessCount * 100 / dataFrame.rowCount()));

Configuration

How often to update weights

The weight update mode specifies how often the weights are updated relative to the number of training cases processed:

  • StochasticGradientDescend: the "online" version of stochastic gradient descent, which updates the weights after each training case.
  • MiniBatchGradientDescend: the "mini-batch" version, which updates the weights after each batch of training cases.
  • SteepestGradientDescend: the "full-batch" version, which updates the weights after a pass over all the training cases.

The "mini-batch" is considered to be the most appropriate for most of the deep net training.

For the "mini-batch" to work well on a multi-class classification problem, the batches should be "balanced" for classes (i.e., containing roughly the same number of different classes for each batch)

For small datasets (e.g., 10,000 cases) or bigger datasets without much redundancy, consider using (see the snippet after this list):

  1. full-batch steepest descent
  2. adaptive learning rates or resilient back-propagation
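
Using the configuration calls documented in the sections below, that combination looks like:

mlpClassifier.setWeightUpdateMode(WeightUpdateMode.SteepestGradientDescend); // full-batch updates
mlpClassifier.setLearningMethod(LearningMethod.ResilientBackPropagation);    // rprop
mlpClassifier.enableAdaptiveLearningRate(true);                              // per-weight learning rates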

For big, redundant datasets, consider using mini-batches:

  1. try gradient descent with momentum
  2. try rmsprop without momentum

By default, the weights are updated via online stochastic gradient descent. To change this to steepest gradient descent:

mlpClassifier.setWeightUpdateMode(WeightUpdateMode.SteepestGradientDescend);
regression.setWeightUpdateMode(WeightUpdateMode.SteepestGradientDescend);

To change it to mini-batch gradient descent:

mlpClassifier.setWeightUpdateMode(WeightUpdateMode.MiniBatchGradientDescend);
regression.setWeightUpdateMode(WeightUpdateMode.MiniBatchGradientDescend);

By default the mini-batch size is set to 50; this can be changed by calling:

mlpClassifier.setMiniBatchSize(51);
regression.setMiniBatchSize(51);

How much to update the weights

The default learning method is standard back-propagation. To change to another learning method such as rprop:

mlpClassifier.setLearningMethod(LearningMethod.ResilientBackPropagation);
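
As a minimal sketch of the idea behind rprop (illustrative only, not this library's internals): each weight keeps its own step size, which grows while the gradient keeps its sign and shrinks when it flips, and only the sign of the gradient, not its magnitude, drives the update.

// Illustrative rprop step for a single weight; returns the updated
// weight and its adapted step size.
static double[] rpropStep(double weight, double gradient, double prevGradient, double step) {
 if(gradient * prevGradient > 0) {
  step = Math.min(step * 1.2, 50.0);  // same sign as last step: grow the step size
 } else if(gradient * prevGradient < 0) {
  step = Math.max(step * 0.5, 1e-6);  // sign flipped: we overshot, shrink the step size
 }
 weight -= Math.signum(gradient) * step; // move downhill by the step size; magnitude of gradient is ignored
 return new double[]{ weight, step };
}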

Adaptive learning rate

The MLP can adapt the learning rate for each individual weight. This is disabled by default; to enable adaptive learning rates:

mlpClassifier.enableAdaptiveLearningRate(true);
regression.enableAdaptiveLearningRate(true);

To prevent the learning rate from increasing out of bounds, we can also set the maximum learning rate (default is 1):

mlpClassifier.setMaxLearningRate(1.1);
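
As a rough sketch of how per-weight adaptation typically works (illustrative only, not this library's internals): each weight carries a multiplicative gain on the global learning rate, which is increased additively while the gradient keeps its sign and decayed multiplicatively when it flips; the max learning rate caps the effective rate.

// Illustrative per-weight gain adaptation; the effective learning rate
// for the weight is globalLearningRate * gain.
static double adaptGain(double gain, double gradient, double prevGradient, double maxGain) {
 if(gradient * prevGradient > 0) {
  gain = Math.min(gain + 0.05, maxGain); // consistent gradient sign: raise the gain
 } else {
  gain *= 0.95;                          // sign flip: decay the gain
 }
 return gain;
}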

Limiting the size of the weights

The size of the weights in the network can be limited using either weight penalties or weight constraints.

A weight penalty can be applied via L2 regularization. The effect of the L2 weight cost is to prevent the network from using weights that it does not need. This can often improve generalization considerably, because it helps stop the network from fitting the sampling error. It also makes the model smoother, so that the output changes more slowly as the input changes.
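
In update-rule form, the L2 penalty simply adds a term proportional to the weight itself to the gradient, so every step also pulls the weight toward zero (a sketch of the standard rule, not this library's internals):

// Standard L2-regularized step: w <- w - eta * (dE/dw + lambda * w)
static double l2Update(double weight, double gradient, double learningRate, double lambda) {
 return weight - learningRate * (gradient + lambda * weight);
}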

To limit the size of the weights in the MLP via L2 regularization (0 by default):

mlpClassifier.setL2Penalty(0.1); 

An alternative approach is to use weight constraints to limit the size of the weights. A weight constraint works by enforcing a maximum allowed squared length on the weights of the network.
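
A sketch of the standard constraint step (illustrative only, not this library's internals): if the squared length of a weight vector exceeds the allowed maximum, rescale the whole vector back onto the constraint surface.

// Rescale a weight vector whose squared length exceeds the constraint.
static void applyWeightConstraint(double[] weights, double maxSquaredLength) {
 double squaredLength = 0;
 for(double w : weights) squaredLength += w * w;
 if(squaredLength > maxSquaredLength) {
  double scale = Math.sqrt(maxSquaredLength / squaredLength);
  for(int i = 0; i < weights.length; ++i) weights[i] *= scale; // back onto the constraint surface
 }
}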

To limit the size of the weights in the MLP via weight constraints (0 by default):

mlpClassifier.setWeightConstraint(80);
