Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Semi and supervised machine learning #40

Open
wants to merge 1 commit into
base: master
Choose a base branch
from
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Jump to
Jump to file
Failed to load files.
Diff view
Diff view
18 changes: 14 additions & 4 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -17,9 +17,9 @@ In supervised learning we have a set of training data as an input and a set of l

### Regression

In regression problems we do real value predictions. Basically we try to draw a line/plane/n-dimensional plane along the training examples.
In regression problems we predict real/continuos values which are countably infinite. Basically we try to draw a line/plane/n-dimensional plane along the training examples.

_Usage examples: stock price forecast, sales analysis, dependency of any number, etc._
_Usage examples: stock price forecast, sales analysis, dependency of any number, House price, Market Sales, Heart rate, etc._

#### 🤖 Linear Regression

Expand All @@ -31,9 +31,14 @@ _Usage examples: stock price forecast, sales analysis, dependency of any number,

### Classification

In classification problems we split input examples by certain characteristic.
In classification problem the output variable (feature) is **categorical**, For example: Covid or No Covid, Spam or Not Spam.
There are many types of classification problems such as:
- Binary Classification - Binary classification refers to those classification tasks that have two class labels.
- Multi-Class Classification - Multi-class classification refers to those classification tasks that have more than two class labels.
- Multi-Label Classification - Multi-label classification refers to those classification tasks that have two or more class labels, where one or more class labels may be predicted for each example.
- Imbalanced Classification - Imbalanced Classification refers to classification tasks where one variable has more examples over other variables.

_Usage examples: spam-filters, language detection, finding similar documents, handwritten letters recognition, etc._
_Usage examples: spam-filters, language detection, finding similar documents, handwritten letters recognition, Face (Male/ Female) Binary Classification, Emotion detection (Negative/ Positive), etc._

#### 🤖 Logistic Regression

Expand Down Expand Up @@ -72,6 +77,11 @@ _Usage examples: intrusion detection, fraud detection, system health monitoring,
- ⚙️ [Code | Anomaly Detection using Gaussian Distribution](homemade/anomaly_detection/gaussian_anomaly_detection.py) - implementation example
- ▶️ [Demo | Anomaly Detection](https://nbviewer.jupyter.org/github/trekhleb/homemade-machine-learning/blob/master/notebooks/anomaly_detection/anomaly_detection_gaussian_demo.ipynb) - find anomalies in server operational parameters like `latency` and `threshold`

## Semi-Supervised Learning

Semi-Supervised learning the dataset consists of target variable which has 50% values and 50% is empty (NaN). The percentage might differ according to datasets.


## Neural Network (NN)

The neural network itself isn't an algorithm, but rather a framework for many different machine learning algorithms to work together and process complex data inputs.
Expand Down