top of page

Machine Learning

Linear Regression Theory:

Linear regression is a supervised machine learning algorithm used for predicting a continuous dependent variable based on one or more independent variables. It assumes a linear relationship between the input variables and the output variable and tries to fit a line that best describes this relationship. The line can be represented mathematically as an equation of the form:


                                               y = b0 + b1 * x1 + b2 * x2 + ... + bn * xn

  • Where y is the dependent variable, x1, x2, ..., xn are the independent variables, b0 is the intercept, and b1, b2, ..., bn are the coefficients. The coefficients are determined through the process of training the model on a labeled dataset.

The objective of training a linear regression model is to find the coefficients that minimize the difference between the actual output values and the predicted values. This difference is often measured using the mean squared error (MSE) or the mean absolute error (MAE).

Linear regression can be used for simple linear regression, where there is only one independent variable, or for multiple linear regression, where there are multiple independent variables. It is a widely used algorithm for its simplicity and interpretability, and is also a good starting point for more complex regression models.

Here are some examples of when linear regression might be used:

  1. Predicting housing prices: Given data on housing prices and various factors such as size, location, age, etc., a linear regression model can be trained to predict the sale price of a house based on these factors.

  2. Forecasting stock prices: Given historical stock prices and various economic indicators, a linear regression model can be used to predict future stock prices.

  3. Predicting exam scores: Given data on students' previous exam scores and the number of hours they studied, a linear regression model can be used to predict a student's future exam score based on the number of hours they study.

  4. Predicting weight based on height: Given data on a person's height and weight, a linear regression model can be used to predict a person's weight based on their height.


These are just a few examples, but linear regression can be applied to many other domains where a continuous dependent variable needs to be predicted based on one or more independent variables.

Here's an example implementation of linear regression in Python using the scikit-learn library:

import numpy as np
from sklearn.linear_model import LinearRegression


# Training data
X = np.array([[1], [2], [3], [4], [5]])
y = np.array([1, 2, 3, 4, 5])


# Create a linear regression object
reg = LinearRegression().fit(X, y)


# Predict using the model
pred = reg.predict([[6]])


bottom of page