Python Nearest Neighbours Machine Learning Algorithm

A custom implementation of the K-Nearest Neighbors (KNN) algorithm from scratch using Python and NumPy, with comparison to scikit-learn's built-in implementation.

Overview

This project demonstrates a complete implementation of the 1-Nearest Neighbor algorithm without using any machine learning libraries for the core logic. The implementation includes custom distance calculation, prediction logic, and performance evaluation on Iris and Ionosphere datasets.

Results

Iris Dataset

Custom Implementation Accuracy: 97.37%
Scikit-learn Accuracy: 97.37%
Test Error Rate: 2.63%

Ionosphere Dataset

Custom Implementation Accuracy: 87.50%
Scikit-learn Accuracy: 87.50%
Test Error Rate: 12.50%

Installation

Clone the repository:

git clone https://github.com/sourabhmarne777/python-nearest-neighbours-machine-learning-algorithm.git
cd python-nearest-neighbours-machine-learning-algorithm

Install required packages:

pip install -r requirements.txt

Launch Jupyter Notebook:

jupyter notebook "Nearest Neighbour Algorithm.ipynb"

Usage

Open the Jupyter notebook and run all cells to see the complete implementation. The notebook will load datasets, run the custom KNN implementation, compare results with scikit-learn, and display accuracy metrics.

Core Algorithm

The NNpredict function implements 1-Nearest Neighbor using Euclidean distance calculation to find the closest training sample for each test sample.

Dataset Information

Iris Dataset: 4 features, 3 classes, 150 samples
Ionosphere Dataset: 5 features (from 34), 2 classes, binary classification

License

This project is licensed under the MIT License - see the LICENSE file for details.

Author

Sourabh Marne

GitHub: @sourabhmarne777

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.gitignore		.gitignore
LICENSE		LICENSE
Nearest Neighbour Algorithm.ipynb		Nearest Neighbour Algorithm.ipynb
README.md		README.md
ionosphere.txt		ionosphere.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Python Nearest Neighbours Machine Learning Algorithm

Overview

Results

Iris Dataset

Ionosphere Dataset

Installation

Usage

Core Algorithm

Dataset Information

License

Author

About

Uh oh!

Releases

Packages

Uh oh!

Languages

License

sourabhmarne777/python-nearest-neighbours-machine-learning-algorithm

Folders and files

Latest commit

History

Repository files navigation

Python Nearest Neighbours Machine Learning Algorithm

Overview

Results

Iris Dataset

Ionosphere Dataset

Installation

Usage

Core Algorithm

Dataset Information

License

Author

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages