AnmolArora15/Iris-DataSet

Introduction

The Iris flower data set, also known as Fisher's Iris data set, is a multivariate data set introduced by the British statistician and biologist Ronald Fisher in his 1936 paper "The use of multiple measurements in taxonomic problems" as an example of linear discriminant analysis.

It is sometimes called Anderson's Iris data set because Edgar Anderson collected the data to quantify the morphologic variation of Iris flowers of three related species. Two of the three species were collected in the Gaspé Peninsula "all from the same pasture, and picked on the same day and measured at the same time by the same person with the same apparatus". The data set consists of 50 samples from each of three species of Iris (Iris setosa, Iris virginica, and Iris versicolor). Four features were measured from each sample: the length and the width of the sepals and petals, in centimeters. Based on the combination of these four features, Fisher developed a linear discriminant model to distinguish the species from each other.
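As a rough illustration of the data set described above, the sketch below loads the 150 samples and fits a linear discriminant model to the four measurements. It assumes scikit-learn's bundled copy of the data (`load_iris`) rather than whatever file the notebook reads, and it uses scikit-learn's multi-class `LinearDiscriminantAnalysis`, which is related to, but not necessarily identical to, the discriminant Fisher derived in 1936.

```python
import numpy as np
from sklearn.datasets import load_iris
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis

# Assumption: scikit-learn's bundled Iris data stands in for the notebook's data file.
iris = load_iris()
X, y = iris.data, iris.target  # X: measurements in cm, y: species codes 0, 1, 2

print("Samples x features:", X.shape)
print("Features:", iris.feature_names)
print("Species:", list(iris.target_names))
print("Samples per species:", np.bincount(y).tolist())

# Fit a linear discriminant model on all four features.
lda = LinearDiscriminantAnalysis()
lda.fit(X, y)

# Accuracy on the training data itself, so this is an optimistic estimate.
train_accuracy = lda.score(X, y)
print(f"LDA training accuracy: {train_accuracy:.3f}")
```

Because the model is scored on the same samples it was fitted to, the printed accuracy only shows that the species are largely linearly separable; it is not a measure of generalization.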

Objective

In this notebook, I will build Logistic Regression, Decision Tree, and Random Forest classifiers to assign each sample to one of the three species: Iris setosa, Iris virginica, or Iris versicolor.

I have implemented the following steps:

1. Exploratory Data Analysis
2. Logistic Regression
3. Decision Tree
4. Random Forest
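The three modelling steps listed above might look roughly like the following. This is a minimal sketch, not the notebook's actual code: the 70/30 stratified split, the `random_state` value, and the hyperparameters are all assumptions chosen for illustration, and the data again comes from scikit-learn's `load_iris`. The exploratory analysis step is not covered here.

```python
from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier

# Assumption: scikit-learn's bundled Iris data stands in for the notebook's data file.
X, y = load_iris(return_X_y=True)

# Assumption: a 70/30 split, stratified so each species is equally represented.
X_train, X_test, y_train, y_test = train_test_split(
    X, y, test_size=0.3, stratify=y, random_state=42
)

# Illustrative settings; the notebook may use different hyperparameters.
models = {
    "Logistic Regression": LogisticRegression(max_iter=1000),
    "Decision Tree": DecisionTreeClassifier(random_state=42),
    "Random Forest": RandomForestClassifier(n_estimators=100, random_state=42),
}

# Fit each classifier on the training split and score it on the held-out split.
scores = {}
for name, model in models.items():
    model.fit(X_train, y_train)
    scores[name] = model.score(X_test, y_test)
    print(f"{name}: test accuracy = {scores[name]:.3f}")
```

With only 45 held-out samples, a single misclassification shifts the accuracy by about two percentage points, so cross-validation (for example `cross_val_score`) would give a steadier comparison between the three models.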

Suggestions are welcome.
