We perform EDA on a dataset of data science books on Amazon: names, prices, ratings, weight, etc. This was more of a for-fun project that answers one question: what are the best books for data scientists on Amazon? As someone who prefers books to courses (yes, it's odd, I know), I knew I had to try this dataset out when I came across it on Kaggle.
OK, I may have answered more than one question; I couldn't resist.
- How much do books cost (the effect of size on price)?
- What are the best books for Python?
- What are the best ML books?
From the scatter plot, we can see that books with more pages tend to have higher prices, but the correlation matrix lets us put a number on that relationship.
There is a moderate positive correlation between number of pages and book price (0.43), i.e., the more pages a book has, the more likely it is to cost more.
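For readers who want to see what a correlation coefficient like this measures, here is a minimal sketch of the Pearson correlation in plain Python. The `pearson` helper and the page/price numbers are made up for illustration; they are not from the Kaggle dataset, and the original analysis most likely used a library call (for example pandas' `DataFrame.corr()`) rather than hand-written code.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two equal-length sequences with at least 2 values")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    # Sum of products of deviations from the mean (unnormalized covariance).
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    # Sums of squared deviations (unnormalized variances).
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Made-up (pages, price) values for illustration only; NOT the real dataset.
pages = [200, 350, 500, 650, 800]
prices = [25.0, 30.0, 28.0, 45.0, 40.0]

r = pearson(pages, prices)
print(f"correlation between pages and price: {r:.2f}")
```

A value near 1 means price rises almost in lockstep with page count, a value near 0 means no linear relationship, and a value like 0.43 sits in between: a visible trend with plenty of exceptions.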
To assess which books are best, we use the average rating and the number of reviews.
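One possible way to combine the two signals is sketched below: drop books with very few reviews, then sort by rating and break ties by review count. This is an assumption about the ranking, not necessarily the exact method used here. The `rank_books` helper, the `min_reviews` threshold, the field names, and the rows are all hypothetical.

```python
# Hypothetical rows for illustration; titles, ratings, and counts are invented.
books = [
    {"title": "Book A", "avg_rating": 4.8, "n_reviews": 12},
    {"title": "Book B", "avg_rating": 4.7, "n_reviews": 7000},
    {"title": "Book C", "avg_rating": 4.6, "n_reviews": 1000},
    {"title": "Book D", "avg_rating": 4.7, "n_reviews": 300},
]


def rank_books(rows, min_reviews=100):
    """Rank books by average rating, breaking ties by number of reviews.

    Books with fewer than `min_reviews` reviews are excluded, since a high
    rating backed by only a handful of reviews is not very reliable.
    The threshold of 100 is an arbitrary assumption.
    """
    eligible = [row for row in rows if row["n_reviews"] >= min_reviews]
    return sorted(
        eligible,
        key=lambda row: (row["avg_rating"], row["n_reviews"]),
        reverse=True,
    )


ranked = rank_books(books)
top_titles = [row["title"] for row in ranked]
print(top_titles)
```

The review threshold is what keeps a book with a high rating but only a dozen reviews from outranking one with thousands of reviews.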
The best book for learning Python on Amazon is Python Crash Course, 2nd Edition: A Hands-On, Project-Based Introduction to Programming, with a 4.7-star rating across 7,000+ reviews.
An honorable mention (this is my personal opinion) would be Python for Data Analysis: Data Wrangling with Pandas, NumPy, and IPython. It's great for data analysts and is the 8th best book, with a 4.6-star rating and over 1,000 reviews.
The best book for ML is Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow. If you learn from books, you have probably heard of this one. It has a 4.8-star rating with 3,000+ reviews.
The most expensive data science book listed on Amazon is The Basics of Data Literacy: Helping Your Students (And You!) Make Sense of Data - PB343X, at $287. It was published by the National Science Teachers Association - NSTA Press (November 15, 2013). It has a 4.6-star rating but just 6 reviews, so I can't take the rating too seriously.