Skip to main content

Python Advanced- scatter plot with MatplotLib

Have you tried drawing your first pyplot? If not then please see my previous post about this.
Let's move ahead and learn about the matplotlib scatter plot

Before plotting a plot we need data to plot. So here we are taking an example of cars data in csv format which you can download here cars data 
If you face any issue while downloading the file, comment me your email so that I can share the same directly.

Since the cars data is in csv format we need to first read the data with the help of pandas library as below-

The data will be looked look as below-

Since data is too long we need solid information from the data to plot so run Correlation & then run Covariance to find the most correlated variable to MPG-



Final cleaning of the cars data-

See now we have info to plot then let's plot it as given below-

Looks very cool right. Let's plot another one -

Key Note-Boxplots are great when you have a numeric column that you want to compare across different categories. When you want to visualize two numeric columns, scatter plots are ideal.

Please try above plotting in your notebook and comment if you face any issue.

Comments

Popular posts from this blog

Machine Learning-Cross Validation & ROC curve

Another post starts with you beautiful people! Hope you enjoyed my previous post about improving your model performance by  confusion metrix . Today we will continue our performance improvement journey and will learn about Cross Validation (k-fold cross validation) & ROC in Machine Learning. A common practice in data science competitions is to iterate over various models to find a better performing model. However, it becomes difficult to distinguish whether this improvement in score is coming because we are capturing the relationship better or we are just over-fitting the data. To find the right answer of this question, we use cross validation technique. This method helps us to achieve more generalized relationships. What is Cross Validation? Cross Validation is a technique which involves reserving a particular sample of a data set on which we do not train the model. Later, we test the model on this sample before finalizing the model. Here are the steps involved in...

Relational Database in Python-CRUD operations

Another post starts with you beautiful people! Hope you have enjoyed and learn from my previous post about  Relational Database In Python-sqlite Previously, we used the Table object to reflect a table from an existing database , but what if you wanted to create a new table? We'd still use the Table object; however, we'd need to replace the autoload and autoload_with parameters with Column objects . The Column object takes a name, a SQLAlchemy type with an optional format, and optional keyword arguments for different constraints. After defining the table, we can create the table in the database by using the .create_all() method on metadata and supplying the engine as the only parameter. You can find more details about metadata here-  tell me more about metadata! Output:- Well done! When creating a table, it's important to carefully think about what data types each column should be. Constraints and Data Defaults- We're now going to practice creating a t...

My solution to HackerEarth's Identify the dance form challenge

Another post starts with you beautiful people! Today an interesting deep learning challenge is finished in  HackerEarth  and I got 91.17026 mAP score in the leader board. One drawback I see in HackerEarth is due to small dataset many participants manually prepare the submission files and show 100% score in the leader board. Many aspiring data scientists see this and become nervous. Even with getting score 75+, they become demotivated and leave their experiments in between the challenge. Also the winning approach is not disclosed after the challenge. With this post I will try to motivate my all aspiring data scientists and I will share my solution so that in their next challenge they can easily get 85+ score or even 92+ score :) Problem statement An event management company organized an evening of Indian classical dance performances to celebrate the rich, eloquent, and elegant art of dance. After the event, the company plans to create a micro site to promote and raise aw...