Pandas Scatter Plot – DataFrame.plot.scatter()
Last Updated :
03 Apr, 2025
A Scatter plot is a type of data visualization technique that shows the relationship between two numerical variables. In Pandas, we can create a scatter plot using the DataFrame.plot.scatter() method. This method helps in visualizing how one variable correlates with another. Example:
Python
import pandas as pd
import matplotlib.pyplot as plt
data = {'Height': [150, 160, 170, 180, 190],
'Weight': [50, 65, 75, 85, 95]}
df = pd.DataFrame(data)
# Creating a scatter plot
df.plot.scatter(x='Height', y='Weight')
plt.show()
Output
Basic scatter plotExplanation: This scatter plot shows how Weight changes with Height. As height increases, weight also tends to increase, indicating a positive correlation
Syntax of DataFrame.plot.scatter()
DataFrame.plot.scatter(x, y, s=None, c=None, colormap=None, alpha=None, figsize=None, grid=False, **kwargs)
Parameters:
Parameter | Description |
---|
x (Required) | Column name to be used for x-axis values. |
---|
y (Required) | Column name to be used for y-axis values. |
---|
s (Optional) | Size of the markers (default is None). Can be a single value or an array. |
---|
c (Optional) | Color of the markers. Can be a column name, color string or an array. |
---|
colormap (Optional) | Colormap to use for coloring points. |
---|
alpha (Optional) | Transparency level of points (range: 0 to 1). |
---|
figsize (Optional) | Tuple (width, height) to define figure size. |
---|
grid (Optional) | Boolean (True or False) to display a grid. |
---|
**kwargs | Additional arguments passed to Matplotlib’s scatter() function. |
---|
Returns: It returns a Matplotlib AxesSubplot object with the scatter plot.
Examples of scatter plot
Example 1: In this example, we visualize Age distribution among individuals. The size of each point is determined by the Age and the color of all points is set to red.
Python
import pandas as pd
import matplotlib.pyplot as plt
data = {'Name': ['Dhanashri', 'Smita', 'Rutuja', 'Sunita', 'Poonam', 'Srushti'],
'Age': [20, 18, 27, 50, 12, 15]}
df = pd.DataFrame(data)
# scatter plot with size determined by age
df.plot.scatter(x='Name', y='Age', s=df['Age']*10, c='red')
plt.show()
Output
Customized scatter plotExplanation: A scatter plot where each person's name is plotted on the x-axis, and their age on the y-axis. The marker size is proportional to the age, making older individuals more prominent in the plot.
Example 2: In this example, we analyze how the population of different countries correlates with their CO₂ emissions. The size of the markers is determined by the country's population, making larger countries more prominent.
Python
import pandas as pd
import matplotlib.pyplot as plt
data = {'Country': ['USA', 'China', 'India', 'Germany', 'Brazil', 'Australia'],
'Population': [331, 1441, 1393, 83, 213, 26], # in millions
'CO2_Emissions': [5000, 12000, 2500, 800, 1300, 400]} # in megatonnes
df = pd.DataFrame(data)
# Creating scatter plot
df.plot.scatter(x='Population', y='CO2_Emissions', s=df['Population'] * 2, c='blue')
plt.xlabel("Population (in millions)")
plt.ylabel("CO₂ Emissions (megatonnes)")
plt.grid(True)
plt.show()
Output
Population vs. CO₂ EmissionsExplanation: A scatter plot showing the relationship between a country's population and its CO₂ emissions. Larger populations tend to have higher emissions, which is reflected in the marker size.
Example 3: In this example, we analyze how years of experience affect salary while using job level to size the markers. The size of each marker is determined by the Job Level (higher job levels result in larger markers).
Python
import pandas as pd
import matplotlib.pyplot as plt
data = {'Experience': [1, 3, 5, 7, 10, 12, 15],
'Salary': [40000, 60000, 80000, 110000, 140000, 180000, 220000], # in $
'Job_Level': [1, 2, 3, 4, 5, 6, 7]} # Job level (higher = senior)
df = pd.DataFrame(data)
# Creating scatter plot
df.plot.scatter(x='Experience', y='Salary', s=df['Job_Level'] * 50, c='green')
plt.xlabel("Years of Experience")
plt.ylabel("Salary ($)")
plt.grid(True)
plt.show()
Output
Experience vs. Salary GrowthExplanation: A scatter plot where salary increases as experience grows. Higher job levels are represented with larger markers, making it easy to see how senior positions impact salary.
Similar Reads
Python Tutorial | Learn Python Programming Language Python Tutorial â Python is one of the most popular programming languages. Itâs simple to use, packed with features and supported by a wide range of libraries and frameworks. Its clean syntax makes it beginner-friendly.Python is:A high-level language, used in web development, data science, automatio
10 min read
Python Interview Questions and Answers Python is the most used language in top companies such as Intel, IBM, NASA, Pixar, Netflix, Facebook, JP Morgan Chase, Spotify and many more because of its simplicity and powerful libraries. To crack their Online Assessment and Interview Rounds as a Python developer, we need to master important Pyth
15+ min read
Non-linear Components In electrical circuits, Non-linear Components are electronic devices that need an external power source to operate actively. Non-Linear Components are those that are changed with respect to the voltage and current. Elements that do not follow ohm's law are called Non-linear Components. Non-linear Co
11 min read
Python OOPs Concepts Object Oriented Programming is a fundamental concept in Python, empowering developers to build modular, maintainable, and scalable applications. By understanding the core OOP principles (classes, objects, inheritance, encapsulation, polymorphism, and abstraction), programmers can leverage the full p
11 min read
Python Projects - Beginner to Advanced Python is one of the most popular programming languages due to its simplicity, versatility, and supportive community. Whether youâre a beginner eager to learn the basics or an experienced programmer looking to challenge your skills, there are countless Python projects to help you grow.Hereâs a list
10 min read
Python Exercise with Practice Questions and Solutions Python Exercise for Beginner: Practice makes perfect in everything, and this is especially true when learning Python. If you're a beginner, regularly practicing Python exercises will build your confidence and sharpen your skills. To help you improve, try these Python exercises with solutions to test
9 min read
Python Programs Practice with Python program examples is always a good choice to scale up your logical understanding and programming skills and this article will provide you with the best sets of Python code examples.The below Python section contains a wide collection of Python programming examples. These Python co
11 min read
Spring Boot Tutorial Spring Boot is a Java framework that makes it easier to create and run Java applications. It simplifies the configuration and setup process, allowing developers to focus more on writing code for their applications. This Spring Boot Tutorial is a comprehensive guide that covers both basic and advance
10 min read
Class Diagram | Unified Modeling Language (UML) A UML class diagram is a visual tool that represents the structure of a system by showing its classes, attributes, methods, and the relationships between them. It helps everyone involved in a projectâlike developers and designersâunderstand how the system is organized and how its components interact
12 min read
Enumerate() in Python enumerate() function adds a counter to each item in a list or other iterable. It turns the iterable into something we can loop through, where each item comes with its number (starting from 0 by default). We can also turn it into a list of (number, item) pairs using list().Let's look at a simple exam
3 min read