How to Make Boxplot with Jittered Data Points using Altair in Python? Last Updated : 23 Jul, 2025 Comments Improve Suggest changes Like Article Like Report In this article, we will determine the way to form a boxplot with data points using Altair. The latest version of Altair supports making simple boxplots. However, the current version of Altair doesn't support adding jittered data points on top of boxplots. Because of Justin Bois from Caltech, we will use his data visualization utility package Altair-catplot, we will make boxplots with jittered data points. Altair : Altair is a statistical visualization library in Python. It is declarative in nature and is based on Vega and Vega-Lite visualization grammars. It is fast becoming the first choice of people looking for a quick and efficient way to visualize datasets. If you have used imperative visualization libraries like matplotlib, you will be able to rightly appreciate the capabilities of Altair.Catplot : Catplot is a relatively new addition to Seaborn that simplifies plotting that involves categorical variables. In Seaborn version v0. 9.0 that came out in July 2018, changed the older factor plot to catplot to make it more consistent with terminology in pandas and in seaborn.Boxplot : We can create a box plot as follows. Note that the mark is a string specifying a box plot (as will be in the future with Altair), and the encoding is specified as a dictionary of key-value pairs.Steps Required Import LibrariesImport or create dataUse altair_catplot.catplot() method with jitterbox transform.Modify the values of different attributes for better visualization (optional). Example 1: Python3 # importing packages import altair import altair_catplot import seaborn # load data tip = seaborn.load_dataset('tips') # draw a plot altair_catplot.catplot(tip, transform ='jitterbox', mark ='point', encoding = dict(x = altair.X('time:N', title = None), y = altair.Y('total_bill:Q', scale = altair.Scale(zero = False)), color = altair.Color('time:N', legend = None)) ) Output: Example 2: Python3 # importing packages import altair import altair_catplot import seaborn # load data tip = seaborn.load_dataset('tips') # draw a plot altair_catplot.catplot(tip, transform ='jitterbox', mark ='square', encoding = dict(x = altair.X('day:N', title = None), y = altair.Y('total_bill:Q', scale = altair.Scale(zero = False)), color = altair.Color('day:N', legend = None)) ) Output: Example 3: Python3 # importing packages import altair import altair_catplot import seaborn # load data iris = seaborn.load_dataset('iris') # draw a plot altair_catplot.catplot(iris, transform ='jitterbox', mark ='circle', encoding = dict(x = altair.X('species:N', title = None), y = altair.Y('sepal_length:Q', scale = altair.Scale(zero = False)), color = altair.Color('species:N', legend = None)) ) Output: Comment More infoAdvertise with us Next Article What is Data Visualization and Why is It Important? D deepanshu_rustagi Follow Improve Article Tags : Data Visualization AI-ML-DS Python-Altair AI-ML-DS With Python Python Data Visualization +1 More Similar Reads Python - Data visualization tutorial Data visualization is a crucial aspect of data analysis, helping to transform analyzed data into meaningful insights through graphical representations. This comprehensive tutorial will guide you through the fundamentals of data visualization using Python. We'll explore various libraries, including M 7 min read What is Data Visualization and Why is It Important? Data visualization uses charts, graphs and maps to present information clearly and simply. It turns complex data into visuals that are easy to understand.With large amounts of data in every industry, visualization helps spot patterns and trends quickly, leading to faster and smarter decisions.Common 4 min read Data Visualization using Matplotlib in Python Matplotlib is a widely-used Python library used for creating static, animated and interactive data visualizations. It is built on the top of NumPy and it can easily handles large datasets for creating various types of plots such as line charts, bar charts, scatter plots, etc. Visualizing Data with P 11 min read Data Visualization with Seaborn - Python Seaborn is a popular Python library for creating attractive statistical visualizations. Built on Matplotlib and integrated with Pandas, it simplifies complex plots like line charts, heatmaps and violin plots with minimal code.Creating Plots with SeabornSeaborn makes it easy to create clear and infor 9 min read Data Visualization with Pandas Pandas is a powerful open-source data analysis and manipulation library for Python. The library is particularly well-suited for handling labeled data such as tables with rows and columns. Pandas allows to create various graphs directly from your data using built-in functions. This tutorial covers Pa 6 min read Plotly for Data Visualization in Python Plotly is an open-source Python library designed to create interactive, visually appealing charts and graphs. It helps users to explore data through features like zooming, additional details and clicking for deeper insights. It handles the interactivity with JavaScript behind the scenes so that we c 12 min read Data Visualization using Plotnine and ggplot2 in Python Plotnine is a Python data visualization library built on the principles of the Grammar of Graphics, the same philosophy that powers ggplot2 in R. It allows users to create complex plots by layering components such as data, aesthetics and geometric objects.Installing Plotnine in PythonThe plotnine is 6 min read Introduction to Altair in Python Altair is a declarative statistical visualization library in Python, designed to make it easy to create clear and informative graphics with minimal code. Built on top of Vega-Lite, Altair focuses on simplicity, readability and efficiency, making it a favorite among data scientists and analysts.Why U 4 min read Python - Data visualization using Bokeh Bokeh is a data visualization library in Python that provides high-performance interactive charts and plots. Bokeh output can be obtained in various mediums like notebook, html and server. It is possible to embed bokeh plots in Django and flask apps. Bokeh provides two visualization interfaces to us 4 min read Pygal Introduction Python has become one of the most popular programming languages for data science because of its vast collection of libraries. In data science, data visualization plays a crucial role that helps us to make it easier to identify trends, patterns, and outliers in large data sets. Pygal is best suited f 5 min read Like