Seaborn: Where Data Visualization Meets Scientific Excellence!

Seaborn use case.


Seaborn is a special visualization library for making statistical graphics and charts in Python.
Seaborn was created on top of matplotlib and integrated closely with Pandas dataset structures. So you can guess tha it's visual and statistial oriented features are nicer and more advanced than matplotlib has.

Visualization with Seaborn.

Coding: The Silent Symphony. - Updated: 2024-05-19 by Andrey BRATUS, Senior Data Analyst.




Seaborn practices data driven approach and helps us explore and understand available data in a smart way. Its plotting functions operate on Pandas dataframes and NumPy arrays and internally perform the necessary semantic mapping and statistical aggregation to produce enlightening graphic plots, that help us focus on better data understanding, rather than on the details of how to draw them.


Plot creation algorithm.


The main steps for creating the plots with Seaborn are:

1 Preparing the data
2 Controlling the plot
3 Plotting
4 Customizing the plot
5 Showing and saving the plot



import matplotlib.pyplot as plt
import seaborn as sns

tips = sns.load_dataset("tips") # Step 1
sns.set_style("whitegrid") # Step 2
g = sns.lmplot(x="tip", # Step 3
y="total_bill",
data=tips,
aspect=2)
g = (g.set_axis_labels("Tip","Total bill(USD)").
set(xlim=(0,10),ylim=(0,100)))  # Step 4
plt.title("title")
plt.show(g)  # Step 5
Seaborn visualization example.


Preparing the data.



import pandas as pd
import numpy as np
uniform_data = np.random.rand(10, 12)
data = pd.DataFrame({'x':np.arange(1,101),
'y':np.random.normal(0,4,100)})

# you can also use built-in datasets
titanic = sns.load_dataset("titanic")
iris = sns.load_dataset("iris")

Controlling the plot - one subplot.



f, ax = plt.subplots(figsize=(5,6))

Controlling the plot - Seaborn styles.



sns.set() #(Re) set the seaborn default
sns.set_style("whitegrid") #Set the matplotlib parameters
sns.set_style("ticks",
{"xtick.major.size":8,
"ytick.major.size":8})
sns.axes_style("whitegrid")

Controlling the plot - Context Functions.



sns.set_context("talk") #Set context to "talk"
sns.set_context("notebook", #Set context to "notebook",
font_scale=1.5, #Scale font elements and override param mapping
rc={"lines.linewidth":2.5})

Controlling the plot - Color Palette.



sns.set_palette("husl",3) #Define the color palette
sns.color_palette("husl") #Use with with to temporarily set palette
flatui = ["#9b59b6","#3498db","#95a5a6","#e74c3c","#34495e","#2ecc71"]
sns.set_palette(flatui) #Set your own color palette

Plotting With Seaborn - Axis Grids.



g = sns.FacetGrid(titanic, #Subplot grid for plotting conditional relationships
col="survived",
row="sex")
g = g.map(plt.hist,"age")
sns.factorplot(x="pclass", #Draw a categorical plot onto a Facetgrid
y="survived",
hue="sex",
data=titanic)
sns.lmplot(x="sepal_width", #Plot data and regression model fits across a FacetGrid
y="sepal_length",
hue="species",
data=iris)

h = sns.PairGrid(iris) #Subplot grid for plotting pairwise relationships
h = h.map(plt.scatter)
sns.pairplot(iris) #Plot pairwise bivariate distributions
i = sns.JointGrid(x="x", #Grid for bivariate plot with marginal univariate plots
y="y",
data=data)
i = i.plot(sns.regplot,
sns.distplot)
sns.jointplot("sepal_length", #Plot bivariate distribution
"sepal_width",
data=iris,
kind='kde')

Plotting With Seaborn - Categorical Plots.



#Scatterplot
sns.stripplot(x="species", #Scatterplot with one categorical variable
y="petal_length",
data=iris)
sns.swarmplot(x="species", #Categorical scatterplot with non-overlapping points
y="petal_length",
data=iris)

#Bar Chart
sns.barplot(x="sex", #Show point estimates and confidence intervals with scatterplot glyphs
y="survived",
hue="class",
data=titanic)

#Count Plot
sns.countplot(x="deck", #Show count of observations
data=titanic,
palette="Greens_d")

#Point Plot
sns.pointplot(x="class", #Show point estimates and confidence intervals as rectangular bars
y="survived",
hue="sex",
data=titanic,
palette={"male":"g",
"female":"m"},
markers=["^","o"],
linestyles=["-","--"])

#Boxplot
sns.boxplot(x="alive", #Boxplot
y="age",
hue="adult_male",
data=titanic)
sns.boxplot(data=iris,orient="h") #Boxplot with wide-form data

#Violinplot
sns.violinplot(x="age", #Violin plot
y="sex",
hue="survived",
data=titanic)

Plotting With Seaborn - Regression Plots.



sns.regplot(x="sepal_width", #Plot data and a linear regression model fit
y="sepal_length",
data=iris,
ax=ax)

Plotting With Seaborn - Distribution Plots.



plot = sns.distplot(data.y, #Plot univariate distribution
kde=False,
color="b")

Plotting With Seaborn - Matrix Plots.



sns.heatmap(uniform_data,vmin=0,vmax=1) #Heatmap

Customizing the plot.



g.despine(left=True) #Remove left spine
g.set_ylabels("Survived") #Set the labels of the y-axis
g.set_xticklabels(rotation=45) #Set the tick labels for x
g.set_axis_labels("Survived", #Set the axis labels
"Sex")
h.set(xlim=(0,5), #Set the limit and ticks of the x-and y-axis
ylim=(0,5),
xticks=[0,2.5,5],
yticks=[0,2.5,5])

plt.title("A Title") #Add plot title
plt.ylabel("Survived") #Adjust the label of the y-axis
plt.xlabel("Sex") #Adjust the label of the x-axis
plt.ylim(0,100) #Adjust the limits of the y-axis
plt.xlim(0,10) #Adjust the limits of the x-axis
plt.setp(ax,yticks=[0,5]) #Adjust a plot property
plt.tight_layout() #Adjust subplot params

Showing and Saving the Plot.



plt.show() #Show the plot
plt.savefig("foo.png") #Save the plot as a figure
plt.savefig("foo.png", #Save transparent figure
transparent=True)

Close & Clear.



plt.cla() #Clear an axis
plt.clf() #Clear an entire figure
plt.close() #Close a window



See also related topics: