Matplotlib, Jupyter and updating multiple interactive plots

Posted on 26. December 2019 by eremo

For experiments in Machine Learning [ML] it is quite useful to see the development of some characteristic quantities during optimization processes for algorithms – e.g. the behaviour of the cost function during the training of Artificial Neural Networks. Beginners in Python the look for an option to continuously update plots by interactively changing or extending data from a running Python code.

Does Matplotlib offer an option for interactively updating plots? In a Jupyter notebook? Yes, it does. It is even possible to update multiple plot areas simultanously. The magic (meta) commands are “%matplotlib notebook” and “matplotlib.pyplot.ion()”.

The following code for a Jupyter cell demonstrates the basic principles. I hope it is useful for other ML- and Python beginners as me.

# Tests for dynamic plot updates
#-------------------------------
%matplotlib notebook
import numpy as np
import matplotlib.pyplot as plt
import time

x = np.linspace(0, 10*np.pi, 100)
y = np.sin(x)

# The really important command for interactive plot updating
plt.ion()

# sizing of the plots figure sizes 
fig_size = plt.rcParams["figure.figsize"]
fig_size[0] = 8
fig_size[1] = 3

# Two figures 
# -----------
fig1 = plt.figure(1)
fig2 = plt.figure(2)

# first figure with two plot-areas with axes 
# --------------------------------------------
ax1_1 = fig1.add_subplot(121)
ax1_2 = fig1.add_subplot(122)

fig1.canvas.draw()

# second figure with just one plot area with axes
# -------------------------------------------------
ax2 = fig2.add_subplot(121)
line1, = ax2.plot(x, y, 'b-')
fig2.canvas.draw()

z= 32
b = np.zeros([1])
c = np.zeros([1])
c[0] = 1000

for i in range(z):
    # update data 
    phase = np.pi / z * i 
    line1.set_ydata(np.sin(0.5 * x + phase))
    b = np.append(b, [i**2])
    c = np.append(c, [1000.0 - i**2])
    
    # re-plot area 1 of fig1  
    ax1_1.clear()
    ax1_1.set_xlim (0, 100)
    ax1_1.set_ylim (0, 1000)
    ax1_1.plot(b)
    
    # re-plot area 2 of fig1  
    ax1_2.clear()
    ax1_2.set_xlim (0, 100)
    ax1_2.set_ylim (0, 1000)
    ax1_2.plot(c)
    
    # redraw fig 1 
    fig1.canvas.draw()

    # redraw fig 2 with updated data  
    fig2.canvas.draw()
    
    time.sleep(0.1)

As you see clearly we defined two different “figures” to be plotted – fig1 and fig2. The first figure ist horizontally splitted into two plotting areas with axes “ax1_1” and “ax1_2”. Such a plotting area is created via the “fig1.add_subplot()” function and suitable parameters. The second figure contains only one plotting area “ax2”.

Then we update data for the plots within a loop witrh a timer of 0.1 secs. We clear the respective areas, redefine the axes and perform the plot for the updated data via the function “plt.figure.canvas.draw()”.

In our case we see two parabolas develop in the upper figure; the lower figure shows a sinus-wave moving slowly from the right to the left.

The following plots show screenshots of the output in a Jupyter notebook in th emiddle of the loop and at its end:

You see that we can deal with 3 plots at the same time. Try it yourself!

Hint:
There is small problem with the plot sizing when you have used the zoom-functionality of Chrome, Chromium or Firefox. You should work with interactive plots with the browser-zoom set to 100%.

Posted in Machine Learning, matplotlib, Python and Eclipse PyDev | Tagged Interactive plots, Jupyter notebook, matplotlib, multiple plots, update plots

The moons dataset and decision surface graphics in a Jupyter environment – V – a class for plots and some experiments

Posted on 29. July 2019 by eremo

We proceed with our exercises on the moons dataset. This series of articles is intended for readers which – as me – are relatively new both to Python and Machine Learning. By working with examples we try to extend our knowledge about the tools “Juypter notebooks” and “Eclipse/PyDev” for setting up experiments which require plots for an interpretation.

We have so far used a Jupyter notebook to perform some initial experiments for creating and displaying a decision surface between the moons dataset clusters with an algorithm called “LinearSVC”. If you followed everything I described in the last articles

The moons dataset and decision surface graphics in a Jupyter environment – I
The moons dataset and decision surface graphics in a Jupyter environment – II – contourplots
The moons dataset and decision surface graphics in a Jupyter environment – III – Scatter-plots and LinearSVC
The moons dataset and decision surface graphics in a Jupyter environment – IV – plotting the decision surface

you may now have gathered around 20 different cells with code. Part of the cells’ code was used to learn some basics about contour and scatter plots. This code is now irrelevant for further experiments. Time to consolidate our plotting knowledge.

In the last article I promised to put plot-related code into a Python class. The class itself can become a part of a Python module – which we in turn can import into the code of Jupyter notebook. By doing this we can reduce the number of cells in a notebook drastically. The importing of external classes is thus helpful for concentrating on “real” data analysis experiments with different learning and predicting algorithms and/or a variation of their parameters.

I assume that you have some basic knowledge on how classes are build in Python. If not please see an introductory book on Python 3.

A class for plotting simple decision surfaces in a 2-dimensional space

In the articles

Eclipse, PyDev, virtualenv and graphical output of matplotlib on KDE – I
Eclipse, PyDev, virtualenv and graphical output of matplotlib on KDE – II
Eclipse, PyDev, virtualenv and graphical output of matplotlib on KDE – III

I had shown how to set up Eclipse PyDev to be used in the context of a Python virtual environment. In our special environment “ml1” used by our Jupyter notebook “moons1.ipynb” we have the following directory structure:

“ml1” has a sub-directory “mynotebooks” which contains notebook files as our “moons1.ipynb”. To provide a place for other general code there we open up a directory “mycode“. In it we create a file “myplots.py” for a module
“myplots“, which shall comprise our own Python classes for plotting.

We distribute the code discussed in the last 2 articles of this series into methods of a class “MyDecisionPlot“; we put the following code into our file “myplots.py” with the Pydev editor.

''' Created on 15.07.2019 Module to gather classes for plotting @author: rmo ''' import numpy as np import sys from matplotlib import pyplot as plt from matplotlib.colors import ListedColormap import matplotlib.patches as mpat #from matplotlib import ticker, cm #from mpl_toolkits import mplot3d class MyDecisionPlot(): ''' This class allows for 1) decision surfaces in 2D (x1,x2) planes 2) plotting scatter data of datapoints ''' def __init__(self, X, y, predictor = None, ax_x_delta=1.0, ax_y_delta=1.0, mesh_res=0.01, alpha=0.4, bcontour=1, bscatter=1, figs_x1=12.0, figs_x2=8.0, x1_lbl='x1', x2_lbl='x2', legend_loc='upper right' ): ''' Constructor of MyDecisionPlot Input: X: Input array (2D) for learning- and predictor-algorithm as VSM y: result data for learning- and predictor-algorithm ax_x_delta, ax_y_delta : delta for extension of both axis beyond the given X, y-data mesh_res: resolution of the mesh spanned in the (x1,x2)-plane (x_max-x_min) * mesh_res alpha: transparency of contours bcontour: 0: Do not plot contour areas 1: plot contour areas bscatter: 0: Do not plot scatter points of the input data sample 1: Plot scatter plot of the input data sample figs_x1: plot size in x1 direction figs_x2: plot size in x2 direction x1_lbl, x2_lbl : axes lables legend_loc : position of a legend Ouptut: Internal: self._mesh_points (mesh points created) External: Plots - shoukd cone up automatically in Jupyter notebooks ''' # initiate some internal variables self._x1_min = 0.0 self._x1_max = 1.0 self._x2_min = 0.0 self._x2_max = 1.0 # Alternatives to resize plots # 1: just resize figure 2: resize plus create subplots() [figure + axes] self._plot_resize_alternative = 2 # X (x1,x2)-Input array self.__X = X self.__y = y self._Z = None # predictor = algorithm to create y-values for new (x1,x2)-data points self._predictor = predictor # meshdata self._resolution = mesh_res # resolution of the mesh self.__ax_x_delta = ax_x_delta self.__ax_y_delta = ax_y_delta self._alpha = alpha self._bscatter = bscatter self._bcontour = bcontour self._xm1 = None self._xm2 = None self._mesh_points = None # set marker array and define colormap self._markers = ('s', 'x', 'o', '^', 'v') self._colors = ('red', 'blue', 'lightgreen', 'gray', 'cyan') self._cmap = ListedColormap(self._colors[:len(np.unique(y))]) self._x1_lbl = x1_lbl self._x2_lbl = x2_lbl self._legend_loc = legend_loc # Plot-sizing self._figs_x1 = figs_x1 self._figs_x2 = figs_x2 self._fig = None self._ax = None # alternative 2 does resizing and (!) subplots() self.initiate_and_resize_plot(self._plot_resize_alternative) r # create mesh in x1, x2 - direction with mesh_res resolution # meshpoint-array will be creted with right dimension for plotting data self.create_mesh() # Array meshpoints should exist now if(self._bcontour == 1): try: if self._predictor == None: raise ValueError except ValueError: print ("You must provide an algorithm = 'predictor' as parameter") #sys.exit(0) sys.exit() self.make_contourplot() else: if (self._bscatter == 1): self.make_scatter_plot() # method to create a dense mesh in the (x1,x2)-plane def create_mesh(self): ''' Method to create a dense mesh in an (x1,x2) plane Input: x1, x2-data are constructed from array self.__X Output: A suitable array of meshpoints is written to self._mesh_points() ''' try: self._x1_min = self.__X[:, 0].min() self._x1_max = self.__X[:, 0].max() except ValueError: # as e: print ("cannot determine x1_min = X[:,0].min() or x1_max = X[:,0].max()") try: self._x2_min = self.__X[:, 1].min() self._x2_max = self.__X[:, 1].max() except ValueError: # as e: print ("cannot determine x2_min =X[:,1].min()) or x2_max = X[:,1].max()") self._x1_min, self._x1_max = self._x1_min - self.__ax_x_delta, self._x1_max + self.__ax_x_delta self._x2_min, self._x2_max = self._x2_min - self.__ax_x_delta, self._x2_max + self.__ax_x_delta #create mesh data (x1, x2) self._xm1, self._xm2 = np.meshgrid( np.arange(self._x1_min, self._x1_max, self._resolution), np.arange(self._x2_min, self._x2_max, self._resolution)) #print (self._xm1) # for hasattr the variable cannot be provate ! #print ("xm1 is there: ", hasattr(self,'_xm1' ) ) # ordering and transposing of the mesh-matrix # for understanding the structure and transpose requirements see linux-blog.anracom.con self._mesh_points = np.array([self._xm1.ravel(), self._xm2.ravel()]).T try: if( hasattr(self, '_mesh_points') == False ): raise ValueError except ValueError: print("The required array mesh_points has not been created!") exit # ------------- # Some helper functions to change valus on the fly if necessary def set_mesh_res(self, new_mesh_res): self._resolution = new_mesh_res def change_predictor(self, new_predictor): self._predictor = new_predictor def change_alpha(self, new_alpha): self._alpha = new_alpha def change_figs(self, new_figs_x1, new_figs_x2): self._figs_x1 = new_figs_x1 self._figs_x2 = new_figs_x2 # ------------- # method to get subplots and resize the figure # ------------- def initiate_and_resize_plot(self, alternative=2 ): # Alternative 1 to resize plots - works afte rimports to Jupyter notebooks, too if alternative == 1: self._fig_size = plt.rcParams["figure.figsize"] self._fig_size[0] = self._figs_x1 self._fig_size[1] = self._figs_x2 plt.rcParams["figure.figsize"] = self._fig_size # Not working for sizing if plain subplots() is used #plt.figure(figsize=(self._figs_x1 , self._figs_x2)) #self._fig, self._ax = plt.subplots() # instead you have to put the figsize-parameter into the subplots() function # Alternative 2 for resizing plots and using subplots() # we use this alternative as we may need the axes later for 3D plots if alternative == 2: self._fig, self._ax = plt.subplots(figsize=(self._figs_x1 , self._figs_x2)) # *********************************************** # ------------- # method to create contour plots # ------------- def make_contourplot(self): ''' Method to create a contourplot based on a dense mesh of points in an (x1,x2) plane and a predictor algorithm which allows for value calculations ''' try: if( not hasattr(self, '_mesh_points') ): raise ValueError except ValueError: print("The required array mesh_points has not been created!") exit # Predict values for all meshpoints try: self._Z = self._predictor.predict(self._mesh_points) except AttributeError: print("method predictor.predict() does not exist") #reshape self._Z = self._Z.reshape(self._xm1.shape) #print (self._Z) # make the plot plt.contourf(self._xm1, self._xm2, self._Z, alpha=self._alpha, cmap=self._cmap) # create a scatter-plot of data sample as well if (self._bscatter == 1): self.make_scatter_plot() self.make_plot_legend() # ------------- # method to create a scatter plot of the data sample # ------------- def make_scatter_plot(self): alpha2 = self._alpha + 0.4 if (alpha2 > 1.0 ): alpha2 = 1.0 for idx, yv in enumerate(np.unique(self.__y)): plt.scatter(x=self.__X[self.__y==yv, 0], y=self.__X[self.__y==yv, 1], alpha=alpha2, c=[self._cmap(idx)], marker=self._markers[idx], label=yv) if self._bscatter == 0: self._bscatter = 1 self.make_plot_legend() # ------------- # method to add a legend # ------------- def make_plot_legend(self): plt.xlim(self._x1_min, self._x1_max) plt.ylim(self._x2_min, self._x2_max) plt.xlabel(self._x1_lbl) plt.ylabel(self._x2_lbl) # we have two cases # a) for a scatter plot we have array values where the legend is taken from automatically # b) For apure contourplot we need to prepare a legend with "patches" (kind og labels) used by pyplot.legend() if (self._bscatter == 1): plt.legend(loc=self._legend_loc) else: red_patch = mpat.Patch(color='red', label='0', alpha=0.4) blue_patch = mpat.Patch(color='blue', label='1', alpha=0.4) plt.legend(handles=[red_patch, blue_patch], loc=self._legend_loc)

This certainly is no masterpiece of superior code design; so you may change it. However, the code is good enough for our present purposes.

Note that we have to import basic Python modules into the namespace of this module. This is conventionally done at the top of the file.

Note also the 2 alternatives offered for resizing a plot! Both work also for “inline” plotting in a Jupyter environment; see the text below.

Using the module “myplots” in a Jupyter notebook

In a terminal we move to our example directory “/projekte/GIT/ai/ml1” and start our virtual Python environment:

myself@mytux: /projekte/GIT/ai/ml1> source bin/activate (ml1) myself@mytux:/projekte/GIT/ai/ml1> jupyter notebook [I 11:46:14.687 NotebookApp] Writing notebook server cookie secret to /run/user/1999/jupyter/notebook_cookie_secret [I 11:46:15.942 NotebookApp] Serving notebooks from local directory: /projekte/GIT/ai/ml1 [I 11:46:15.942 NotebookApp] The Jupyter Notebook is running at: .... ....

We then open our old notebook “moons1” and save it under the name “moons2”:

We delete all old cells. Then we change the first cell of our new notebook to the following contents:

import imp %matplotlib inline from mycode import myplots from sklearn.datasets import make_moons from sklearn.pipeline import Pipeline from sklearn.preprocessing import StandardScaler from sklearn.preprocessing import PolynomialFeatures from sklearn.svm import LinearSVC from sklearn.svm import SVC

You see that we imported the “myplots” module from the “package” directory “mycode”. The Jupyter notebook will find the path to “mycode” as long as we have opened the notebook on a higher directory level. Which we did … see above.

Note the statement with a so called “magic command” for our notebook:

%matplotlib inline

There are many other “magic commands” and parameters which you can study at
Ipython magic commands

The command “%matplotlib inline” informs the notebook to show plots created by any imported modules “inline”, i.e. in the visual context of the affected cell. This specific magic directive should be issued before matplotlib/pyplot is imported into any of the external python modules which we in turn import into the cell code.

A call of plt.show() in our class’s method “make_contourplot()” is no longer necessary afterwards.

If we, however, want to resize the plots in comparison to Jupyter standard values we have to control this by parameters of our plot class. Such parameters are offered already in the interface of the class; but they can be changed by a method “change_figs(new_figs_x1, new_figs_x2)), too.

In a second cell of our new notebook we now prepare the moon data set

X, y = make_moons(noise=0.1, random_state=5)

Further cells will be used for quick individual experiments with the moons dataset. E.g.:

imp.reload(myplots) X, y = make_moons(noise=0.1, random_state=5) # contour plot of the moons data - with scatter plot / training with LinearSVC polynomial_degree = 3 max_iterations = 6000 polynomial_svm_clf = Pipeline([ ("poly_features", PolynomialFeatures(degree=polynomial_degree)), ("scaler", StandardScaler()), ("svm_clf", LinearSVC(C=18, loss="hinge", max_iter=max_iterations)) ]) #training polynomial_svm_clf.fit(X, y) #plotting MyPlt = myplots.MyDecisionPlot(X, y, polynomial_svm_clf, bcontour = 1, bscatter=1 )

The last type of cell just handles the setup and training of our specific algorithm “LinearSVC” and delegates plotting to our class.

Testing the new notebook

A test of the 3 cells in their order gives

All well! This is exactly what we hoped to achieve.

Three experiments with a varying polynomial degree

As we now have a simple and compact cell template for experiments we add three
further cells where we vary the degree of the polynomials for LinearSVC. Below we show the results for degree 6, 7 and for comparison also for a degree of 2.

On a modern computer it should take almost no time to produce the results. (We shall learn how to measure CPU-time in the next article).

We understand that we at least need a polynomial of degree 3 to separate the data reasonably. However, polynomials with an even degree (>= 4) separate the 2 data regions very differently compared to polynomials with an uneven degree (>=3) in outer areas of the (x1,x2)-plane (where no training data were placed):

Depending on the polynomial degree our Linear SVC algorithm obviously extrapolates in different ways to regions without such data. And we have no clue which of the polynomials is the better choice …

This poses a warning for the use of AI in general:

We should be extremely careful to trust predictions of any AI algorithm in parameter regions for which the algorithm must extrapolate – as long as we have no real data points available there to discriminate between multiple solutions that all work equally well in regions with given training data.

Would general modules be imported twice in a Jupyter cell – via the import of an external module, which itself includes import statements, and a direct import statement in a Jupyter cell?

The question posed in the headline is an interesting one for me as a Python beginner. Coming from other programming languages I get a bit nervous when I see the possibility for import statements referring to a specific module both in another already imported module and by a direct import statement afterwards in a Jupyter cell. E.g. we import numpy indirectly via our “myplots” module, but we could and sometimes must import it in addition directly in our Jupyter cell.

Note that we must make the general modules as numpy, matplotlib, etc. available in the namespace environment of our private module “myplots”. Otherwise the functions cannot be used there. The Jupyter cell, however, corresponds to an independent namespace environment. So, an import may indeed be required there, too, if we plan to handle numpy arrays via code in such a cell.

Reading a bit about the Python import logic on the Internet reveals that a double import or overwriting should not take place; instead an already imported piece of code only gets multiple references in the various specific namespaces of different code environments.

We can test this with the following code in a Jupyter cell:

Note that numpy is also imported by our “myplots”. However, the length of the list produced by sys.modules.keys(), which enumerates all possible module reference points (including imports) does not change.

Reloading modules in Jupyter cells

What if we in the
course of or experiments need to change the code of our imported module? Then we need to reload the module in a Jupyter cell before we run it again. In our case (Python 3!) this can be done by the command

imp.reload(myplots)

As the code of our first cell reveals, the general package “imp” must have been imported before we can use its reload-function.

Conclusion

We saw that it is easy to use our own modules with Python class code, which we created in an Eclipse/PyDev environment, in a Jupyter notebook. We just use Python’s standard import mechanism in Jupyter cells to get access to our classes. We applied this to a module with a simple class for preparing decision surface plots based on contour and scatter plot routines of matplotlib. We imported the module in a new Jupyter notebook.

Plots created by our imported class-methods were displayed correctly within the cell environment as soon as we used the magic directive “%matplotlib inline” within our notebook.

In addition we used our new notebook with its compact cell structure for some quick experiments: We set different values for the polynomial degree parameter of our LinearSVC algorithm. We saw that the results of algorithms should be interpreted with caution if the predictions refer to data points in regions of the representation or feature space which were not at all covered by the data sets for training.

The prediction results (= extrapolations) of even one and the same algorithm with different parameters may deviate strongly in such regions – and we may not have any reliable indications to decide which of the predictions and their basic parameter setting are better.

In the next article of this series

The moons dataset and decision surface graphics in a Jupyter environment – VI – Kernel-based SVC algorithms

we shall have a look at what kind of decision surface some other SVM algorithms than LinearSVC create for the moons dataset. In addition shall briefly discuss kernel based algorithms.

Posted in Machine Learning, matplotlib, SVM- Support Vector Machines | Tagged %matplotlib inline, a class for plots, decision surface, Jupyter notebook, LinearSVC, module import, Moons dataset, plots

The moons dataset and decision surface graphics in a Jupyter environment – II – contourplots

Posted on 6. July 2019 by eremo

I proceed with my present article series on the “moons dataset” as an example for classification tasks in the field of “machine learning” [ML]. My objective is to gather basic knowledge on Python related tools for performing related experiments. In my last blog article

The moons dataset and decision surface graphics in an Jupyter environment – I

In the case of the “moons dataset” we can apply and train support vector machines [SVM] algorithms for solving the classification task: The trained algorithm will predict to which of the 2 clusters a new data point probably belongs. The basic task for this kind of information reduction is to find a (curved) decision surface between the data clusters in the n-dimensional representation space of the data points during the training of the algorithm.

As the moons feature space is only 2-dimensional the decision surface would be a curved line. Of course, we would like to add this line to the 2D-plot of the moons clusters shown in the last article.

The challenge of plotting data points and decision surfaces for our moon clusters

is sufficiently simple for a Python- and AI/ML-beginner as me,

is a good opportunity to learn how to work with a Jupyter notebook,

gives us a reason to become acquainted with some basic plotting functions of matplotlib,

an access to some general functions of SciKit – and some specific ones for SVM-problems.

Much to learn from one little example. Points 2 and 3 are the objectives of this article.

Contour plots !

But what kind of plots should we be interested in? We need to separate areas of a 2-dimensional parameter space (x1,x2) for which we get different (integer) target or y-values, i.e. to distinguish between a set of distinct classes to which data points may belong – in our case either to a class “0” of the first moon like cluster and a class “1” for data points around the second cluster.

In applied mathematics there is a very similar problem: For a given function z(x1,x2) we want to visualize regions in the (x1,x2)-plane for which the z-values cover a range between 2 selected distinct z-values, so called contour areas. Such contour areas are separated by contour lines. Think of height lines in a map of a mountain region. So, there is an close relation between a contour line and a decision surface – at least in a two dimensional setup. We need contour plots!

Let us see how we start a Jupyter environment and how we produce nice 2D- and even 3D-contour-plots.

Starting a Jupyter notebook from a virtual Python environment on our Linux machine

I discussed the setup of a virtual Python environment (“virtualenv”) already in the article Eclipse, PyDev, virtualenv and graphical output of matplotlib on KDE – I of this blog. I refer to the example and the related paths there. The “virtualenv” has a name of “ml1” and is located at “/projekte/GIT/ai/ml1”.

In the named article I had also shown how to install the Jupyter package with the help of “pip3” within this environment. You can verify the Jupyter installation by having a look into the directory “/projekte/GIT/ai/ml1/bin” – you should see some files “ipython3” and “jupyter” there. I had also prepared a directory
“/projekte/GIT/ai/ml1/mynotebooks”
to save some experimental notebooks there.

How do we start a Jupyter notebook? This is simple – we just use a
terminal window and enter:

myself@mytux:/projekte/GIT/ai/ml1> source bin/activate (ml1) myself@mytux:/projekte/GIT/ai/ml1> jupyter notebook [I 16:16:27.734 NotebookApp] Writing notebook server cookie secret to /run/user/1004/jupyter/notebook_cookie_secret [I 16:16:29.040 NotebookApp] Serving notebooks from local directory: /projekte/GIT/ai/ml1 [I 16:16:29.040 NotebookApp] The Jupyter Notebook is running at: [I 16:16:29.040 NotebookApp] http://localhost:8888/?token=942e6f5e75b0d014659aea047b1811d1992ca77e4d8cc714 [I 16:16:29.040 NotebookApp] Use Control-C to stop this server and shut down all kernels (twice to skip confirmation). [C 16:16:29.054 NotebookApp] To access the notebook, open this file in a browser: file:///run/user/1004/jupyter/nbserver-19809-open.html Or copy and paste one of these URLs: http://localhost:8888/?token=942e6f5e75b0d014659aea047b1811d1992ca77e4d8cc714

We see that a local http-server is started and that a http-request is issued. In the background on my KDE desktop a new tag in my standard browser “Firefox” is opened for this request:

Note that a standard port 8888 is used; this port should not be used by other services on your machine.

On the displayed web page we can move to the “mynotebooks” directory. We open a new notebook there by clicking on the “New”-button on the right side of the browser window:

We choose Python3 as the relevant interpreter and get a new browser window:

We give the notebook a title by clicking on “File >> Save as …” before start using the provided input “cell” for coding

I name it “moons1” in the next input form and check afterward in a terminal that the file “/projekte/GIT/ai/ml1mynotebooks/moons1.ipynb” really has been created; you see this also in the address bar of the browser – see below.

Lets do some plotting within a notebook

Most of the icons regarding the notebook screen are self explanatory. The interesting and pretty nice thing about a Jupyter notebook is that the multiple lines of Python code can be filled into cells. All lines can be executed in a row by first choosing a cell via clicking on a it and then clicking on the “Run” button.

As a first exercise I want to do some plotting with “matplotlib” (which I also installed together with the numpy package in a previous article). We start by importing the required modules:

A new cell for input opens automatically (it is clever to separate cells for imports and for real code). Let us produce a most simple plot there:

No effort in comparison to what we had to do to prepare an Eclipse environment for plotting (see Eclipse, PyDev, virtualenv and graphical output of matplotlib on KDE – II). Calling plot routines simply works – no special configuration is required. Jupyter and the browser do all the work for us. We save our present 2 cells by clicking on the “Save“-icon.

How do we plot contour lines or contour areas?

Later on we need to plot a separation line in a 2-dimensional parameter space between 2 clustered sets of data. This task is very similar to plotting a contour line. As this is a common task in math we expect matplotlib to provide some functionality for us. Our ultimate goal is to wrap this plotting functionality into a function or class which also accepts an SVM based ML-method of SciKit to prepare and evaluate the basic data first. But let us proceed step by step.

Some research on the Internet shows: The keys to contour plotting with matplotlib are the functions “contour()” and “contourf()” (matplotlib.pyplot.contourf):

contour(f)([X, Y,] Z, [levels], **kwargs)

“contour()” plots lines, only, whilst “contourf()” fills the area between the lines with some color.

Both functions accept data sets in the form of X,Y-coordinates and Z-values (e.g. defined by some function Z=f(X,Y)) at the respective points.

X and Y can be provided as 1-dim arrays; Z-values, however, must be given by a 2-dim array, such that len(X) == M is the number of columns in Z and len(Y) == N is the number of rows in Z. We cover the X,Y-plane with Z-values from bottom to top (Y, lines) and from the left to the right (X, columns).

Somewhat counter-intuitively, X and Y can also be provided as 2-dim arrays – with the same dimensionality as Z.
There is a nice function “meshgrid” (of packet numpy) which allows for the creation of e.g. a mesh of two 2-dimensional X- and separately Y-matrices. See for further information (numpy.meshgrid). Both arrays then have a (N,M)-layout (shape); as the degree of information of one coordinate is basically 1-dimensional, we do expect repeated values of either coordinate in the X-/Y-meshgrid-matrices.

The function “shape” gives us an output in the form of (N lines, M columns) for a 2-dim array. Lets apply all this and create a rectangle shaped (X,Y)-plane:

The basic numpy-function “arange()” turns a range between two limiting values into an array of equally spaced values. We see that meshgrid() actually produces two 2-dim arrays of the same “shape”.

For test purposes let us use a function

Z1=-0.5* (X)**2 + 4*(Y)**2.

For this function we expect elliptical contours with the longer axis in X-direction. The “contourf()”-documentation shows that we can use the parameters “levels“, “cmap” and “alpha” to set the number of contour levels (= number of contour lines -1), a so called colormap, and the opacity of the area
coloring, respectively.

You find predefined colormaps and their names at this address: matplotlib colormaps. If you add an “_r” to the colormap-name you just reverse the color sequence.

We combine all ingredients now to create a 2D-plot (with the “plasma” colormap):

Our first reasonable contour-plot within a Jupyter notebook! We got the expected elliptic curves! Time for a coffee ….

Changing the plot size

A question that may come to your mind at this stage is: How can we change the size of the plot?

Well, this can be achieved by defining some basic parameters for plotting. You need to do this in advance of any of your specific plots. One also wants to add some labels for all axis. We, therefore, extend the code in our cell a bit by the following statements and click again on “Run”:

You see that “fig_size = plt.rcParams[“figure.figsize”]” provides you with some kind of array- or object like information on the size of plots. You can change this by assigning new values to this object. “figure” is an instance of a container class for all plot elements. “plt.xlabel” and “plt.ylabel” offer a simple option to add some text to an axis of the plot.

What about a 3D-representation …

As we are here – isn’t our function for Z1 not a good example to get a 3D-representation of our data? As 3D-plots are helpful in other contexts of ML, lets have a quick side look at this. You find some useful information at the following addresses:
PythonDataScienceHandbook and mplot3d-tutorial

I used the given information in form of the following code:

You see that we can refer to a special 3D-plot-object as the output of plt.axes(projection=’3d’). The properties of such an object can be manipulated by a variety of methods. You also see that I manipulated the number of ticks on the z-axis to 5 by using a function “set_major_locator(plt.MaxNLocator(5)“. I leave it to the reader to dive deeper into manipulation options for a plot axis.

Addendum – 07.07.2019: Adding a colorbar

A reader asked me to show how one can set ticks and add a color-bar to the plots. I give an example code below:

The result is:

For the 3D-plots we get:

Conclusion

Enough for today. We have seen that it is relatively simple to create nice contour and even 3D-plots in a Jupyter notebook environment. This new knowledge provides us with a good basis for a further approach to our objective of plotting a decision surface for the moons dataset. In the next article

The moons dataset and decision surface graphics in a Jupyter environment – III – scatter-plots and LinearSVC

we first import the moons data set into our Jupyter notebook. Then we shall create a so called “scatter plot” for all data points. Furthermore we shall train a specific SVM algorithm (LinearSVC) on the dataset.

Links

https://codeyarns.com/2014/10/27/how-to-change-size-of-matplotlib-plot/
matplotlib.pyplot.contourf
https://stackoverflow.com/questions/12608788/changing-the-tick-frequency-on-x-or-y-axis-in-matplotlib

Posted in Machine Learning, matplotlib, SVM- Support Vector Machines | Tagged contour plot, contourf, decision surface, Jupyter notebook, Machine Learning, matplotlib, Moons dataset, mplot3d

July 2025

M T W T F S S

1 2 3 4 5 6

7 8 9 10 11 12 13

14 15 16 17 18 19 20

21 22 23 24 25 26 27

28 29 30 31

« Jun

Recent Posts

Upgrade from Leap 15.3 over 15.4, 15.5 up to Leap 15.6 – problems with the named service

Keras 3/TF vs. PyTorch – small model performance tests on a Nvidia 4060 TI

Leap 15.6, Nvidia driver 570 – resume from suspend to RAM not working / workaround

Leap 15.6 – upgrade from Leap 15.5 on laptop with Optimus architecture

Opensuse Leap 15.5 – KDE Plasma on Wayland – Nvidia driver, Opera, Chromium, Libreoffice, StandBy

Categories

Allgemeines / Meinungen

Blender

Cups, Druck

Debian

Debian 8

Debian 9

Kali Linux

Erfahrungsberichte

Firewall Netfilter Iptables

Hardware, Treiber

Nvidia

KDE

Kontact – Kmail

LAMP / Webentwicklung

Apache

Eclipse für PHP

GIT

Javascript / JQuery / Ajax

MySQL

PHP

phpMyAdmin

SVN

vsftp

LDAP

LibreOffice, OpenOffice

Linux 3D Desktop

Linux-Container

Machine Learning

Autoencoder

Cluster Algorithms

CNNs

DeepDream

GPT

Keras

Mathematics

ML environment

MLP

Perceptron

SVM- Support Vector Machines

Tensorflow 2

Text analysis with ML

Text pre-processing for ML

Variational Autoencoder

Netzwerk / networking

Open Source

Open-Xchange

Open-Xchange 5

Open-Xchange 6

Opensuse

Leap 15.0

Leap 15.1

Leap 15.2

Leap 15.3

Leap 15.4

Leap 15.5

Leap 15.6

Leap 42.1

Leap 42.2

Leap 42.3

Opensuse 12.1

Opensuse 12.2

Opensuse 12.3

Opensuse 13.1

Opensuse 13.2

Postfix, Cyrus IMAP, Kmail

Python and Eclipse PyDev

matplotlib

Numpy

Security und Security-Tools

Sound

Strato-V-Server

systemd und udev

Verschlüsselung – SSH – SFTP – TLS – E-Mails

Virtualisierung, KVM, Xen

VMware Workstation

Web – Browser, HTTP, HTML, CSS

Wordpress

Blogroll

machine-learning@anracon

homepages

Homepage anracom.com

Archives

June 2025

April 2025

February 2025

September 2024

August 2024

July 2024

May 2024

April 2024

March 2024

February 2024

January 2024

November 2023

October 2023

September 2023

August 2023

July 2023

June 2023

May 2023

April 2023

March 2023

February 2023

January 2023

December 2022

November 2022

October 2022

September 2022

August 2022

June 2022

May 2022

April 2022

March 2022

February 2022

January 2022

November 2021

October 2021

September 2021

August 2021

July 2021

June 2021

May 2021

April 2021

March 2021

February 2021

January 2021

December 2020

November 2020

October 2020

September 2020

August 2020

July 2020

June 2020

May 2020

April 2020

March 2020

February 2020

January 2020

December 2019

November 2019

October 2019

September 2019

August 2019

July 2019

June 2019

May 2019

April 2019

March 2019

January 2019

December 2018

November 2018

October 2018

September 2018

August 2018

July 2018

June 2018

May 2018

April 2018

March 2018

February 2018

January 2018

December 2017

November 2017

October 2017

September 2017

August 2017

July 2017

June 2017

May 2017

April 2017

March 2017

February 2017

January 2017

December 2016

November 2016

October 2016

September 2016

August 2016

July 2016

June 2016

April 2016

March 2016

February 2016

January 2016

December 2015

November 2015

October 2015

September 2015

August 2015

July 2015

June 2015

May 2015

April 2015

February 2015

January 2015

December 2014

November 2014

October 2014

September 2014

August 2014

July 2014

May 2014

April 2014

March 2014

December 2013

November 2013

October 2013

September 2013

August 2013

July 2013

June 2013

May 2013

April 2013

March 2013

February 2013

December 2012

November 2012

October 2012

September 2012

August 2012

July 2012

June 2012

May 2012

April 2012

March 2012

February 2012

January 2012

November 2011

October 2011

August 2011

July 2011

May 2011

April 2011

March 2011

February 2011

January 2011

December 2010

October 2010

September 2010

August 2010

July 2010

June 2010

May 2010

April 2010

March 2010

February 2010

January 2010

December 2009

November 2009

October 2009

September 2009

August 2009

July 2009

June 2009

May 2009

April 2009

February 2009

January 2009

December 2008

November 2008

October 2008

September 2008

August 2008

July 2008

June 2008

May 2008

February 2008

October 2007

September 2007

July 2007

Meta

Log in

Entries feed

Comments feed

WordPress.org

Linux-Blog – Dr. Mönchmeyer / anracon

Notes about Linux, ML and some simple math …

Tag Archives: Jupyter notebook

Matplotlib, Jupyter and updating multiple interactive plots

The moons dataset and decision surface graphics in a Jupyter environment – V – a class for plots and some experiments

A class for plotting simple decision surfaces in a 2-dimensional space

Using the module “myplots” in a Jupyter notebook

Testing the new notebook

Three experiments with a varying polynomial degree

Would general modules be imported twice in a Jupyter cell – via the import of an external module, which itself includes import statements, and a direct import statement in a Jupyter cell?

Reloading modules in Jupyter cells

Conclusion

The moons dataset and decision surface graphics in a Jupyter environment – II – contourplots

Contour plots !

Starting a Jupyter notebook from a virtual Python environment on our Linux machine

Lets do some plotting within a notebook

How do we plot contour lines or contour areas?

Changing the plot size

What about a 3D-representation …

Addendum – 07.07.2019: Adding a colorbar

Conclusion

Links