svd_mpsic/jupyter.ipynb

584 lines
17 KiB
Plaintext
Raw Normal View History

{
"cells": [
{
"cell_type": "code",
2022-05-09 14:30:12 +02:00
"execution_count": null,
"outputs": [],
"source": [
2022-05-09 14:30:12 +02:00
"import IPython\n",
"import numpy as np\n",
2022-05-09 14:30:12 +02:00
"import matplotlib.pyplot as plt\n",
"from skimage import data\n",
"from skimage.color import rgb2gray\n",
"from skimage import img_as_ubyte,img_as_float\n",
"from numpy.linalg import svd\n",
"from PIL import Image\n",
"# change the cell width\n",
"from IPython.core.display import display, HTML\n",
"display(HTML(\"<style>.container { width:80% !important; }</style>\"))\n"
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
"name": "#%%\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"#This line is required to display visualizations in the browser\n",
"%matplotlib inline"
2022-05-09 14:30:12 +02:00
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
"name": "#%%\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "markdown",
"source": [
2022-05-09 14:30:12 +02:00
"### SVD and Image compression"
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
"name": "#%% md\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "markdown",
"source": [
"Now we will explore how to apply Singular Value Decomposition of a matrix to the problem of image compression. SVD decomposes a rectangular matrix $M$ to a three parts.\n",
"$M=U\\Sigma V^T$ -\n",
"- $U$ - matrix of left singular vectors in the columns\n",
"- $\\Sigma$ - diagonal matrix with singular values\n",
"- $V$ - matrix of right singular vectors in the columns\n",
"\n",
"SVD in effect involves reconstructing the original matrix as a linear combination of several rank one matrices. A rank one matrix can be expressed as a outer product of two column vectors. \n",
"\n",
"$M=\\sigma_1u_1v_1^T+\\sigma_2u_2v_2^T+\\sigma_3u_3v_3^T+\\sigma_3u_3v_3^T+....$ . \n",
"A matrix of rank r will have r terms of these.\n",
"\n",
"Here $\\sigma_1,\\sigma_2,\\sigma_3 ...$ are singular values. $u_1,u_2,u_3 ...$ and $v_1,v_2,v_3 ...$ are left and right singular vectors respectively.\n",
"\n",
"Image compression using SVD involves taking advantage of the fact that very few of the singular values are large. Although images from the real world are of full rank, they have low effective rank which means that only few of the singular values of the SVD of images will be large."
2022-05-09 14:30:12 +02:00
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
"name": "#%% md\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "markdown",
"source": [
"### skimage image processing library\n",
"\n",
"We will use skimage image processing library (from sci-kit family of packages) for working with images in python. skimage has a module called data which makes available a set of images for exploration. We will load some images and convert them into a gray scale format. These images are stored in a python dict object gray_images."
2022-05-09 14:30:12 +02:00
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
2022-05-09 14:30:12 +02:00
"name": "#%% md\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "code",
2022-05-09 14:30:12 +02:00
"execution_count": null,
"outputs": [],
"source": [
"gray_images = {\n",
" \"cat\":rgb2gray(img_as_float(data.chelsea())),\n",
" \"astro\":rgb2gray(img_as_float(data.astronaut())),\n",
" \"camera\":data.camera(),\n",
" \"coin\": data.coins(),\n",
" \"clock\":data.clock(),\n",
" \"blobs\":data.binary_blobs(),\n",
" \"coffee\":rgb2gray(img_as_float(data.coffee()))\n",
"}"
2022-05-09 14:30:12 +02:00
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
2022-05-09 14:30:12 +02:00
"name": "#%%\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "markdown",
"source": [
"### svd in python\n",
"We will use ```numpy.linalg``` library's ```svd``` function to compute svd of a matrix in python. The svd function returns U,s,V .\n",
" - U has left singular vectors in the columns\n",
" - s is rank 1 numpy array with singular values\n",
" - V has right singular vectors in the rows -equivalent to $V^T$ in traditional linear algebra literature\n",
" \n",
"The reconstructed approximation of the original matrix is done using a subset of singular vectors as below in the ```compress_svd``` function . We use numpy array slicing to select k singular vectors and values. Instead of storing $m\\times n$ values for the original image, we can now store $k(m+n)+k$ values\n",
"\n",
" reconst_matrix = np.dot(U[:,:k],np.dot(np.diag(s[:k]),V[:k,:]))\n",
" "
2022-05-09 14:30:12 +02:00
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
2022-05-09 14:30:12 +02:00
"name": "#%% md\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "code",
2022-05-09 14:30:12 +02:00
"execution_count": null,
"outputs": [],
"source": [
"def compress_svd(image,k):\n",
" \"\"\"\n",
" Perform svd decomposition and truncated (using k singular values/vectors) reconstruction\n",
" returns\n",
" --------\n",
" reconstructed matrix reconst_matrix, array of singular values s\n",
" \"\"\"\n",
" U,s,V = svd(image,full_matrices=False)\n",
" reconst_matrix = np.dot(U[:,:k],np.dot(np.diag(s[:k]),V[:k,:]))\n",
2022-05-09 14:18:40 +02:00
"\n",
" return reconst_matrix,s"
2022-05-09 14:30:12 +02:00
],
"metadata": {
"collapsed": false,
"pycharm": {
"name": "#%%\n"
}
}
},
{
"cell_type": "markdown",
2022-05-09 14:30:12 +02:00
"source": [
"### Compress gray scale images\n",
"The function ```compress_show_gray_images``` below takes in the image name (img_name) and number of singular values/vectors(k) to be used in the compressed reconstruction. It also plots the singular values and the image."
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
"name": "#%% md\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "code",
2022-05-09 14:30:12 +02:00
"execution_count": null,
"outputs": [],
"source": [
"def compress_show_gray_images(img_name,k):\n",
" \"\"\"\n",
" compresses gray scale images and display the reconstructed image.\n",
" Also displays a plot of singular values\n",
" \"\"\"\n",
" image=gray_images[img_name]\n",
" original_shape = image.shape\n",
" reconst_img,s = compress_svd(image,k)\n",
" fig,axes = plt.subplots(1,2,figsize=(8,5))\n",
" axes[0].plot(s)\n",
" compression_ratio =100.0* (k*(original_shape[0] + original_shape[1])+k)/(original_shape[0]*original_shape[1])\n",
" axes[1].set_title(\"compression ratio={:.2f}\".format(compression_ratio)+\"%\")\n",
" axes[1].imshow(reconst_img,cmap='gray')\n",
" axes[1].axis('off')\n",
2022-05-09 14:30:12 +02:00
" fig.tight_layout()"
],
"metadata": {
"collapsed": false,
"pycharm": {
"name": "#%%\n"
}
}
},
{
"cell_type": "markdown",
2022-05-09 14:30:12 +02:00
"source": [
"Use the below interactive widget to explore how the quality of the reconstructed image varies with $k$"
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
"name": "#%% md\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "code",
2022-05-09 14:30:12 +02:00
"execution_count": null,
"outputs": [],
"source": [
"def compute_k_max(img_name):\n",
" \"\"\"\n",
" utility function for calculating max value of the slider range\n",
" \"\"\"\n",
" img = gray_images[img_name]\n",
" m,n = img.shape\n",
" return m*n/(m+n+1)\n",
"\n",
"#set up the widgets\n",
"import ipywidgets as widgets\n",
"\n",
"list_widget = widgets.Dropdown(options=list(gray_images.keys()))\n",
"int_slider_widget = widgets.IntSlider(min=1,max=compute_k_max('cat'))\n",
"def update_k_max(*args):\n",
" img_name=list_widget.value\n",
" int_slider_widget.max = compute_k_max(img_name)\n",
2022-05-09 14:30:12 +02:00
"list_widget.observe(update_k_max,'value')\n"
],
2022-05-09 14:18:40 +02:00
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
2022-05-09 14:18:40 +02:00
"pycharm": {
"name": "#%%\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "code",
"execution_count": null,
2022-05-09 14:18:40 +02:00
"outputs": [],
"source": [
"# Print matrices\n",
"def print_matrices(img_name, k):\n",
" image=gray_images[img_name]\n",
" original_shape = image.shape\n",
" print(f\"Input image dimensions. Width:{original_shape[1]} Height:{original_shape[0]}\")\n",
"\n",
" U,s,V = svd(image,full_matrices=False)\n",
" print(f\"Shape of U matrix: {U[:,:k].shape}\")\n",
" print(f\"U MATRIX: {U[:,:k]}\")\n",
" print('*' * 100)\n",
" print(f\"Shape of S matrix: {s[:k].shape}\")\n",
" print(f\"S MATRIX: {np.diag(s[:k])}\")\n",
" print('*' * 100)\n",
" print(f\"Shape of V matrix: {V[:k,:].shape}\")\n",
" print(f\"V MATRIX: {V[:k,:]}\")\n"
2022-05-09 14:30:12 +02:00
],
"metadata": {
"collapsed": false,
"pycharm": {
"name": "#%%\n"
}
}
2022-05-09 14:18:40 +02:00
},
{
"cell_type": "code",
2022-05-09 14:30:12 +02:00
"execution_count": null,
"outputs": [],
"source": [
"interact(print_matrices, img_name=list_widget, k=int_slider_widget)"
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
"name": "#%%\n"
}
2022-05-09 14:30:12 +02:00
}
2022-05-09 14:18:40 +02:00
},
{
"cell_type": "code",
2022-05-09 14:30:12 +02:00
"execution_count": null,
"outputs": [],
"source": [
"interact(compress_show_gray_images,img_name=list_widget,k=int_slider_widget);"
],
2022-05-09 14:18:40 +02:00
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
2022-05-09 14:18:40 +02:00
"pycharm": {
"name": "#%%\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "markdown",
2022-05-09 14:30:12 +02:00
"source": [
"### Load color images"
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
"name": "#%% md\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "code",
2022-05-09 14:30:12 +02:00
"execution_count": null,
"outputs": [],
"source": [
"color_images = {\n",
" \"cat\":img_as_float(data.chelsea()),\n",
" \"astro\":img_as_float(data.astronaut()),\n",
2022-05-09 14:30:12 +02:00
" \"coffee\":img_as_float(data.coffee()),\n",
" \"koala\": img_as_float(Image.open('koala.jpeg')),\n",
" \"orange\": img_as_float(Image.open('orange.jpeg'))\n",
"}"
2022-05-09 14:30:12 +02:00
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
2022-05-09 14:30:12 +02:00
"name": "#%%\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "markdown",
"source": [
"### Compress color images\n",
"\n",
"Color images are represented in python as 3 dimensional numpy arrays - the third dimension to represent the color values (red,green blue). However, svd method is applicable to two dimensional matrices. So we have to find a way to convert the 3 dimensional array to 2 dimensional arrays, apply svd and reconstruct it back as a 3 dimensional array . There are two ways to do it. We will show both these methods below .\n",
" - reshape method\n",
" - Layer method\n"
2022-05-09 14:30:12 +02:00
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
"name": "#%% md\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "markdown",
"source": [
"#### Reshape method to compress a color image\n",
"This method involves flattening the third dimension of the image array into the second dimension using numpy's reshape method .\n",
" \n",
" ``` image_reshaped = image.reshape((original_shape[0],original_shape[1]*3))```\n",
"\n",
"The svd decomposition is applied on the resulting reshaped array and reconstructed with the desired number of singular values/vectors. The image array is reshaped back to the three dimensions by another call to reshape method.\n",
" \n",
" ```image_reconst = image_reconst.reshape(original_shape)```\n",
"\n"
2022-05-09 14:30:12 +02:00
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
2022-05-09 14:30:12 +02:00
"name": "#%% md\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"def compress_show_color_images_reshape(img_name,k):\n",
" \"\"\"\n",
" compress and display the reconstructed color image using the reshape method \n",
" \"\"\"\n",
" image = color_images[img_name]\n",
" original_shape = image.shape\n",
" image_reshaped = image.reshape((original_shape[0],original_shape[1]*3))\n",
" image_reconst,_ = compress_svd(image_reshaped,k)\n",
" image_reconst = image_reconst.reshape(original_shape)\n",
" compression_ratio =100.0* (k*(original_shape[0] + 3*original_shape[1])+k)/(original_shape[0]*original_shape[1]*original_shape[2])\n",
" plt.title(\"compression ratio={:.2f}\".format(compression_ratio)+\"%\")\n",
" plt.imshow(image_reconst)"
2022-05-09 14:30:12 +02:00
],
"metadata": {
"collapsed": false,
"pycharm": {
"name": "#%%\n"
}
}
},
{
"cell_type": "markdown",
2022-05-09 14:30:12 +02:00
"source": [
"Here is the interactive widget to explore image compression of color images using the reshape method. By dragging the slider to vary $k$, observe how image quality varies. Also, we can explore different images by selecting through the drop down widget."
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
"name": "#%% md\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "code",
2022-05-09 14:30:12 +02:00
"execution_count": null,
"outputs": [],
"source": [
"def compute_k_max_color_images(img_name):\n",
" image = color_images[img_name]\n",
" original_shape = image.shape\n",
" return (original_shape[0]*original_shape[1]*original_shape[2])//(original_shape[0] + 3*original_shape[1] + 1)\n",
"\n",
"\n",
"list_widget = widgets.Dropdown(options=list(color_images.keys()))\n",
"int_slider_widget = widgets.IntSlider(min=1,max=compute_k_max_color_images('cat'))\n",
"def update_k_max_color(*args):\n",
" img_name=list_widget.value\n",
" int_slider_widget.max = compute_k_max_color_images(img_name)\n",
"list_widget.observe(update_k_max_color,'value')"
2022-05-09 14:30:12 +02:00
],
"metadata": {
"collapsed": false,
"pycharm": {
"name": "#%%\n"
}
}
},
{
"cell_type": "code",
2022-05-09 14:30:12 +02:00
"execution_count": null,
"outputs": [],
"source": [
"interact(print_matrices, img_name=list_widget, k=int_slider_widget)"
],
2022-05-09 14:18:40 +02:00
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
2022-05-09 14:18:40 +02:00
"pycharm": {
"name": "#%%\n"
}
2022-05-09 14:30:12 +02:00
}
2022-05-09 14:18:40 +02:00
},
{
"cell_type": "code",
2022-05-09 14:30:12 +02:00
"execution_count": null,
"outputs": [],
"source": [
"interact(compress_show_color_images_reshape,img_name=list_widget,k=int_slider_widget);"
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
"name": "#%%\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "markdown",
"source": [
"### Layers method to compress color images\n",
"In the function ```compress_show_color_images_layer```, we treat a color image as a stack of 3 seperate two dimensional images (Red,blue and green layers) . We apply the truncated svd reconstruction on each two dimensional layer seperately.\n",
"\n",
"```image_reconst_layers = [compress_svd(image[:,:,i],k)[0] for i in range(3)]```\n",
"\n",
"And we put back the reconstructed layers together.\n",
"\n",
"```\n",
"image_reconst = np.zeros(image.shape)\n",
"for i in range(3):\n",
" image_reconst[:,:,i] = image_reconst_layers[i]\n",
"```\n",
"\n",
"\n",
" "
2022-05-09 14:30:12 +02:00
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
2022-05-09 14:30:12 +02:00
"name": "#%% md\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "code",
"execution_count": null,
"outputs": [],
"source": [
"def compress_show_color_images_layer(img_name,k):\n",
" \"\"\"\n",
" compress and display the reconstructed color image using the layer method \n",
" \"\"\"\n",
" image = color_images[img_name]\n",
" original_shape = image.shape\n",
" image_reconst_layers = [compress_svd(image[:,:,i],k)[0] for i in range(3)]\n",
2022-05-09 14:18:40 +02:00
" print(image_reconst_layers)\n",
" image_reconst = np.zeros(image.shape)\n",
" for i in range(3):\n",
" image_reconst[:,:,i] = image_reconst_layers[i]\n",
" \n",
" compression_ratio =100.0*3* (k*(original_shape[0] + original_shape[1])+k)/(original_shape[0]*original_shape[1]*original_shape[2])\n",
" plt.title(\"compression ratio={:.2f}\".format(compression_ratio)+\"%\")\n",
2022-05-09 14:18:40 +02:00
" plt.imshow(image_reconst, vmin=0, vmax=255)\n"
2022-05-09 14:30:12 +02:00
],
"metadata": {
"collapsed": false,
"pycharm": {
"name": "#%%\n"
}
}
},
{
"cell_type": "markdown",
2022-05-09 14:30:12 +02:00
"source": [
"Here is the widget to explore layers method of compressing color images."
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
"name": "#%% md\n"
}
2022-05-09 14:30:12 +02:00
}
},
{
"cell_type": "code",
2022-05-09 14:30:12 +02:00
"execution_count": null,
"outputs": [],
"source": [
"def compute_k_max_color_images_layers(img_name):\n",
" image = color_images[img_name]\n",
" original_shape = image.shape\n",
" return (original_shape[0]*original_shape[1]*original_shape[2])// (3*(original_shape[0] + original_shape[1] + 1))\n",
"\n",
"\n",
"list_widget = widgets.Dropdown(options=list(color_images.keys()))\n",
"int_slider_widget = widgets.IntSlider(min=1,max=compute_k_max_color_images_layers('cat'))\n",
"def update_k_max_color_layers(*args):\n",
" img_name=list_widget.value\n",
" int_slider_widget.max = compute_k_max_color_images_layers(img_name)\n",
"list_widget.observe(update_k_max_color_layers,'value')"
],
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
2022-05-09 12:39:29 +02:00
"pycharm": {
"name": "#%%\n"
}
2022-05-09 14:30:12 +02:00
}
2022-05-09 12:39:29 +02:00
},
{
"cell_type": "code",
"execution_count": null,
2022-05-09 14:30:12 +02:00
"outputs": [],
"source": [
"interact(compress_show_color_images_layer,img_name=list_widget,k=int_slider_widget);"
],
2022-05-09 12:39:29 +02:00
"metadata": {
2022-05-09 14:30:12 +02:00
"collapsed": false,
"pycharm": {
"name": "#%%\n"
}
2022-05-09 14:30:12 +02:00
}
}
],
"metadata": {
"kernelspec": {
"display_name": "Python 3 (ipykernel)",
"language": "python",
"name": "python3"
},
"language_info": {
"codemirror_mode": {
"name": "ipython",
"version": 3
},
"file_extension": ".py",
"mimetype": "text/x-python",
"name": "python",
"nbconvert_exporter": "python",
"pygments_lexer": "ipython3",
2022-05-09 14:30:12 +02:00
"version": "3.8.9"
}
},
"nbformat": 4,
"nbformat_minor": 1
}