Feature Extraction Methods for Images#

The basic idea of feature extraction methods applied to images is to convert raw image data into a numerical representation, typically an array or a set of numerical descriptors. This numerical representation captures important characteristics or features of the image, making it amenable to analysis by machine learning algorithms and statistical techniques.

Feature extraction serves as a bridge between the raw visual content of an image and the mathematical models used in computer vision tasks. By extracting relevant features from images, we can abstract away unnecessary details while retaining essential information that is crucial for the task at hand.

There exist numerous feature extraction methods tailored to different types of image data and tasks. Among them, some standard techniques are commonly employed in image processing pipelines. These methods often include:

  • Pixel Intensity: A straightforward approach involves using the pixel intensity values directly as features. In grayscale images, each pixel’s intensity represents a feature, while in color images, features may include intensity values from different color channels (e.g., Red, Green, Blue).

  • Histogram-based Features: Histograms provide a compact representation of the distribution of pixel intensities in an image. Histogram-based features capture statistical information about the image’s brightness, contrast, and color distribution. Common histogram features include mean, variance, skewness, and kurtosis (see the sketch after this list).

  • Texture Features: Texture refers to the spatial arrangement of pixels in an image and their variations in intensity or color. Texture features describe patterns, structures, and regularities present in different regions of the image. Techniques such as co-occurrence matrices, local binary patterns (LBP), and Gabor filters are commonly used for texture feature extraction.

  • Edge and Contour Features: Edges represent abrupt changes in pixel intensity and often correspond to object boundaries or significant image structures. Edge detection algorithms extract features based on the presence, orientation, and strength of edges in the image. Contour-based features describe the shape, curvature, and spatial arrangement of object boundaries.

  • Transform-based Features: Transform-based methods, such as Fourier transform, wavelet transform, and principal component analysis (PCA), extract features by analyzing the frequency, spatial, or spectral characteristics of the image. These methods help capture global and local image patterns efficiently.
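
For the histogram-based features above, here is a minimal sketch of the idea (assuming img_gray_array is a 2D uint8 array, like the ones built later in this notebook):

import numpy as np
from scipy.stats import skew, kurtosis

# Hedged sketch: basic histogram-based features of a gray-scale image array.
pixels = img_gray_array.flatten().astype(float)
hist_features = np.array([pixels.mean(), pixels.var(), skew(pixels), kurtosis(pixels)])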

Requirements#

Here we gather the required libraries, classes and functions for this notebook.

import polars as pl
import numpy as np
from PIL import Image
import os
import sys
import matplotlib.pyplot as plt
import seaborn as sns
sns.set_style('whitegrid')
from skimage import feature
import tensorflow as tf
from keras.applications.vgg16 import preprocess_input
from keras.applications.vgg16 import VGG16 
from keras.models import Model

PyImageML is a Python package developed as part of this project. It provides several utilities for plotting images and extracting features from them; these features can later be used with Machine Learning algorithms to solve typical ML tasks.

sys.path.insert(0, r"C:\Users\fscielzo\Documents\Packages\PyImageML_Package_Private")
from PyImageML.preprocessing import ImageFeaturesExtraction, ImageTensorFeaturesExtraction

Reading the data#

# Extracting the names of the image files as well as their class/category.
files_list_name = r'C:\Users\fscielzo\Documents\DataScience-GitHub\Image Analysis\Image-Classification\Fire-Detection\files_list.txt'
files_df = pl.read_csv(files_list_name, separator='\t', has_header=False, new_columns=['path', 'class'])
img_files_names = [files_df['path'][i].split('/')[1] for i in range(len(files_df))]

# Building a list with the current paths of the data-set images.
img_path_list = []
folder_path = r'C:\Users\fscielzo\Documents\DataScience-GitHub\Image Analysis\Image-Classification\Fire-Detection\Data'
for filename in img_files_names:
    img_path_list.append(os.path.join(folder_path, filename))

Defining Response and Predictors#

In this section we define the response and the predictors.

  • Predictors: a list with the paths of the image files.

  • Response: a vector (1D array) that identifies the category of each image.

Y = files_df['class'].to_numpy()
X = img_path_list 

Pixel Method#

Given an image \(\mathcal{I}\) of height \(h\) and width \(w\), its pixel representation differs depending on whether it is color or gray-scale.

In the Pixels method for extracting features from images, the following points are crucial:

  • \(\mathcal{I}\) is defined by its pixels.

  • \(\mathcal{I}\) has \(h\cdot w\) pixels.

  • \(\mathcal{I}\) can be represented as a 2D array (matrix) of pixels, where each element/position is a pixel.

    \[\begin{split}M(\mathcal{I}) = \begin{pmatrix} p_{1,1} & p_{1,2} &\dots & p_{1,w} \\ p_{2,1} & p_{2,2} & \dots & p_{2,w} \\ \vdots & \vdots & \ddots & \vdots \\ p_{h,1} & p_{h,2} & \dots & p_{h,w} \end{pmatrix} = \left( p_{ij}\right)_{\substack{i=1,..,h \\ j=1,..,w}}\end{split}\]

    Where \(p_{ij}\) is the pixel of \(\mathcal{I}\) that occupies position \((i,j)\) in the image, namely, the pixel \((i,j)\).

  • If \(\mathcal{I}\) is gray-scale:

    \[\begin{split}p_{ij} \in \{0, 1, \dots, 255\} \subset \mathbb{Z}\\[0.5cm]\end{split}\]
    • The closer \(p_{ij}\) is to \(0\), the lower the gray intensity of the pixel (the darker it is).

    • The closer \(p_{ij}\) is to \(255\), the greater the gray intensity of the pixel (the brighter it is).

  • If \(\mathcal{I}\) is color-scale:

    \[p_{ij} = (R_{ij}, G_{ij}, B_{ij})\]
    • \(R_{ij}, G_{ij}, B_{ij} \in \{0, 1, \dots, 255\}\) are the red, green and blue channels, respectively; that is, the red, green and blue components of the pixel \(p_{ij}\).

    • The closer \(R_{ij}, G_{ij}, B_{ij}\) are to \(0\), the lower the red, green and blue intensity of the pixel, respectively.

    • The closer \(R_{ij}, G_{ij}, B_{ij}\) are to \(255\), the greater the red, green and blue intensity of the pixel, respectively.

Extracting pixel features from gray-scale image#

When \(\mathcal{I}\) is gray-scale, the matrix representation described above applies directly.

Python gives us \(M(\mathcal{I})\) directly as a 2D NumPy array. (PIL’s convert('L'), used below, converts RGB to gray luminance using, approximately, the ITU-R 601 weights \(0.299R + 0.587G + 0.114B\).)

X = img_path_list 
example_img = Image.open(X[2])
example_img_gray = example_img.convert('L')
example_img_gray
  • Building the pixels matrix for a gray-scale image: \(M(\mathcal{I})\)

    \(M(\mathcal{I}) = \) np.array(example_img_gray)

img_gray_array = np.array(example_img_gray)
img_gray_array
array([[ 48,  87, 112, ..., 251, 252, 252],
       [ 33,  71,  53, ..., 252, 252, 252],
       [ 94, 105, 140, ..., 251, 252, 253],
       ...,
       [107, 102,  98, ..., 109, 108, 106],
       [102, 106, 109, ..., 119, 112, 109],
       [108, 107, 105, ..., 135, 122, 117]], dtype=uint8)

\(p_{12} =\) img_gray_array[0,1]

img_gray_array[0,1]
87

\(p_{40, 12} =\) img_gray_array[39,11]

img_gray_array[39,11]
62

Plotting the array as an image

plt.imshow(img_gray_array, cmap='gray')
plt.axis('off')  # Hide the axis
plt.show()

Extracting pixel features from color image#

When \(\mathcal{I}\) is color, there are slight differences with respect to the previous approach.

Python doesn’t give us \(M(\mathcal{I})\) directly, but we can access its elements by means of a 3D array.

example_img

That 3D array is obtained with np.array(example_img)

img_color_array = np.array(example_img)
img_color_array
array([[[ 17,  73,   0],
        [ 58, 111,  39],
        [ 84, 136,  64],
        ...,
        [253, 250, 255],
        [254, 251, 255],
        [254, 250, 255]],

       [[  0,  57,   0],
        [ 40,  96,  25],
        [ 25,  76,   7],
        ...,
        [252, 251, 255],
        [252, 251, 255],
        [253, 251, 255]],

       [[ 60, 120,  48],
        [ 73, 130,  59],
        [112, 163,  94],
        ...,
        [251, 251, 251],
        [252, 251, 255],
        [253, 252, 255]],

       ...,

       [[ 93, 112, 116],
        [ 88, 107, 111],
        [ 84, 103, 107],
        ...,
        [101, 112, 116],
        [100, 111, 115],
        [ 98, 109, 113]],

       [[ 88, 107, 111],
        [ 92, 111, 115],
        [ 95, 114, 118],
        ...,
        [111, 122, 126],
        [104, 115, 119],
        [101, 112, 116]],

       [[ 94, 113, 117],
        [ 93, 112, 116],
        [ 91, 110, 114],
        ...,
        [127, 138, 142],
        [114, 125, 129],
        [109, 120, 124]]], dtype=uint8)

The 3D array img_color_array has shape \((h, w, 3)\), where \(h\) and \(w\) are respectively the height and width of the image.

img_color_array.shape
(182, 277, 3)

img_color_array is made up of \(h\) 2D arrays of size \(w\times 3\).

The \(i\)-th 2D array of img_color_array, namely img_color_array[i,:], contains in its rows the 1D arrays that represent the pixels \((i,j)\), for \(j=1,\dots,w\).

In other words, letting \(A(\mathcal{I})\) be the 3D array img_color_array, it contains \(h\) 2D arrays \(A(\mathcal{I})[i,:]\), each containing the pixels \((p_{ij} : j=1,\dots,w)\):

\[\begin{split}A(\mathcal{I})[i,:] = \begin{pmatrix} p_{i1} \\ p_{i2} \\ \dots \\ p_{iw} \end{pmatrix} = \begin{pmatrix} R_{i1} & G_{i1} & B_{i1} \\ R_{i2}& G_{i2}& B_{i2} \\ \dots & \dots & \dots \\ R_{iw}& G_{iw}& B_{iw} \end{pmatrix} \end{split}\]

Therefore:

\[A(\mathcal{I})[i,:][j,:] = p_{ij} = (R_{ij}, G_{ij}, B_{ij})\]

The flattened representation of \(A(\mathcal{I})[i,:]\) is:

\[\mathcal{F}(A(\mathcal{I})[i,:]) = (p_{i1}, \dots , p_{iw}) = (R_{i1} , G_{i1} , B_{i1}, \dots, R_{iw}, G_{iw}, B_{iw})\]

So, we can construct the \(M(\mathcal{I})\) matrix as:

\[\begin{split} M(\mathcal{I}) = \Bigl(\mathcal{F}(A(\mathcal{I})[i,:])\Bigr)_{\substack{i=1,..,h}} = \begin{pmatrix} p_{11} & \dots & p_{1w} \\ p_{21} & \dots& p_{2w} \\ \dots &\dots & \dots\\ p_{h1} & \dots& p_{hw} \end{pmatrix} = \begin{pmatrix} R_{11} & G_{11} & B_{11}& \dots & R_{1w}& G_{1w}& B_{1w} \\ R_{21} & G_{21} & B_{21}& \dots& R_{2w}& G_{2w}& B_{2w} \\ \dots &\dots & \dots& \dots & \dots & \dots& \dots\\ R_{h1} & G_{h1} & B_{h1}& \dots& R_{hw}& G_{hw}& B_{hw} \end{pmatrix} \end{split}\]

For instance: \(\hspace{0.1cm} p_{13}=\)img_color_array[0,:][2,:] = img_color_array[0,2]

img_color_array[0,:][2,:]
array([ 84, 136,  64], dtype=uint8)
img_color_array[0,2]
array([ 84, 136,  64], dtype=uint8)
  • Building the pixels matrix for a color image: \(M(\mathcal{I})\)

    We use the flatten procedure described above.

# Flattening each row of the 3D array to build the 2D pixels matrix M(I).
h = img_color_array.shape[0]
w = img_color_array.shape[1]
new_w = w*img_color_array.shape[2]  # each row now holds w pixels x 3 channels
img_color_2D_array = np.zeros((h,new_w))
for i in range(0,h):
    img_color_2D_array[i,:] = img_color_array[i,:].flatten()
img_color_2D_array
array([[ 17.,  73.,   0., ..., 254., 250., 255.],
       [  0.,  57.,   0., ..., 253., 251., 255.],
       [ 60., 120.,  48., ..., 253., 252., 255.],
       ...,
       [ 93., 112., 116., ...,  98., 109., 113.],
       [ 88., 107., 111., ..., 101., 112., 116.],
       [ 94., 113., 117., ..., 109., 120., 124.]])
img_color_2D_array.shape
(182, 831)
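
For reference, the same matrix can be obtained in one step with NumPy's row-major reshape, which lays out each row's pixels channel by channel exactly as the flattening above:

# Equivalent one-step construction (standard NumPy row-major behavior).
img_color_2D_array = img_color_array.reshape(h, -1)  # shape (h, 3*w)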

We can plot the 2D array img_color_2D_array to see that this representation of the original color image is not faithful at all, since it modifies the size and loses the color pattern.

plt.imshow(img_color_2D_array, cmap='gray')
plt.axis('off')  # Hide the axis
plt.show()

If we want to plot the original color image we need the 3D array img_color_array that contains all its information, specifically regarding the color and the size.

plt.imshow(img_color_array)  # RGB input; Matplotlib ignores cmap for 3-channel arrays
plt.axis('off')  # Hide the axis
plt.show()

Pixel features array#

Once we know how to extract features from images, the next step is to build a features array with those extracted features, to be used with Machine Learning models.

In this section this topic will be addressed following the different feature extraction methods explained before.

Most Machine Learning algorithms need a matrix (2D array) as input, usually called the features/predictors matrix, but others are able to work with tensors, that is, 3D arrays, usually referred to as sequential data. The latter is especially common in the Deep Learning field.

Here we will cover both approaches, so that we are going to learn how to build both predictors matrices and tensors.

Sequential data: features tensor#

Given \(n\) images \(\mathcal{I}_1,\dots , \mathcal{I}_n\) of the same size \(h\times w\), and using the pixels method for feature extraction, a features tensor for those images can be built as follows:

\[\begin{split}X = \begin{pmatrix} M(\mathcal{I}_1) \\ M(\mathcal{I}_2) \\ \dots \\ M(\mathcal{I}_n) \end{pmatrix} \end{split}\]

Where \(M(\mathcal{I}_i)\) is the pixels matrix of the image \(\mathcal{I}_i\).

This tensor is a 3D array of size \(n\times h\times w\) that contains the pixels matrix of each image; that is, it is an array of \(n\) matrices.
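
A minimal sketch of this construction (assuming all images are first resized to a common height and width; the 240 and 184 values follow the sizes used later in this notebook):

# Hedged sketch: stack the pixel matrices of n same-size gray-scale images
# into an (n, h, w) features tensor. The resize to a common size is an assumption.
h, w = 240, 184
X_tensor = np.stack([np.array(Image.open(path).convert('L').resize((w, h))) for path in img_path_list])
# X_tensor.shape == (len(img_path_list), h, w)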

Tabular data: features matrix#

Given \(n\) images \(\mathcal{I}_1,\dots , \mathcal{I}_n\) of the same size \(h\times w\), and using the pixels method for feature extraction, a features matrix for those images can be built as follows:

\[\begin{split}X = \begin{pmatrix} v(\mathcal{I}_1) \\ v(\mathcal{I}_2) \\ \dots \\ v(\mathcal{I}_n) \end{pmatrix} \end{split}\]

Where \(v(\mathcal{I}_i)\) is the vectorial pixels representation (1D array) of the image \(\mathcal{I}_i\), and \(p\) is the length of \(v(\mathcal{I}_i)\).

This matrix is a 2D array of size \(n\times p\) that contains the flattened pixels matrix (img_pixels_array.flatten()) of each image, that is, the pixels vector of each image.

Vectorial representation of an image#

\(v(\mathcal{I}_i)\) is the 1D array resulting from flattening \(M(\mathcal{I}_i)\) \(\Rightarrow\) img_pixels_array.flatten()

  • If \(\mathcal{I}_i\) is gray-scale:

    \[v(\mathcal{I}_i) = (p_{11}, \dots, p_{1w}, p_{21},\dots,p_{2w}, \dots,p_{h1},\dots,p_{hw})^{\prime}\]

    with a size of \(h\cdot w\).

  • If \(\mathcal{I}_i\) is color:

    \[v(\mathcal{I}_i) = (R_{11}, G_{11}, B_{11},\dots,R_{1w},G_{1w},B_{1w},\dots,R_{h1},G_{h1},B_{h1},\dots,R_{hw},G_{hw},B_{hw})^{\prime}\]

    and its size is \(3\cdot h \cdot w\).

We can obtain the vectorial representation of both the gray-scale and color images using the method flatten() on img_gray_array and img_color_array, respectively.

img_gray_pixels_vector = img_gray_array.flatten()
img_gray_pixels_vector
array([ 48,  87, 112, ..., 135, 122, 117], dtype=uint8)
img_color_pixels_vector = img_color_array.flatten()
img_color_pixels_vector
array([ 17,  73,   0, ..., 109, 120, 124], dtype=uint8)

Extracting the vector img_pixels_vector for each image in the data-set and stacking them as rows, we obtain a pixels features matrix.

Once we have

\[\begin{split}X = \begin{pmatrix} v(\mathcal{I}_1) \\ v(\mathcal{I}_2) \\ \dots \\ v(\mathcal{I}_n) \end{pmatrix} \end{split}\]

we have a quantitative predictors matrix with images as rows (observations/individuals) and pixels (gray-scale) or channel-pixels (color-scale) as columns (features/predictors).

This matrix has size \(n\times p\), with \(p=h\cdot w\) if images are gray-scale, and \(p=3\cdot h \cdot w\) if images are color-scale.
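
A minimal sketch of this construction for gray-scale images (again assuming all images are resized to a common size, with illustrative values):

# Hedged sketch: stack the flattened pixel matrices v(I_i) as rows of an (n, p)
# features matrix, with p = h * w. The common size is an assumption.
h, w = 240, 184
X_matrix = np.vstack([np.array(Image.open(path).convert('L').resize((w, h))).flatten() for path in img_path_list])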

  • The problem with this approach is the high dimensionality of \(p\).

    To mitigate this, dimensionality reduction techniques such as PCA can be used, as in the sketch below.
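
A hedged sketch of that reduction (the number of components, 50, is an arbitrary illustration, and X_matrix is the features matrix from the sketch above):

from sklearn.decomposition import PCA

# Hedged sketch: project the (n, p) pixel features matrix onto its first
# 50 principal components to reduce dimensionality.
pca = PCA(n_components=50)
X_reduced = pca.fit_transform(X_matrix)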

Plotting the vector as an image

plt.imshow(img_gray_pixels_vector.reshape(img_gray_array.shape), cmap='gray')
plt.axis('off')  # Hide the axis
plt.show()
plt.imshow(img_color_pixels_vector.reshape(img_color_array.shape), cmap='gray')
plt.axis('off')  # Hide the axis
plt.show()

Getting pixels features matrix for gray-scale images#

In this section we are going to use the class ImageFeaturesExtraction from the PyImageML package, which has been developed to extract features arrays from data-sets of images. It supports several feature extraction methods, as well as different functionalities such as working with both color and gray images, applying several filters, and choosing among different output formats and strategies for the feature extraction methods.

In this section ImageFeaturesExtraction will be applied with the pixels method for feature extraction on gray-scale images, using the data-set of images presented above.

  • Array format

# Defining the desired size of the images
img_height = 240
img_width = 184
img_feature_extraction = ImageFeaturesExtraction(method='pixels', image_height=img_height, image_width=img_width, 
                                                 convert_to_gray=True, filter=None, format='array')

gray_pixels_features_matrix = img_feature_extraction.fit_transform(X=X)
gray_pixels_features_matrix
array([[ 26,  19,  28, ..., 115, 110, 109],
       [ 44,  96, 109, ..., 151, 149, 149],
       [ 62, 102,  75, ..., 109, 131, 118],
       ...,
       [ 11,  10,  10, ...,   1,   2,   2],
       [  5,   6,   6, ...,   6,   4,  27],
       [148, 150, 152, ...,  33,  43,  48]], dtype=uint8)
gray_pixels_features_matrix.shape
(300, 44160)
  • Data-frame format

img_feature_extraction = ImageFeaturesExtraction(method='pixels', image_height=img_height, image_width=img_width, 
                                                 convert_to_gray=True, filter=None, format='data-frame')

gray_pixels_features_matrix = img_feature_extraction.fit_transform(X=X)
gray_pixels_features_matrix
shape: (300, 44_160)

[polars DataFrame output truncated: 300 rows × 44,160 columns of dtype u8, named pixel_1_1 … pixel_184_240]
  • Applying equalization

img_feature_extraction = ImageFeaturesExtraction(method='pixels', image_height=img_height, image_width=img_width, 
                                                 convert_to_gray=True, filter='equalized', format='array')

gray_pixels_features_filtered_matrix = img_feature_extraction.fit_transform(X=X)
gray_pixels_features_filtered_matrix
array([[6.91349638e-02, 4.56748188e-02, 7.71739130e-02, ...,
        5.79211957e-01, 5.48550725e-01, 5.41802536e-01],
       [6.11186594e-02, 3.41553442e-01, 4.19723732e-01, ...,
        6.42255435e-01, 6.28736413e-01, 6.28736413e-01],
       [3.23596014e-01, 6.44021739e-01, 4.18025362e-01, ...,
        7.19791667e-01, 8.49682971e-01, 8.01041667e-01],
       ...,
       [5.70312500e-01, 5.47961957e-01, 5.47961957e-01, ...,
        9.89130435e-02, 2.85665761e-01, 2.85665761e-01],
       [2.26449275e-04, 4.30253623e-04, 4.30253623e-04, ...,
        4.30253623e-04, 9.05797101e-05, 8.42617754e-02],
       [8.93432971e-01, 8.99230072e-01, 9.05049819e-01, ...,
        6.14356884e-02, 1.38337862e-01, 1.81227355e-01]])
gray_pixels_features_filtered_matrix.shape
(300, 44160)
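
The 'equalized' filter presumably corresponds to histogram equalization; the values in \([0, 1]\) shown above are consistent with skimage's implementation, a minimal sketch of which is:

from skimage import exposure

# Hedged sketch: histogram equalization of a gray-scale image array; the result
# is a float array with values in [0, 1], matching the output shown above.
img_gray_equalized = exposure.equalize_hist(img_gray_array)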
  • Applying sobel filter

img_feature_extraction = ImageFeaturesExtraction(method='pixels', image_height=img_height, image_width=img_width, 
                                                 convert_to_gray=True, filter='sobel', format='array')

gray_pixels_features_filtered_matrix = img_feature_extraction.fit_transform(X=X)
gray_pixels_features_filtered_matrix
array([[ 2.6925824 , 13.        , 20.98809186, ...,  7.77817459,
         4.24264069,  0.70710678],
       [37.83186488, 50.25497488, 32.669749  , ...,  1.60078106,
         1.41421356,  0.        ],
       [27.90721233, 18.51013236, 17.83605898, ..., 15.39683409,
         7.8142498 ,  9.48683298],
       ...,
       [ 0.70710678,  0.70710678,  0.        , ...,  0.70710678,
         0.70710678,  0.        ],
       [ 0.75      ,  0.55901699,  0.70710678, ..., 27.0935878 ,
        28.7108429 , 15.97654531],
       [11.01135777, 12.35161933, 14.63087489, ...,  2.15058132,
        10.85414667,  4.27200187]])
gray_pixels_features_filtered_matrix.shape
(300, 44160)

Getting pixels features matrix for color images#

As in the previous section, we use the class ImageFeaturesExtraction from the PyImageML package; here it is applied with the pixels method for feature extraction on color images, using the data-set of images presented above.

  • Array format

img_feature_extraction = ImageFeaturesExtraction(method='pixels', image_height=img_height, image_width=img_width, 
                                                 convert_to_gray=False, filter=None, format='array')

color_pixels_features_matrix = img_feature_extraction.fit_transform(X=X)
color_pixels_features_matrix
array([[ 23,  28,  21, ..., 120, 111,  72],
       [ 43,  48,  26, ..., 136, 152, 168],
       [ 31,  87,  13, ..., 110, 121, 125],
       ...,
       [ 26,   5,   2, ...,   3,   1,   2],
       [ 18,   0,   0, ...,  41,  22,  14],
       [105, 155, 224, ...,  63,  44,  29]], dtype=uint8)
color_pixels_features_matrix.shape
(300, 132480)
  • Data-frame format

img_feature_extraction = ImageFeaturesExtraction(method='pixels', image_height=img_height, image_width=img_width, 
                                                 convert_to_gray=False, filter=None, format='data-frame')

color_pixels_features_matrix = img_feature_extraction.fit_transform(X=X)
color_pixels_features_matrix
shape: (300, 132_480)

[polars DataFrame output truncated: 300 rows × 132,480 columns of dtype u8, named R_1_1, G_1_1, B_1_1 … R_184_240, G_184_240, B_184_240]
  • Applying equalization

img_feature_extraction = ImageFeaturesExtraction(method='pixels', image_height=img_height, image_width=img_width, 
                                                 convert_to_gray=False, filter='equalized', format='array')

color_pixels_features_filtered_matrix = img_feature_extraction.fit_transform(X=X)
color_pixels_features_filtered_matrix
array([[4.65089942e-02, 5.66196451e-02, 4.24647339e-02, ...,
        5.48270993e-01, 5.07150669e-01, 3.28962596e-01],
       [4.72852531e-02, 5.27835383e-02, 2.85910833e-02, ...,
        6.21546825e-01, 6.94669981e-01, 7.67793137e-01],
       [1.28630418e-01, 3.60995045e-01, 5.39417883e-02, ...,
        6.55259591e-01, 7.20785550e-01, 7.44613171e-01],
       ...,
       [5.26830199e-01, 1.01313500e-01, 4.05253999e-02, ...,
        6.63003517e-02, 2.21001172e-02, 4.42002344e-02],
       [2.52291134e-04, 0.00000000e+00, 0.00000000e+00, ...,
        2.30458316e-02, 1.23660560e-02, 7.86930834e-03],
       [3.94867372e-01, 5.82899453e-01, 8.42383726e-01, ...,
        2.05513093e-01, 1.43532954e-01, 9.46012648e-02]])
color_pixels_features_filtered_matrix.shape
(300, 132480)
  • Applying canny filter

img_feature_extraction = ImageFeaturesExtraction(method='pixels', image_height=img_height, image_width=img_width, 
                                                 convert_to_gray=False, filter='canny', format='array')

color_pixels_features_filtered_matrix = img_feature_extraction.fit_transform(X=X)
color_pixels_features_filtered_matrix
array([[0.00900378, 0.01096112, 0.00822084, ..., 0.00277297, 0.002565  ,
        0.00166378],
       [0.13352273, 0.14904863, 0.08073467, ..., 0.00224478, 0.00250888,
        0.00277297],
       [0.03868009, 0.10855381, 0.01622068, ..., 0.03273887, 0.03601276,
        0.03720327],
       ...,
       [0.00277297, 0.00053326, 0.00021331, ..., 0.        , 0.        ,
        0.        ],
       [0.00277297, 0.        , 0.        , ..., 0.06980089, 0.03745414,
        0.02383445],
       [0.02024147, 0.02988026, 0.0431818 , ..., 0.01675295, 0.01170047,
        0.00771167]])
color_pixels_features_filtered_matrix.shape
(300, 132480)

Getting pixels features tensor for gray-scale images#

In this section we are going to present the class ImageTensorFeaturesExtraction from the PyImageML package, which has been developed to extract features tensors from data-sets of images. It is based on the pixels method for feature extraction, allows both color and gray images, and offers different formats to deal with color images.

In this section ImageTensorFeaturesExtraction will be applied on gray-scale images, using the data-set of images presented above.

img_feature_extraction = ImageTensorFeaturesExtraction(image_height=img_height, image_width=img_width, convert_to_gray=True)
gray_pixels_features_tensor = img_feature_extraction.fit_transform(X=X)
gray_pixels_features_tensor
array([[[ 26,  19,  28, ...,  75,  63,  59],
        [ 21,  48,  43, ...,  81,  53,  46],
        [ 21,  49,  47, ...,  92,  59,  44],
        ...,
        [145, 206, 205, ..., 119, 110, 109],
        [130, 185, 173, ..., 115, 110, 109],
        [136, 163, 170, ..., 115, 110, 109]],

       [[ 44,  96, 109, ..., 225, 233, 229],
        [ 42, 100, 130, ..., 220, 235, 246],
        [ 61,  77, 139, ..., 218, 241, 239],
        ...,
        [ 73,  70,  80, ..., 151, 153, 153],
        [ 71,  69,  77, ..., 151, 149, 149],
        [ 71,  69,  78, ..., 151, 149, 149]],

       [[ 62, 102,  75, ..., 251, 251, 252],
        [ 49,  70,  60, ..., 251, 252, 252],
        [ 65,  83,  85, ..., 251, 252, 252],
        ...,
        [104, 105, 102, ..., 103, 113, 108],
        [105, 108, 103, ..., 105, 122, 113],
        [108, 106, 102, ..., 109, 131, 118]],

       ...,

       [[ 11,  10,  10, ...,   4,   4,   4],
        [ 11,  10,  10, ...,   4,   4,   4],
        [ 11,  10,  10, ...,   4,   4,   4],
        ...,
        [  2,   2,   2, ...,   2,   2,   2],
        [  2,   2,   2, ...,   1,   2,   2],
        [  2,   2,   2, ...,   1,   2,   2]],

       [[  5,   6,   6, ...,   7,   8,   8],
        [  6,   6,   6, ...,   7,   9,   9],
        [  6,   6,   7, ...,   8,  11,  11],
        ...,
        [  5,   8,  25, ...,  55, 138,  39],
        [  5,   7,  22, ...,  20,  65,  34],
        [  5,   7,  22, ...,   6,   4,  27]],

       [[148, 150, 152, ...,  41,  39,  44],
        [133, 133, 132, ...,  41,  42,  40],
        [123, 123, 124, ...,  36,  40,  35],
        ...,
        [ 53,  53,  53, ...,  41,  43,  47],
        [ 54,  54,  53, ...,  35,  44,  51],
        [ 55,  57,  54, ...,  33,  43,  48]]], dtype=uint8)
gray_pixels_features_tensor.shape
(300, 240, 184)

Getting pixels features tensor for color images#

As in the previous section, we use the class ImageTensorFeaturesExtraction from the PyImageML package; here it is applied on color images, using the data-set of images presented above.

  • Tensor of 2D arrays

img_feature_extraction = ImageTensorFeaturesExtraction(image_height=img_height, image_width=img_width, convert_to_gray=False, color_dim='2D')
color_pixels_features_tensor = img_feature_extraction.fit_transform(X=X)
color_pixels_features_tensor
array([[[ 23.,  28.,  21., ...,  52.,  64.,  54.],
        [ 18.,  23.,  17., ...,  39.,  51.,  38.],
        [ 18.,  23.,  17., ...,  38.,  49.,  34.],
        ...,
        [142., 143., 162., ..., 120., 111.,  71.],
        [125., 130., 142., ..., 120., 111.,  72.],
        [132., 136., 148., ..., 120., 111.,  72.]],

       [[ 43.,  48.,  26., ..., 226., 229., 236.],
        [ 41.,  46.,  24., ..., 243., 246., 251.],
        [ 60.,  65.,  43., ..., 236., 239., 244.],
        ...,
        [ 69.,  74.,  78., ..., 147., 154., 162.],
        [ 68.,  71.,  76., ..., 135., 152., 169.],
        [ 68.,  71.,  76., ..., 136., 152., 168.]],

       [[ 31.,  87.,  13., ..., 254., 250., 255.],
        [ 16.,  73.,   8., ..., 253., 251., 255.],
        [ 32.,  90.,  25., ..., 253., 251., 255.],
        ...,
        [ 90., 109., 113., ..., 100., 111., 115.],
        [ 91., 110., 114., ..., 105., 116., 120.],
        [ 94., 113., 117., ..., 110., 121., 125.]],

       ...,

       [[ 26.,   5.,   2., ...,  10.,   1.,   2.],
        [ 26.,   5.,   2., ...,  10.,   1.,   2.],
        [ 27.,   4.,   2., ...,  10.,   1.,   2.],
        ...,
        [  3.,   2.,   0., ...,   3.,   1.,   2.],
        [  3.,   2.,   0., ...,   3.,   1.,   2.],
        [  3.,   2.,   0., ...,   3.,   1.,   2.]],

       [[ 18.,   0.,   0., ...,  26.,   1.,   0.],
        [ 18.,   1.,   0., ...,  27.,   2.,   0.],
        [ 19.,   1.,   0., ...,  29.,   4.,   0.],
        ...,
        [ 16.,   0.,   2., ...,  52.,  35.,  24.],
        [ 16.,   0.,   5., ...,  48.,  30.,  20.],
        [ 15.,   0.,   8., ...,  41.,  22.,  14.]],

       [[105., 155., 224., ...,  36.,  48.,  45.],
        [ 90., 140., 209., ...,  32.,  44.,  42.],
        [ 80., 130., 199., ...,  28.,  38.,  36.],
        ...,
        [ 52.,  56.,  41., ...,  62.,  43.,  28.],
        [ 53.,  57.,  42., ...,  66.,  47.,  32.],
        [ 54.,  58.,  43., ...,  63.,  44.,  29.]]])
color_pixels_features_tensor.shape
(300, 240, 552)
  • Tensor of 3D arrays

img_feature_extraction = ImageTensorFeaturesExtraction(image_height=img_height, image_width=img_width, convert_to_gray=False, color_dim='3D')
color_pixels_features_tensor = img_feature_extraction.fit_transform(X=X)
color_pixels_features_tensor
array([[[[ 23,  28,  21],
         [ 15,  22,  12],
         [ 25,  31,  19],
         ...,
         [ 68,  81,  60],
         [ 55,  67,  59],
         [ 52,  64,  54]],

        [[ 18,  23,  17],
         [ 45,  50,  43],
         [ 40,  46,  36],
         ...,
         [ 74,  87,  66],
         [ 46,  58,  45],
         [ 39,  51,  38]],

        [[ 18,  23,  17],
         [ 47,  51,  44],
         [ 44,  50,  40],
         ...,
         [ 85,  98,  78],
         [ 53,  64,  49],
         [ 38,  49,  34]],

        ...,

        [[142, 143, 162],
         [204, 205, 220],
         [203, 205, 214],
         ...,
         [129, 121,  82],
         [121, 112,  72],
         [120, 111,  71]],

        [[125, 130, 142],
         [181, 185, 197],
         [167, 174, 183],
         ...,
         [125, 117,  76],
         [121, 112,  73],
         [120, 111,  72]],

        [[132, 136, 148],
         [160, 163, 175],
         [165, 171, 180],
         ...,
         [125, 117,  76],
         [121, 112,  73],
         [120, 111,  72]]],


       [[[ 43,  48,  26],
         [ 94, 100,  80],
         [105, 114,  97],
         ...,
         [222, 225, 233],
         [230, 233, 240],
         [226, 229, 236]],

        [[ 41,  46,  24],
         [ 98, 105,  83],
         [125, 135, 118],
         ...,
         [217, 220, 228],
         [232, 235, 242],
         [243, 246, 251]],

        [[ 60,  65,  43],
         [ 75,  82,  60],
         [134, 144, 126],
         ...,
         [215, 218, 226],
         [238, 241, 249],
         [236, 239, 244]],

        ...,

        [[ 69,  74,  78],
         [ 66,  71,  75],
         [ 75,  81,  85],
         ...,
         [146, 152, 156],
         [147, 154, 162],
         [147, 154, 162]],

        [[ 68,  71,  76],
         [ 66,  69,  74],
         [ 74,  77,  82],
         ...,
         [143, 152, 166],
         [135, 152, 169],
         [135, 152, 169]],

        [[ 68,  71,  76],
         [ 66,  69,  74],
         [ 75,  78,  83],
         ...,
         [143, 152, 165],
         [136, 152, 168],
         [136, 152, 168]]],


       [[[ 31,  87,  13],
         [ 74, 126,  54],
         [ 50,  97,  27],
         ...,
         [253, 250, 250],
         [253, 250, 255],
         [254, 250, 255]],

        [[ 16,  73,   8],
         [ 42,  94,  23],
         [ 35,  82,  13],
         ...,
         [252, 251, 250],
         [252, 251, 255],
         [253, 251, 255]],

        [[ 32,  90,  25],
         [ 54, 107,  37],
         [ 60, 107,  39],
         ...,
         [252, 251, 248],
         [252, 251, 254],
         [253, 251, 255]],

        ...,

        [[ 90, 109, 113],
         [ 91, 110, 114],
         [ 88, 107, 111],
         ...,
         [ 95, 106, 110],
         [105, 116, 120],
         [100, 111, 115]],

        [[ 91, 110, 114],
         [ 94, 113, 117],
         [ 89, 108, 112],
         ...,
         [ 97, 108, 112],
         [114, 125, 129],
         [105, 116, 120]],

        [[ 94, 113, 117],
         [ 92, 111, 115],
         [ 88, 107, 111],
         ...,
         [101, 112, 116],
         [123, 134, 138],
         [110, 121, 125]]],


       ...,


       [[[ 26,   5,   2],
         [ 25,   4,   1],
         [ 25,   4,   1],
         ...,
         [ 12,   0,   0],
         [ 10,   1,   2],
         [ 10,   1,   2]],

        [[ 26,   5,   2],
         [ 25,   4,   1],
         [ 25,   4,   1],
         ...,
         [ 12,   0,   0],
         [ 10,   1,   2],
         [ 10,   1,   2]],

        [[ 27,   4,   2],
         [ 26,   3,   1],
         [ 26,   3,   1],
         ...,
         [ 12,   0,   0],
         [ 10,   1,   2],
         [ 10,   1,   2]],

        ...,

        [[  3,   2,   0],
         [  3,   2,   0],
         [  3,   2,   0],
         ...,
         [  3,   2,   0],
         [  3,   1,   2],
         [  3,   1,   2]],

        [[  3,   2,   0],
         [  3,   2,   0],
         [  3,   2,   0],
         ...,
         [  2,   1,   0],
         [  3,   1,   2],
         [  3,   1,   2]],

        [[  3,   2,   0],
         [  3,   2,   0],
         [  3,   2,   0],
         ...,
         [  2,   1,   0],
         [  3,   1,   2],
         [  3,   1,   2]]],


       [[[ 18,   0,   0],
         [ 19,   1,   0],
         [ 19,   1,   0],
         ...,
         [ 25,   0,   0],
         [ 26,   1,   0],
         [ 26,   1,   0]],

        [[ 18,   1,   0],
         [ 19,   1,   0],
         [ 19,   1,   0],
         ...,
         [ 25,   0,   0],
         [ 27,   2,   0],
         [ 27,   2,   0]],

        [[ 19,   1,   0],
         [ 19,   1,   0],
         [ 20,   2,   0],
         ...,
         [ 26,   1,   0],
         [ 29,   4,   0],
         [ 29,   4,   0]],

        ...,

        [[ 16,   0,   2],
         [ 26,   0,   0],
         [ 55,  14,   1],
         ...,
         [ 62,  53,  51],
         [147, 135, 129],
         [ 52,  35,  24]],

        [[ 16,   0,   5],
         [ 25,   0,   0],
         [ 53,  11,   1],
         ...,
         [ 27,  17,  17],
         [ 75,  62,  57],
         [ 48,  30,  20]],

        [[ 15,   0,   8],
         [ 24,   0,   1],
         [ 53,  10,   1],
         ...,
         [ 14,   2,   2],
         [ 13,   0,   0],
         [ 41,  22,  14]]],


       [[[105, 155, 224],
         [107, 157, 226],
         [109, 159, 228],
         ...,
         [ 36,  44,  41],
         [ 31,  43,  39],
         [ 36,  48,  45]],

        [[ 90, 140, 209],
         [ 90, 140, 209],
         [ 89, 139, 207],
         ...,
         [ 36,  43,  40],
         [ 35,  46,  41],
         [ 32,  44,  42]],

        [[ 80, 130, 199],
         [ 80, 130, 199],
         [ 81, 131, 199],
         ...,
         [ 33,  38,  36],
         [ 35,  43,  40],
         [ 28,  38,  36]],

        ...,

        [[ 52,  56,  41],
         [ 52,  56,  41],
         [ 52,  56,  40],
         ...,
         [ 56,  37,  21],
         [ 58,  39,  25],
         [ 62,  43,  28]],

        [[ 53,  57,  42],
         [ 53,  57,  42],
         [ 52,  56,  40],
         ...,
         [ 50,  31,  16],
         [ 59,  40,  25],
         [ 66,  47,  32]],

        [[ 54,  58,  43],
         [ 56,  60,  45],
         [ 53,  57,  41],
         ...,
         [ 48,  29,  14],
         [ 58,  39,  24],
         [ 63,  44,  29]]]], dtype=uint8)
color_pixels_features_tensor.shape
(300, 240, 184, 3)

Histogram of Oriented Gradients Method#

HOG (Histogram of Oriented Gradients) is a technique used for feature extraction in image processing and computer vision tasks like object detection and classification. It works by dividing an image into small regions, computing the gradient orientation within each region, and then building a histogram of these gradients. The resulting feature vector represents the distribution of gradient orientations, capturing important information about the edges and textures present in the image. HOG is particularly effective for recognizing objects in images regardless of changes in lighting conditions and backgrounds, making it a popular choice in various applications such as pedestrian detection, face recognition, and surveillance.

A brief summary of how Histogram of Oriented Gradients (HOG) works as a feature extraction method for images:

  • Gradient Calculation: Compute the gradient (derivative) of pixel intensities in the image to capture local edge information. Typically, gradient magnitude and direction are computed using gradient filters like the Sobel operator.

  • Cell Division: Divide the image into small, non-overlapping cells (e.g., 8x8 pixels). Each cell accumulates gradient information by constructing histograms of gradient orientations.

  • Histograms of Oriented Gradients: Within each cell, construct histograms of gradient orientations to represent the distribution of edge directions. This captures local texture and shape information.

  • Block Normalization: Group cells into larger blocks (e.g., 2x2 or 3x3 cells) to enhance local contrast and normalize illumination changes. Normalize the histograms within each block to make the descriptor invariant to changes in lighting and contrast.

  • Descriptor Formation: Concatenate the normalized block features to form the final HOG descriptor for the image. This descriptor captures the local intensity gradients and their orientations across different regions of the image.

  • Feature Vector Extraction: The HOG descriptor forms a feature vector representing the image. This feature vector can be used as input to machine learning algorithms for tasks such as object detection, classification, or similarity comparison.

The method hog from the module feature of the skimage package provides a way to compute a Histogram of Oriented Gradients (HOG) by:

  • Global image normalisation (optional)

  • Computing the gradient image in x and y

  • Computing gradient histograms

  • Normalising across blocks

  • Flattening into a feature vector

HOG usually works on gray-scale images, so images should be converted to gray-scale before serving as input for this method.

Once we have the HOG feature vector for a given image we can proceed in several ways; specifically, the following will be considered throughout this project:

  • Consider the HOG features vector as the numerical representation of the image.

    • This approach usually leads to a computational problem, since the length of these vectors is usually massive; this issue can be mitigated using dimensionality reduction techniques, such as PCA.

  • Transform the HOG features vector into a features matrix, and then:

    • Transform the features matrix into a new features vector by computing statistics along the matrix columns.

    • Transform the features matrix into a new features vector by computing the histogram of bags of visual words (BVW).

To better understand how HOG works, we are going to apply it to several images and plot the resulting HOG image alongside the original one, to compare them and extract insight about this method in a purely visual manner.

Here is a classic example, provided in the skimage documentation.

Image.open(r'C:\Users\fscielzo\Documents\DataScience-GitHub\Image Analysis\Image-Classification\Fire-Detection\images\HOG_1.webp')

The next plot shows several images of our data-set together with their HOG representations.

CELLS_PER_BLOCK_HOR = 2
CELLS_PER_BLOCK_VER = 2
PIXELS_PER_CELL_HOR = 8
PIXELS_PER_CELL_VER = 8
orientations = 8
pixels_per_cell=(PIXELS_PER_CELL_HOR, PIXELS_PER_CELL_VER)
cells_per_block=(CELLS_PER_BLOCK_HOR, CELLS_PER_BLOCK_VER)

n_images = 9
np.random.seed(0)
random_idx = np.random.choice(len(X), n_images, replace=False).tolist()
random_idx = random_idx + [103]
img_gray_array_list = [np.array(Image.open(X[i]).convert('L')) for i in random_idx]
img_color_array_list = [np.array(Image.open(X[i])) for i in random_idx]

HOG_visualize_list = []
for img_array in img_gray_array_list:
    
    HOG_img, HOG_visualize = feature.hog(img_array, orientations=orientations,
                pixels_per_cell=pixels_per_cell,
                cells_per_block=cells_per_block,
                transform_sqrt=True, visualize=True)
    
    HOG_visualize_list.append(HOG_visualize)
    
n_rows = len(list(zip(img_color_array_list, HOG_visualize_list)))
subplot_idx = [(i, i + 1) for i in range(0, n_rows*2, 2)]

fig, axes = plt.subplots(n_rows, 2, figsize=(12,40))
axes = axes.flatten() 

for (i, j), img_color_array, HOG_visualize in zip(subplot_idx, img_color_array_list, HOG_visualize_list):
    
    axes[i].imshow(img_color_array, cmap='gray')
    axes[j].imshow(HOG_visualize, cmap='gray')
    
    if i == 0:
        axes[i].set_title('Original image', size=15, weight='bold')
        axes[j].set_title('HOG image', size=15, weight='bold')

for i in range(len(axes)):
    axes[i].axis('off')  # Hide the axis

plt.subplots_adjust(hspace=0.025, wspace=0.01) 
plt.show()

Now we are going to explore the outputs that result from applying HOG to an image.

  • HOG features vector

img_gray_array = np.array(Image.open(X[103]).convert('L'))

HOG_img, HOG_visualize = feature.hog(img_gray_array, orientations=orientations,
                pixels_per_cell=pixels_per_cell,
                cells_per_block=cells_per_block,
                transform_sqrt=True, visualize=True)
HOG_img
array([0.36364343, 0.03360488, 0.02089678, ..., 0.03660126, 0.02114829,
       0.01176881])
HOG_img.shape
(23232,)
  • HOG feature matrix: reshaping the features vector into a matrix.

p = orientations * cells_per_block[0] * cells_per_block[1] # number of features per block (columns)
n = int(np.shape(HOG_img)[0]/p) # number of blocks (rows)
HOG_img = np.reshape(HOG_img, (n, p))
HOG_img
array([[0.36364343, 0.03360488, 0.02089678, ..., 0.03471119, 0.04365133,
        0.        ],
       [0.47612765, 0.        , 0.        , ..., 0.0672579 , 0.12666173,
        0.        ],
       [0.38112443, 0.        , 0.07521982, ..., 0.        , 0.048066  ,
        0.        ],
       ...,
       [0.15979678, 0.0981024 , 0.16545619, ..., 0.14736083, 0.08917909,
        0.03633761],
       [0.04752936, 0.15648015, 0.12212673, ..., 0.11533974, 0.05735903,
        0.00558269],
       [0.25748894, 0.28158486, 0.11105426, ..., 0.03660126, 0.02114829,
        0.01176881]])
HOG_img.shape
(726, 32)

Based on Statistics#

Statistics aggregation can be used to transform the HOG_img matrix into a features vector that represents the given image; in other words, to extract a HOG-based features vector for a given image.

The next cells show a couple of examples of how to do this.

  • Statistics: mean

HOG_img_mean = np.mean(HOG_img, axis=0)
HOG_img_mean
array([0.14679672, 0.0803544 , 0.09536614, 0.1419433 , 0.21723404,
       0.08575819, 0.10150205, 0.08248819, 0.14743567, 0.08275199,
       0.09822818, 0.1462247 , 0.21792027, 0.08908326, 0.10341332,
       0.08480108, 0.15140446, 0.08353412, 0.09937754, 0.16057091,
       0.23059348, 0.09163763, 0.1043415 , 0.08710898, 0.15154042,
       0.08582017, 0.100348  , 0.16331974, 0.23107536, 0.09231949,
       0.10460763, 0.08779868])
HOG_img_mean.shape
(32,)
  • Statistics: mean-std

HOG_img_mean = np.mean(HOG_img, axis=0)
HOG_img_std = np.std(HOG_img, axis=0)
HOG_img_stats = np.hstack([HOG_img_mean, HOG_img_std])
HOG_img_stats
array([0.14679672, 0.0803544 , 0.09536614, 0.1419433 , 0.21723404,
       0.08575819, 0.10150205, 0.08248819, 0.14743567, 0.08275199,
       0.09822818, 0.1462247 , 0.21792027, 0.08908326, 0.10341332,
       0.08480108, 0.15140446, 0.08353412, 0.09937754, 0.16057091,
       0.23059348, 0.09163763, 0.1043415 , 0.08710898, 0.15154042,
       0.08582017, 0.100348  , 0.16331974, 0.23107536, 0.09231949,
       0.10460763, 0.08779868, 0.1317824 , 0.09294007, 0.09738209,
       0.15405147, 0.13246163, 0.09432467, 0.110728  , 0.10418498,
       0.13221261, 0.09430074, 0.09785966, 0.15478669, 0.13196037,
       0.09770127, 0.11163924, 0.10598677, 0.134641  , 0.09318608,
       0.09613184, 0.1606208 , 0.13438775, 0.09621373, 0.11163673,
       0.10831574, 0.13312731, 0.09576936, 0.0957986 , 0.15947756,
       0.13395827, 0.09442077, 0.11003086, 0.10837857])
HOG_img_stats.shape
(64,)

Based on Bags of Visual Words (BVW)#

The following diagram explains how the BVW can be used to extract a features vector from an image based on HOG.

Image.open(r'C:\Users\fscielzo\Documents\DataScience-GitHub\Image Analysis\Image-Classification\Fire-Detection\images\HOG_2.jpg')
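
A rough sketch of the pipeline the diagram describes (for brevity, the visual-word vocabulary is fit here on the HOG blocks of a single image, whereas in practice it is learned on the blocks of all training images; the vocabulary size of 100 is an arbitrary choice):

from sklearn.cluster import KMeans

# Hedged sketch of BVW on HOG blocks. HOG_img is the (n_blocks, p) HOG features
# matrix computed above; n_clusters=100 is an arbitrary illustrative choice.
kmeans = KMeans(n_clusters=100, n_init=10, random_state=0).fit(HOG_img)
words = kmeans.predict(HOG_img)                 # visual word assigned to each HOG block
bvw_vector = np.bincount(words, minlength=100)  # histogram of visual words = features vector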

HOG-stats features matrix#

Given \(n\) images \(\mathcal{I}_1,\dots , \mathcal{I}_n\) of the same size \(h\times w\), and using the HOG method for feature extraction based on statistics, a features matrix for those images can be built as follows:

\[\begin{split}X = \begin{pmatrix} v(\mathcal{I}_1) \\ v(\mathcal{I}_2) \\ \dots \\ v(\mathcal{I}_n) \end{pmatrix} \end{split}\]

Now \(v(\mathcal{I}_i)\) is the vector of statistics computed by columns on the HOG features matrix \(\mathcal{M}(\mathcal{I}_i)\) (HOG_img) for the image \(\mathcal{I}_i\) \(\hspace{0.1cm}\Rightarrow\hspace{0.1cm}\) \(v(\mathcal{I}_i)= \) HOG_img_stats.

This matrix is a 2D array of size \(n\times p\) that contains the HOG statistics of each image.

Extracting the vector HOG_img_stats for each image in the data-set and stacking them as rows, we obtain a HOG-stats features matrix, as in the sketch below.
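
A hedged sketch of this recipe, written as a small helper (hog_stats is a hypothetical function, not part of PyImageML; it reuses the orientations, pixels_per_cell and cells_per_block settings defined above and the mean-std statistics):

def hog_stats(img_gray_array):
    # Hypothetical helper: column-wise mean/std statistics of the reshaped HOG matrix.
    vec = feature.hog(img_gray_array, orientations=orientations,
                      pixels_per_cell=pixels_per_cell, cells_per_block=cells_per_block,
                      transform_sqrt=True)
    n_features = orientations * cells_per_block[0] * cells_per_block[1]
    mat = vec.reshape(-1, n_features)
    return np.hstack([mat.mean(axis=0), mat.std(axis=0)])

# Stack the per-image statistics vectors as rows of the HOG-stats features matrix.
X_hog_stats = np.vstack([hog_stats(np.array(Image.open(path).convert('L'))) for path in img_path_list])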

Getting HOG-stats features matrix#

As before, we use the class ImageFeaturesExtraction from the PyImageML package; here it is applied with the HOG method for feature extraction based on statistics, using the data-set of images presented above.

  • Statistics: mean

img_feature_extraction = ImageFeaturesExtraction(method='HOG', image_height=img_height, image_width=img_width, filter=None, reshape=True,
                                                 orientations=orientations, pixels_per_cell=pixels_per_cell, cells_per_block=cells_per_block, 
                                                 transform_sqrt=True, statistics='mean')

HGO_features_matrix = img_feature_extraction.fit_transform(X=X)
HGO_features_matrix
array([[0.20157584, 0.14783572, 0.12745415, ..., 0.10648443, 0.13134968,
        0.18349608],
       [0.17286368, 0.14136847, 0.15825279, ..., 0.11890768, 0.11860601,
        0.1418163 ],
       [0.18598968, 0.15119294, 0.1556355 , ..., 0.12523065, 0.1435575 ,
        0.16787505],
       ...,
       [0.23156577, 0.10066881, 0.10889268, ..., 0.0777515 , 0.11607629,
        0.11501912],
       [0.25385873, 0.13853749, 0.08665303, ..., 0.09441686, 0.16071724,
        0.22178972],
       [0.19882647, 0.15653812, 0.14570778, ..., 0.12343722, 0.13397905,
        0.15106539]])
HGO_features_matrix.shape
(300, 32)
  • Statistics: mean-Q25-median-Q75-std

img_feature_extraction = ImageFeaturesExtraction(method='HOG', image_height=img_height, image_width=img_width, filter=None, reshape=True,
                                                 orientations=orientations, pixels_per_cell=pixels_per_cell, cells_per_block=cells_per_block, 
                                                 transform_sqrt=True, statistics='mean-Q25-median-Q75-std')

HGO_features_matrix = img_feature_extraction.fit_transform(X=X)
HGO_features_matrix
array([[0.20157584, 0.14783572, 0.12745415, ..., 0.07722948, 0.08753731,
        0.09752229],
       [0.17286368, 0.14136847, 0.15825279, ..., 0.08342822, 0.08358776,
        0.10379498],
       [0.18598968, 0.15119294, 0.1556355 , ..., 0.07379397, 0.08382094,
        0.09449716],
       ...,
       [0.23156577, 0.10066881, 0.10889268, ..., 0.08503151, 0.10009281,
        0.11873147],
       [0.25385873, 0.13853749, 0.08665303, ..., 0.08438005, 0.10134144,
        0.10840959],
       [0.19882647, 0.15653812, 0.14570778, ..., 0.08484173, 0.0841926 ,
        0.10205706]])
HGO_features_matrix.shape
(300, 160)
  • Applying equalization

img_feature_extraction = ImageFeaturesExtraction(method='HOG', image_height=img_height, image_width=img_width, filter='equalized', reshape=True, 
                                                 orientations=orientations, pixels_per_cell=pixels_per_cell, cells_per_block=cells_per_block, 
                                                 transform_sqrt=True, statistics='mean')

HGO_features_filtered_matrix = img_feature_extraction.fit_transform(X=X)
HGO_features_filtered_matrix
array([[0.20078454, 0.14787799, 0.12604256, ..., 0.10548965, 0.13279582,
        0.18339162],
       [0.17293476, 0.14033908, 0.1537719 , ..., 0.11915944, 0.11854023,
        0.14170413],
       [0.18746046, 0.15079661, 0.14928058, ..., 0.12890023, 0.14689157,
        0.16894386],
       ...,
       [0.23320771, 0.08942915, 0.1014497 , ..., 0.07468827, 0.11502458,
        0.11756563],
       [0.25469751, 0.13832403, 0.08580414, ..., 0.09091659, 0.16319569,
        0.22209216],
       [0.19423289, 0.15211316, 0.14351524, ..., 0.12170027, 0.12920767,
        0.15031476]])
HGO_features_filtered_matrix.shape
(300, 32)
  • Applying hessian filter

img_feature_extraction = ImageFeaturesExtraction(method='HOG', image_height=img_height, image_width=img_width, filter='hessian', reshape=True, 
                                                 orientations=orientations, pixels_per_cell=pixels_per_cell, cells_per_block=cells_per_block, 
                                                 transform_sqrt=True, statistics='mean')

HGO_features_filtered_matrix = img_feature_extraction.fit_transform(X=X)
HGO_features_filtered_matrix
array([[0.21395189, 0.15094232, 0.13975573, ..., 0.12033038, 0.13637175,
        0.15742064],
       [0.18118397, 0.14117219, 0.14688905, ..., 0.11819649, 0.12805744,
        0.13398258],
       [0.1952499 , 0.16164455, 0.14418718, ..., 0.12276572, 0.14569308,
        0.15200626],
       ...,
       [0.1919733 , 0.11779311, 0.12347916, ..., 0.10288798, 0.10688693,
        0.16267898],
       [0.24987259, 0.10253705, 0.10691321, ..., 0.12632426, 0.11728681,
        0.20749636],
       [0.20363335, 0.14678954, 0.13949207, ..., 0.11866348, 0.1229296 ,
        0.14670206]])
HGO_features_filtered_matrix.shape
(300, 32)

HOG-BVW features matrix#

Given \(n\) images \(\mathcal{I}_1,\dots , \mathcal{I}_n\) of the same size \(h\times w\), and using the HOG method for feature extraction based on BVW, a features matrix for those images can be built as follows:

\[\begin{split}X = \begin{pmatrix} v(\mathcal{I}_1) \\ v(\mathcal{I}_2) \\ \dots \\ v(\mathcal{I}_n) \end{pmatrix} \end{split}\]

Now \(v(\mathcal{I}_i)\) is the vector of BVW frequencies for the image \(\mathcal{I}_i\).

This matrix is a 2D array of size \(n\times p\) that contains the BVW frequencies for each image.

Extracting the vector of BVW frequencies for each image in the data-set (as shown in the diagram) and stacking them as rows, we obtain a HOG-BVW features matrix.

Getting HOG-BVW features matrix#

In this section we use the ImageFeaturesExtraction class from the PyImageML package, which has been developed to extract features matrices from data-sets of images and supports several feature extraction methods, together with options such as color or gray images, a variety of filters, and different output formats and strategies.

Here ImageFeaturesExtraction is applied with the HOG method and the BVW strategy, using the data-set of images presented above.

img_feature_extraction = ImageFeaturesExtraction(method='HOG', image_height=img_height, image_width=img_width, filter=None, reshape=True, 
                                                 orientations=orientations, pixels_per_cell=pixels_per_cell, cells_per_block=cells_per_block, 
                                                 transform_sqrt=True, statistics='BVW', n_clusters=100)

HGO_features_matrix = img_feature_extraction.fit_transform(X=X)
HGO_features_matrix
array([[ 3,  9,  7, ...,  3,  2, 15],
       [ 1,  5, 12, ...,  2,  0, 13],
       [ 5,  8, 20, ...,  0,  4, 40],
       ...,
       [ 3,  5, 13, ...,  1,  4, 14],
       [ 9, 19, 14, ...,  2,  3,  3],
       [10,  5,  7, ...,  6,  0, 10]], dtype=int64)
HGO_features_matrix.shape
(300, 100)
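
Note that the entries above are raw visual-word counts. If relative frequencies are preferred, for example to compare images with different numbers of local descriptors, each row can be normalized to sum to one:

# Convert raw counts into relative frequencies (each row sums to 1).
HGO_frequencies = HGO_features_matrix / HGO_features_matrix.sum(axis=1, keepdims=True)
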
  • Filtering with sobel

img_feature_extraction = ImageFeaturesExtraction(method='HOG', image_height=img_height, image_width=img_width, filter='sobel', reshape=True, 
                                                 orientations=orientations, pixels_per_cell=pixels_per_cell, cells_per_block=cells_per_block, 
                                                 transform_sqrt=True, statistics='BVW', n_clusters=100)

HGO_features_filtered_matrix = img_feature_extraction.fit_transform(X=X)
HGO_features_filtered_matrix
array([[ 4,  5,  5, ...,  0,  7, 11],
       [ 3,  2, 15, ...,  0,  8,  2],
       [ 0,  2, 12, ...,  0, 10,  0],
       ...,
       [ 3,  8,  1, ..., 11,  6,  0],
       [ 0,  4,  0, ...,  0, 13, 14],
       [ 4,  2,  6, ...,  0,  1,  1]], dtype=int64)
HGO_features_filtered_matrix.shape
(300, 100)

HOG - Not Reshaped features matrix#

Given \(n\) images \(\mathcal{I}_1,\dots , \mathcal{I}_n\) of the same size \(h\times w\).

Using the HOG method for feature extraction based on the full, not-reshaped HOG descriptors, a features matrix for those images can be built as follows:

\[\begin{split}X = \begin{pmatrix} v(\mathcal{I}_1) \\ v(\mathcal{I}_2) \\ \vdots \\ v(\mathcal{I}_n) \end{pmatrix} \end{split}\]

Here \(v(\mathcal{I}_i)\) is the vector of not-reshaped HOG features for the image \(\mathcal{I}_i\).

This matrix is a 2D array of size \(n\times p\) that contains the not-reshaped HOG features for each image.

Extracting the not-reshaped HOG features for each image in the data-set and stacking them by rows, we obtain the HOG - not reshaped features matrix.

  • The drawback of this approach is that it leads to a very high dimensionality \(p\), so it should be combined with dimensionality reduction techniques such as PCA (see the sketch at the end of this section).

Getting HOG - Not Reshaped features matrix#

As in the previous section, we use the ImageFeaturesExtraction class from the PyImageML package, now applied with the HOG method and reshape=False, so that the full not-reshaped HOG descriptor of each image in the data-set is extracted.

img_feature_extraction = ImageFeaturesExtraction(method='HOG', image_height=img_height, image_width=img_width, filter=None, reshape=False,
                                                 orientations=orientations, pixels_per_cell=pixels_per_cell, cells_per_block=cells_per_block, 
                                                 transform_sqrt=True)

HGO_features_matrix = img_feature_extraction.fit_transform(X=X)
HGO_features_matrix
array([[0.23803907, 0.04517031, 0.11159217, ..., 0.13129353, 0.08770483,
        0.09647536],
       [0.24584797, 0.02077305, 0.0480539 , ..., 0.22148548, 0.13344476,
        0.09433893],
       [0.23915977, 0.23915977, 0.14695772, ..., 0.23245254, 0.23245254,
        0.11289806],
       ...,
       [0.45553614, 0.        , 0.        , ..., 0.        , 0.12639003,
        0.        ],
       [0.26050084, 0.28689935, 0.28689935, ..., 0.12227303, 0.11154988,
        0.27509187],
       [0.05024678, 0.00761688, 0.13056183, ..., 0.0925665 , 0.11715885,
        0.21169244]])
HGO_features_matrix.shape
(300, 20416)
  • Filtering with sobel

img_feature_extraction = ImageFeaturesExtraction(method='HOG', image_height=img_height, image_width=img_width, filter='sobel', reshape=False,
                                                 orientations=orientations, pixels_per_cell=pixels_per_cell, cells_per_block=cells_per_block, 
                                                 transform_sqrt=True)

HGO_features_matrix = img_feature_extraction.fit_transform(X=X)
HGO_features_matrix
array([[0.16068715, 0.02210637, 0.08932776, ..., 0.21273152, 0.08234161,
        0.12393073],
       [0.22659834, 0.12988755, 0.04618949, ..., 0.24691583, 0.12855758,
        0.11606701],
       [0.22888939, 0.11324927, 0.22888939, ..., 0.2248361 , 0.2248361 ,
        0.17443813],
       ...,
       [0.41634108, 0.        , 0.        , ..., 0.        , 0.07710316,
        0.06034293],
       [0.29468638, 0.09417174, 0.02467003, ..., 0.21154874, 0.09705226,
        0.24505233],
       [0.07725128, 0.04226268, 0.05836576, ..., 0.11682897, 0.22934459,
        0.13684988]])
HGO_features_matrix.shape
(300, 20416)
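
As anticipated above, the resulting \(p = 20416\) columns call for dimensionality reduction. Below is a minimal sketch of a possible follow-up with scikit-learn's PCA, assuming HGO_features_matrix is the (300, 20416) matrix just computed:

from sklearn.decomposition import PCA

# Keep the smallest number of principal components that explains
# 95% of the variance; with 300 samples, at most 300 components exist.
pca = PCA(n_components=0.95)
HGO_features_reduced = pca.fit_transform(HGO_features_matrix)
HGO_features_reduced.shape  # (300, k) with k <= 300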

Convolutional Neural Networks Method#

Using Convolutional Neural Networks (CNNs) for extracting image features offers several advantages:

  • Hierarchical Feature Learning: CNNs are designed to automatically learn hierarchical representations of features from raw input data. In the case of images, lower layers typically learn simple features like edges and textures, while higher layers learn more complex features like object parts and whole objects. By using a pre-trained CNN, you leverage the knowledge encoded in these learned features without having to manually engineer them.

  • Transfer Learning: Pre-trained CNN models, such as VGG, ResNet, or Inception, have been trained on large-scale datasets like ImageNet for image classification tasks. These models have learned to recognize a wide variety of visual patterns and concepts. By using these pre-trained models as feature extractors, you can transfer this learned knowledge to new tasks or datasets with relatively little data. This is particularly useful when you have a small dataset or limited computational resources for training your own models from scratch.

  • Dimensionality Reduction: The features extracted by CNNs typically reside in high-dimensional spaces, capturing rich information about the input images. These features can be used to represent images in a more compact and meaningful way compared to raw pixel values. This can be particularly beneficial for tasks such as image retrieval, where you want to efficiently compare images based on their visual content.

  • Robustness to Variations: CNNs are designed to be robust to various transformations and distortions in the input images, such as changes in scale, rotation, illumination, and partial occlusion. The features learned by CNNs tend to capture invariant properties of objects, making them effective for tasks like object recognition and detection under different conditions.

  • Interpretable Representations: The features learned by CNNs often correspond to semantically meaningful concepts in the images, such as object shapes, textures, or parts. This makes the extracted features more interpretable compared to handcrafted features, allowing for better understanding of the underlying characteristics of the data.

# Load CNN model
model = VGG16()
# Remove the output layer
model = Model(inputs=model.inputs, outputs=model.layers[-2].output)
# Extracting the image height and width expected for the used CNN model.
img_height_CNN = model.inputs[0].shape[1]
img_width_CNN = model.inputs[0].shape[2]

# Load an image from file
img_path = X[103]
# Note: load_img's target_size expects (height, width)
x = tf.keras.preprocessing.image.load_img(img_path, target_size=(img_height_CNN, img_width_CNN))
# Convert the image pixels to a numpy array
x = tf.keras.preprocessing.image.img_to_array(x)
# Reshape data for the model
x = np.expand_dims(x, axis=0)
# Prepare the image for the VGG model
x = preprocess_input(x)
# Get CNN features
CNN_features = model.predict(x)
CNN_features
array([[0.7327867, 0.       , 6.7262554, ..., 2.1139264, 5.3945417,
        0.       ]], dtype=float32)
CNN_features.shape
(1, 4096)

CNN features matrix#

Given \(n\) images \(\mathcal{I}_1,\dots , \mathcal{I}_n\) of the same size \(h\times w\).

Using the CNN method for feature extraction, a features matrix for those images can be built as follows:

\[\begin{split}X = \begin{pmatrix} v(\mathcal{I}_1) \\ v(\mathcal{I}_2) \\ \vdots \\ v(\mathcal{I}_n) \end{pmatrix} \end{split}\]

Here \(v(\mathcal{I}_i)\) is the vector of CNN features for the image \(\mathcal{I}_i\), that is, \(v(\mathcal{I}_i) = \) CNN_features as computed above.

This matrix is a 2D array of size \(n\times p\) that contains the CNN features of each image.

Extracting the vector CNN_features for each image in the data-set and stacking these vectors by rows, we obtain the CNN features matrix; a sketch of this construction is given below.
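
A minimal sketch of that row-wise construction, reusing the model, img_height_CNN and img_width_CNN defined above (the helper cnn_features_matrix is illustrative; PyImageML may implement this differently, e.g., with batched prediction):

def cnn_features_matrix(image_paths):
    # Stack v(I_i) = CNN_features for every image by rows.
    rows = []
    for path in image_paths:
        img = tf.keras.preprocessing.image.load_img(
            path, target_size=(img_height_CNN, img_width_CNN))
        x = np.expand_dims(tf.keras.preprocessing.image.img_to_array(img), axis=0)
        rows.append(model.predict(preprocess_input(x), verbose=0))
    return np.vstack(rows)  # shape: (n, 4096) for VGG16's fc2 layer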

Getting CNN features matrix#

Once again we use the ImageFeaturesExtraction class from the PyImageML package, this time applied with the CNN method to the data-set of images presented above.

img_feature_extraction = ImageFeaturesExtraction(method='CNN')

CNN_features_matrix = img_feature_extraction.fit_transform(X=X)
CNN_features_matrix
array([[0.39077538, 0.        , 0.        , ..., 0.        , 0.        ,
        0.        ],
       [0.        , 2.8053548 , 1.1582053 , ..., 0.        , 3.9529238 ,
        0.13838303],
       [0.        , 1.5708483 , 0.9057204 , ..., 0.        , 1.3198075 ,
        0.67053926],
       ...,
       [0.31342962, 0.04839906, 2.9200916 , ..., 2.756431  , 0.48978698,
        0.223643  ],
       [0.        , 0.        , 3.3499594 , ..., 0.        , 2.0666275 ,
        0.        ],
       [0.        , 0.8232522 , 0.51099145, ..., 5.909257  , 5.0205564 ,
        0.        ]], dtype=float32)
CNN_features_matrix.shape
(300, 4096)
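
As a quick illustration of the retrieval use case mentioned earlier, the following hypothetical snippet ranks the images most similar to a query image by the cosine similarity of their CNN feature vectors (query_idx and the top-5 cut-off are arbitrary choices):

from sklearn.metrics.pairwise import cosine_similarity

# Similarity of image 103 to every image in the data-set.
query_idx = 103
similarities = cosine_similarity(CNN_features_matrix[query_idx:query_idx + 1],
                                 CNN_features_matrix).ravel()
# Indices of the five most similar images, excluding the query itself.
top5 = np.argsort(similarities)[::-1][1:6]
[X[i] for i in top5]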