# Simple Introduction to Compressive Sensing and Dictionary Learning

An interesting theory that I recently got to study is Compressive Sensing and Dictionary Learning, and here I will share some of the intuition I built during my studies, trying to keep things easy and practical.
I also made a presentation on Compressive Sensing and Dictionary Learning, and you can find my Jupyter Notebook repository on this topic here (I suggest you run it on Colaboratory).

## What is Compressive Sensing?

Compressive Sensing is a theory that took off after the first paper, Robust Uncertainty Principles: Exact Signal Reconstruction from Highly Incomplete Frequency Information (June 2004), written by the noted Emmanuel Candès, Justin Romberg, and Terence Tao, where they proved that it is possible to exactly recover an object from incomplete frequency samples by solving a convex optimization problem.
The second paper in the history of Compressive Sensing is For Most Large Underdetermined Systems of Equations, the Minimal $\ell_1$-norm Near-Solution Approximates the Sparsest Near-Solution (August 2004), written by David L. Donoho. Here we find the most famous formulation of the problem: the one based on the (possibly inexact) underdetermined linear system $y \approx Ax$, where $A$ is a given $n \times m$ matrix with $n < m$ and $x$ is a sparse vector. In general, finding the sparsest solution requires combinatorial optimization ($\ell_0$-minimization) and is therefore considered intractable, but its convex relaxation using $\ell_1$-minimization is convex and entirely tractable.


If you want to know more, go here.

## Let’s get some intuition of what is going on…

Let’s start by thinking about what $Ax = y$ means. If $A$ is an $n \times n$ matrix, we have $n$ hyperplanes (each of dimension $n-1$) that, if they are linearly independent, intersect in a single point. If instead $A$ is $(n-1) \times n$, the system of equations identifies a line, and so on…
The more rows we remove from $A$, the more dimensions we give to the affine subspace of solutions!
So it might seem that there is no single solution… but we should not forget that we want the solution to be as sparse as possible, and for this reason (under the conditions discussed below), the solution exists and is unique.
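To make this concrete, here is a minimal NumPy sketch (my own toy example, not taken from the notebook): it builds an underdetermined system and shows that the minimum $\ell_2$-norm solution returned by the pseudoinverse is dense, while the sparse ground truth is also a perfectly valid solution.

```python
import numpy as np

rng = np.random.default_rng(0)
n, m = 4, 10                          # 4 equations, 10 unknowns
A = rng.standard_normal((n, m))

x_true = np.zeros(m)                  # a 1-sparse ground-truth vector
x_true[3] = 2.0
y = A @ x_true

# The minimum 2-norm solution (via the pseudoinverse) also satisfies
# Ax = y, but it is dense: sparsity is what singles out x_true.
x_l2 = np.linalg.pinv(A) @ y
print(np.linalg.norm(A @ x_l2 - y))   # ~0: a perfectly valid solution
print(np.sum(np.abs(x_l2) > 1e-8))    # typically 10 non-zeros (dense)
print(np.sum(x_true != 0))            # 1 non-zero (the sparse one)
```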

*(Figure: sparse space)*

At last, since $\ell_0$-norm minimization is an NP-hard problem, we transform the problem into a convex one using the $\ell_1$-norm:

$$\min_x \|x\|_0 \ \text{ s.t. } \ Ax = y \quad \longrightarrow \quad \min_x \|x\|_1 \ \text{ s.t. } \ Ax = y$$

*(Figure: the $\ell_p$ norm with $p < 1$)*

## But under which conditions is unique and exact recovery possible?

**Theorem:**
Given $A \in \mathbb{R}^{n \times m}$ and a sparsity level $s$, the following properties are equivalent:

a) If $Ax = Az$ and both $x$ and $z$ are $s$-sparse, then $x = z$.

b) The null space of $A$ does not contain any $2s$-sparse vector other than the $0$ vector.

c) Every set of $2s$ columns of $A$ is linearly independent.
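Condition (c) suggests a brute-force sanity check. The sketch below (a hypothetical helper of my own, feasible only for tiny matrices) tests whether every set of $2s$ columns is linearly independent via a rank computation:

```python
import itertools
import numpy as np

def satisfies_condition_c(A: np.ndarray, s: int) -> bool:
    """Check that every set of 2s columns of A has full rank 2s."""
    m = A.shape[1]
    for cols in itertools.combinations(range(m), 2 * s):
        if np.linalg.matrix_rank(A[:, cols]) < 2 * s:
            return False
    return True

rng = np.random.default_rng(0)
A = rng.standard_normal((6, 8))       # generic random matrices pass the test
print(satisfies_condition_c(A, s=2))  # True with probability 1
```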

## Recovery algorithms

So far we have talked about theory; let’s start talking about code…

### Basis Pursuit

Consists in solving this simple convex problem:

$$\min_x \|x\|_1 \quad \text{subject to} \quad Ax = y$$

### Lasso

Is the famous convex problem:

$$\min_x \frac{1}{2}\|Ax - y\|_2^2 + \lambda\|x\|_1$$

### Orthogonal Matching Pursuit

It’s a greedy method that performs the following steps (a sketch of all three solvers follows the list):

- Initialization: residual $r^{(0)} = y$, support $S^{(0)} = \emptyset$, $x^{(0)} = 0$.
- Repeat until a stopping criterion is met: add to the support the index of the column of $A$ most correlated with the current residual, solve the least-squares problem restricted to the selected columns, and update the residual.
- Output: the final sparse vector $x$.
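As an illustration, here is a hedged sketch of all three solvers on the same toy problem, using SciPy for Basis Pursuit (recast as a linear program) and scikit-learn for Lasso and OMP; the problem sizes and the alpha value are assumptions for illustration, not values from the notebook.

```python
import numpy as np
from scipy.optimize import linprog
from sklearn.linear_model import Lasso, OrthogonalMatchingPursuit

rng = np.random.default_rng(0)
n, m, s = 20, 50, 3                    # measurements, ambient dim, sparsity
A = rng.standard_normal((n, m))
x_true = np.zeros(m)
x_true[rng.choice(m, size=s, replace=False)] = rng.standard_normal(s)
y = A @ x_true

# Basis Pursuit: min ||x||_1 s.t. Ax = y, recast as a linear program
# by splitting x = u - v with u, v >= 0, so that ||x||_1 = sum(u + v).
res = linprog(c=np.ones(2 * m),
              A_eq=np.hstack([A, -A]), b_eq=y,
              bounds=[(0, None)] * (2 * m))
x_bp = res.x[:m] - res.x[m:]

# Lasso: min 1/(2n) ||Ax - y||_2^2 + alpha ||x||_1 (scikit-learn scaling).
x_lasso = Lasso(alpha=0.01, fit_intercept=False, max_iter=10_000).fit(A, y).coef_

# Orthogonal Matching Pursuit: greedily select s atoms.
x_omp = OrthogonalMatchingPursuit(n_nonzero_coefs=s, fit_intercept=False).fit(A, y).coef_

for name, x_hat in [("BP", x_bp), ("Lasso", x_lasso), ("OMP", x_omp)]:
    print(f"{name:5s} recovery error: {np.linalg.norm(x_hat - x_true):.2e}")
```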

## Dictionary Learning

It’s a collection of algorithms focused on building, from some training data, a set of atoms/vectors that span a new space, with the characteristic that the representation of a signal in this space is given by a sparse vector, and the atoms need not be orthogonal to one another, unlike, for example, in PCA.

So it’s also a useful method to build the matrix $A$ that we used previously in Compressive Sensing theory.
To accomplish this task we need to solve the following optimization problem:

$$\min_{D,\,X} \|Y - DX\|_F^2 \quad \text{subject to} \quad \|x_i\|_0 \le s \ \text{for each column } x_i \text{ of } X, \ \ \|d_j\|_2 = 1 \ \text{for each atom } d_j$$

This is, of course, a non-convex optimization problem: the objective is jointly non-convex in $D$ and $X$ (although it is convex in each variable when the other is held fixed).

## How to solve this type of non-convex problem?

In the literature there are many different approaches to this problem, and we will look at just two of them:

### Method of Optimal Directions (MOD)

It simply consists in solving the sparse coding problem with one of the methods seen previously, such as Lasso, and then updating the dictionary by computing the analytical solution of the least-squares problem, given by $D = YX^{+}$, where the $+$ stands for the Moore-Penrose pseudoinverse.
After this update, $D$ needs to be renormalized (each atom rescaled to unit norm) to satisfy the constraints, and the two steps are repeated until convergence.
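Here is a minimal sketch of the MOD iteration, assuming OMP for the sparse coding step (any of the solvers above would do); the shapes, sparsity level, and iteration count are illustrative assumptions:

```python
import numpy as np
from sklearn.linear_model import OrthogonalMatchingPursuit

def mod(Y, n_atoms, s, n_iter=20, seed=0):
    """Y: (n_features, n_samples) data matrix. Returns the learned dictionary D."""
    rng = np.random.default_rng(seed)
    D = rng.standard_normal((Y.shape[0], n_atoms))
    D /= np.linalg.norm(D, axis=0)                 # unit-norm atoms
    omp = OrthogonalMatchingPursuit(n_nonzero_coefs=s, fit_intercept=False)
    for _ in range(n_iter):
        # 1) Sparse coding: find s-sparse columns X such that Y ~ D X.
        X = omp.fit(D, Y).coef_.T                  # (n_atoms, n_samples)
        # 2) Dictionary update: the least-squares solution D = Y X^+.
        D = Y @ np.linalg.pinv(X)
        D /= np.linalg.norm(D, axis=0) + 1e-12     # renormalize the atoms
    return D

# Usage on toy data (assumed shapes, purely illustrative):
Y = np.random.default_rng(1).standard_normal((16, 200))
D = mod(Y, n_atoms=32, s=3)
print(D.shape)                                     # (16, 32)
```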

### Online Dictionary Learning

When the training set is too big, or when the input data arrives as a stream, we are in the field of online learning, which essentially suggests iteratively updating the model as new data points $x$ become available.

This method allows us to gradually update the dictionary as new data becomes available for sparse representation learning, and it drastically reduces the amount of memory needed to store the dataset (which is often huge).
The implementation of this algorithm is taken from the paper Online Dictionary Learning for Sparse Coding by Julien Mairal, Francis Bach, Jean Ponce, and Guillermo Sapiro.

*(Figure: the online dictionary learning algorithm)*

*(Figure: the dictionary update step)*
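As a practical sketch, scikit-learn ships MiniBatchDictionaryLearning, whose partial_fit matches the online setting described above; note this is an available implementation in the spirit of Mairal et al.'s algorithm, not the paper's reference code, and the random "stream" below is purely illustrative.

```python
import numpy as np
from sklearn.decomposition import MiniBatchDictionaryLearning

rng = np.random.default_rng(0)
stream = rng.standard_normal((1000, 64))   # pretend each row is a new sample

dl = MiniBatchDictionaryLearning(n_components=32, batch_size=16,
                                 transform_algorithm="omp", random_state=0)

# partial_fit updates the dictionary one mini-batch at a time, so the
# full dataset never needs to sit in memory at once.
for i in range(0, len(stream), 16):
    dl.partial_fit(stream[i:i + 16])

codes = dl.transform(stream[:5])           # sparse codes for new data
print(codes.shape)                         # (5, 32)
```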

Thank you for reading, and remember to star my repo and check out the Jupyter notebook on Colab!
