MC-DEPI Analysis Notebooks

This repository contains analysis notebooks for the paper:

Monte-Carlo Diffusion-Enhanced Photon Inference: Distance Distributions and Conformational Dynamics in single-molecule FRET

Antonino Ingargiola, Shimon Weiss, Eitan Lerner (2018) https://doi.org/10.1101/385252

These notebooks also serve as an example on using the depi package.

To run these notebooks online click here

Data

Raw data files for the 7d and 17 dsDNA samples are available on Figshare:

Ingargiola, A; Weiss, S; Lerner, E (2018): Photon-HDF5 files. figshare. Fileset.

Workflow Overview

Preprocessing: burst search and grouping

Notebooks:

PIE Analysis and save bursts.ipynb

Batch run notebook.ipynb

Group Results by Name.ipynb

Export TCSPC decays from ns-ALEX Photon-HDF5.ipynb

A notebook[1] for burst-search and population selection (D-only, FRET) is executed in batch (using [2]) on all the Photon-HDF5 data files for a given sample. Next, burst photon data (timestamps, D/A labels, TCSPC nanotimes) for each (sample, population) pair are grouped in a single data file[4] for further analysis.

Notebook 4 exports fluorescence decays histograms from a smFRET-PIE data file in Photon-HDF5 format. This notebook is used to export IRF decays used to simulate realistic TCSPC nanotimes in the following steps.

Experimental data analysis and simulation fitting

Notebooks:

Burst Analysis-DEPI-exp.ipynb

Burst Analysis-DEPI-sim2-E_nanotime.ipynb (7d sample)

Burst Analysis-DEPI-sim2-E_nanotime-17d-bg-irf.ipynb (17d sample)

We perform the experimental and simulated data analysis for each sample in similar fashion. For the experimental analysis[5] we use bursts aggregated from from multiple measurements in the previous step. The simulated data is generated by an MC-DEPI simulation with user-defined distance distribution and self-diffusion parameters[6, 7]. Simulations have the same number of photons as the experimental burst photon-data, with "recoloring" (i.e. reassigned D and A labels) and with with simulated TCSPC nanotimes. The simulation takes into account multiple conformational states with arbitrary transition matrix, a distance distribution model for each state, a D-A diffusion relaxation time for each state, acceptor photo-blinking, correction factors (gamma, donor leakage, acceptor direct excitation from donor laser), background counts. In the paper we compare FRET histograms and D and A fluorescence decays between experiments and simulations. Other analysis carried out in the notebooks are FCS, BVA.

The notebooks [8] and [9] repeat the simulations 100 times using the same input parameters, but different seeds for the random number generation. In this way, we assess the dispersion of the simulation due to the sole Monte Carlo noise. The dispersion is assessed both graphically and computing the standard deviation of the loss function.

The notebooks, [6] and [7], first perform a single user-define simulation, then they compute the loss function and finally the run an optimization procedure to find the best parameters fitting the experimental data. The standard deviation of the loss function is important for the convergence of the optimization algorithm.

Additional Notebooks

Continous-Time Markov Chain.ipynb

This notebooks demonstrates an implementation of Continuos-Time Markov Chain matrix formalism using numpy.

Dependencies

Python >= 3.6
FRETBursts >= 0.7
depi >= 0.1+14.g413c350
scikit-optimze >= 0.5.2+39.g000b9d8
randomgen ==1.14.4 (next-generation RNG, soon to be included in numpy)

Installation

Follow the instructions below to create a reproducible conda environment to run the notebooks in this repository on a local computer (Windows, macOS or Linux).

Note: you can also run the notebooks on the cloud (no installation) by clicking here

Install the free Anaconda 3 python distribution and follow these steps.

In a terminal, type each single line (assuring that there is not error after each line):

conda activate depi_env
conda install fretbursts ipython
pip install randomgen
pip install git+https://github.com/scikit-optimize/scikit-optimize/ --upgrade
pip install pycorrelate
conda install depi

The previous command installs the depi python package and all dependencies in an conda environment called depi_env.

Type ipython and try runnning:

import depi

You should get no error. Type quit to exit ipython.

We need to create a "kernel" which allows using that environment from the notebook. In the same terminal as before (after exiting ipython) type:

python -m ipykernel install --name depi_env --display-name "MC-DEPI (Python 3.6)"

Download the notebooks used for the 2018 MC-DEPI paper from github:

https://github.com/tritemio/mcdepi2018-paper-analysis/archive/master.zip

In the folder where you put these notebooks, make a subfolder data/results (one inside the other) and extract there the archive you download from:

https://ndownloader.figshare.com/files/12753497?private_link=4080f1df435c07e7bd21

(this is the burst data for the dsDNA smFRET-PIE measurements used in the paper).

Finally, in a new terminal, launch the notebook with jupyter notebook. Create a notebook. Make sure you choose the MC-DEPI kernel.
From the notebook tab, navigate to the depi notebooks and open:

Burst Analysis-DEPI-sim2-E_nanotime.ipynb

Select the MC-DEPI kernel and run (at least) the first part until the single-condition DEPI simulation. No error should occur. Running the full fit may require several hours of computations depending on initial conditions, model, number of iterations and processing power.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

MC-DEPI Analysis Notebooks

Data

Workflow Overview

Preprocessing: burst search and grouping

Experimental data analysis and simulation fitting

Additional Notebooks

Dependencies

Installation

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 28 Commits
binder		binder
results		results
Batch run notebook.ipynb		Batch run notebook.ipynb
Burst Analysis-DEPI-exp.ipynb		Burst Analysis-DEPI-exp.ipynb
Burst Analysis-DEPI-sim2-E_nanotime-17d-bg-irf.ipynb		Burst Analysis-DEPI-sim2-E_nanotime-17d-bg-irf.ipynb
Burst Analysis-DEPI-sim2-E_nanotime.ipynb		Burst Analysis-DEPI-sim2-E_nanotime.ipynb
Burst Analysis-DEPI-sim2-same-condition-loss-std-d17.ipynb		Burst Analysis-DEPI-sim2-same-condition-loss-std-d17.ipynb
Burst Analysis-DEPI-sim2-same-condition-loss-std.ipynb		Burst Analysis-DEPI-sim2-same-condition-loss-std.ipynb
Continous-Time Markov Chain.ipynb		Continous-Time Markov Chain.ipynb
Export TCSPC decays from ns-ALEX Photon-HDF5.ipynb		Export TCSPC decays from ns-ALEX Photon-HDF5.ipynb
Group Results by Name.ipynb		Group Results by Name.ipynb
LICENSE		LICENSE
PIE Analysis and save bursts.ipynb		PIE Analysis and save bursts.ipynb
README.md		README.md
exptools.py		exptools.py
nbrun.py		nbrun.py

License

tritemio/mcdepi2018-paper-analysis

Folders and files

Latest commit

History

Repository files navigation

MC-DEPI Analysis Notebooks

Data

Workflow Overview

Preprocessing: burst search and grouping

Experimental data analysis and simulation fitting

Additional Notebooks

Dependencies

Installation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages