|
1 | 1 | # Chemometric data analysis tutorials
|
2 | 2 |
|
3 |
| -[](https://mybinder.org/v2/gh/IPTC-DataAnalysisCourse/chemometrics-tutorials/HEAD) |
4 |
| - |
5 | 3 | This repository contains a series of tutorials on multivariate analysis of metabolic profiling datasets:
|
6 | 4 | - Import, scaling & normalisation.ipynb: Introduction to normalisation, scaling and data transformations.
|
7 | 5 | - Multivariate Analysis - PCA.ipynb: Multivariate chemometric analysis using Principal Component Analysis.
|
8 | 6 | - Multivariate Analysis - Supervised Analysis with PLS-DA.ipynb: Discrimnination of 2 classes with PLS-DA.
|
9 |
| - - Univariate Analysis.ipynb: Univariate analysis with linear models |
10 | 7 |
|
11 |
| -To run these tutorials download the contents of this repository and run the Jupyter Notebooks. Alternatively, these can be run on the browser via Binder, by clicking on the sticker above (subject to load on the Binder servers). |
| 8 | +To run these tutorials download the contents of this repository and run the Jupyter Notebooks. Alternatively, these can be run on the browser via Google Colaboratory by clickinking on the links at the IPTC - Data Analysis Course [repository](https://github.com/IPTC-DataAnalysisCourse). |
12 | 9 |
|
13 | 10 | All the data required to run these notebooks is provided on the 'data' folder. The dataset used in this tutorial comes from the following publication:
|
14 |
| -- Blaise, Benjamin J. et al. “Metabotyping of Caenorhabditis elegans reveals latent phenotypes.” Proceedings of the National Academy of Sciences of the United States of America 104 50 (2007): 19808-12 . |
| 11 | +- Lovestone, S., Francis, P., Kloszewska, I., Mecocci, P., Simmons, A., Soininen, H., Spenger, C., Tsolaki, M., Vellas, B., Wahlund, L.-O., Ward, M. and (2009), AddNeuroMed—The European Collaboration for the Discovery of Novel Biomarkers for Alzheimer's Disease. Annals of the New York Academy of Sciences, 1180: 36-46. [https://doi.org/10.1111/j.1749-6632.2009.05064.x](https://doi.org/10.1111/j.1749-6632.2009.05064.x). |
15 | 12 |
|
16 |
| -It is a set of 139 proton high-resolution magic angle spinning NMR spectroscopy (1H HR-MAS NMR) spectra from |
17 |
| -C. elegans nematodes. There are two main biological sources of variation in the dataset: |
18 |
| -- Genotype (1: wild-type, 2: *sod-2* mutants) |
19 |
| -- Age/life stage (1: younger L2 worms, 2: L4 worms) |
| 13 | +It is a set of urine samples 577 individuals processes by using Liquid Chromatography - Mass Spectrometery instrument. |
| 14 | +There are two main biological sources of variation in the dataset: |
| 15 | +- Gender/Sex (0: Biological female (n=294)) , 1: Biological male (n=283)) |
| 16 | +- Age |
20 | 17 |
|
21 | 18 |
|
22 | 19 | ## Installation instructions:
|
23 |
| -To run these tutorials, first download and install the latest Anaconda Python 3.x distribution available. |
24 |
| -The following packages also need to be installed: |
25 |
| - - plotly: Available via conda (open an Anaconda prompt and type "conda install plotly") |
26 |
| -After installing plotly, either clone this repository or download as .zip file. Access the notebooks contents via the Jupyter notebook environment. |
| 20 | +To run these tutorials localy, first download and install the latest Anaconda Python 3.10.x distribution available. |
| 21 | +The following packages also need to be installed (installed specific version if stated for compatibility): |
| 22 | +- plotly: Available via conda (open an Anaconda prompt and type "conda install plotly") |
| 23 | +- SciPy: 1.11.3 |
| 24 | +- NumPy: 1.26.0 |
| 25 | +- scikit-learn: 1.2.1 |
| 26 | +- Pandas: 2.1.1 |
| 27 | +- Matplotlib: 3.6.2 |
| 28 | +- seaborn |
| 29 | +- kneed: 0.8.5 |
27 | 30 |
|
28 | 31 | All other dependencies required to run the tutorials (pyChemometrics toolbox) are bundled with the repository, and no specific installation is required.
|
29 | 32 |
|
| 33 | +After installing plotly, either clone this repository or download as .zip file. Access the notebooks contents via the Jupyter notebook environment. |
| 34 | + |
30 | 35 | For more information on how to use Jupyter notebooks see [Using the Jupyter notebook](https://docs.anaconda.com/ae-notebooks/user-guide/basic-tasks/apps/jupyter/)
|
31 | 36 |
|
0 commit comments