This repository contains the materials shown during the practical sessions of the course Data Science in Chemical Engineering taught at Politecnico di Milano by Professor Alessandro Stagni.
The repository is organized into 12 main modules:
- Hands On Python - Introduction to Python programming
- SVD - Singular Value Decomposition
- PCA - Principal Component Analysis
- Unsupervised Machine Learning - Clustering and dimensionality reduction
- Supervised Machine Learning - Classification and prediction
- Regression - Linear and non-linear regression techniques
- Model Selection - Methods for choosing and validating models
- Neural Networks - Fundamentals of neural networks
- Deep Learning - Advanced neural network architectures
- Reduced Order Models - Model order reduction techniques
- Physics Informed Models - Integration of physical principles with data science
- Pandas and Imbalanced Datasets - Data manipulation and handling imbalanced data
Consult also the detaield Installation guide
- Python 3.10 or higher (recommended: Python 3.12)
- conda (Anaconda or Miniconda)
Clone the repository:
git clone https://github.com/Riccaraccio/Data-Science.git
cd Data-ScienceCreate and activate the conda environment:
conda create -n data-science python=3.12
conda activate data-scienceInstalls all required packages including PyTorch, scikit-learn, Jupyter, and visualization tools:
pip install -e .After installation, you can start JupyterNotebook for interactive work:
jupyter notebook- This page offers a concise Python programming tutorial that teaches the language basics through practical examples: https://learnxinyminutes.com/python/
- W3Schools is a popular educational website that provides free tutorials, references, and examples: https://www.w3schools.com/python/
- Github repository containing free programming books: https://github.com/EbookFoundation/free-programming-books/tree/main
- Github repository containing ideas for programming exercises: https://github.com/practical-tutorials/project-based-learning
