Data Science using Python
“From scientific Python to deployed, responsible machine learning — the complete data scientist's journey.”
A complete, hands-on, international-standard data science programme. Master the full lifecycle — scientific Python and mathematics, data wrangling, exploratory analysis, statistics, the entire machine-learning workflow with scikit-learn, deep learning, natural-language processing, time-series forecasting, and production MLOps — then ship a responsible, portfolio-ready capstone. Aligned with the CRISP-DM lifecycle, the ACM Data Science curricula and the NIST AI Risk Management Framework. Built for students who know a little Python and want to become job-ready data scientists.
📜 Aligned with: CRISP-DM lifecycle · ACM/IEEE Data Science Body of Knowledge · NIST AI Risk Management Framework · scikit-learn & PyTorch ecosystems
What you'll be able to do
Course syllabus
Eleven modules take you from scientific Python to deployed, responsible machine-learning systems and a portfolio-ready capstone. Work at your own pace.
Set up a reproducible data-science environment and master NumPy vectorisation, the linear algebra, calculus and probability intuition that every model depends on.
Pull data from files, databases, APIs and the web, then clean, merge, reshape and engineer it into tidy, analysis-ready datasets.
Profile data, expose patterns and outliers, and tell a clear visual story with Matplotlib, Seaborn and Plotly.
Reason under uncertainty: distributions, sampling, estimation, hypothesis testing, A/B testing and Bayesian thinking — with SciPy and statsmodels.
The complete supervised-learning workflow in scikit-learn: regression, classification, pipelines, cross-validation and honest model evaluation.
Ensembles and gradient boosting, clustering, dimensionality reduction, feature engineering and hyper-parameter tuning that win competitions.
Build neural networks from neuron to CNN with PyTorch and Keras — backpropagation, training tricks, transfer learning and computer vision.
Turn text into insight: preprocessing, embeddings, classification, and modern transformer models with spaCy and Hugging Face.
Decompose, test and forecast temporal data with ARIMA, exponential smoothing and machine-learning forecasters — and evaluate them properly.
Take a model to production: experiment tracking with MLflow, packaging with Docker, serving with FastAPI, and monitoring for drift.
Fairness, explainability, privacy and governance with the NIST AI RMF — plus a GitHub portfolio, case-study interviews and your data-scientist roadmap.
Take a real dataset through the full CRISP-DM lifecycle — frame the problem, wrangle and explore, model and evaluate, then deploy and document — and submit a portfolio-ready project with a written report and a recorded walkthrough.
Who this course is for
- Aspiring data scientists and ML engineers building a job-ready portfolio
- Analysts and developers moving from reporting into predictive modelling
- Engineering, science and finance graduates who want applied machine learning
- Anyone comfortable with basic Python ready to master statistics, ML and deep learning