Overview of learning objectives

Note

The following extracts the primary learning objectives from the lecture slides. Exercises typically mirror these objectives (covering practical implementation, illustrative cases, or specific exam prep/questions). Objectives that are covered in the exercises exclusively are highlighted.

The rise of analytics

Illustrate how the rise of data analytics capabilities is enabled by data availability, advances in computing power, new algorithms, and maturing analytics processes.
Distinguish between descriptive, predictive, and prescriptive analytics.
Explore the Python and Jupyter analytics ecosystem (exercise).

Data preparation and exploration

Describe variables, relationships between variables, and underlying structure in the data (e.g., clusters).
Prepare and explore data sets using appropriate descriptive and visual techniques.
Explain the role of exploratory data analysis in business decision-making.

Analytical data architecture

Outline the architecture of data warehouses.
Explain dimensional modeling concepts and common schemata.
Design fact and dimension schemata from business questions and operational data.

Regression 1

Explain the stages of a model-based analytics workflow using linear regression as an example.
Interpret linear regression models, including coefficients, OLS estimation, and model evaluation.
Describe how regression models are implemented in Python.

Regression 2

Explain how business problems can be formulated as binary classification tasks and modeled using logistic regression.
Understand how logistic regression uses a linear predictor and sigmoid function to produce probabilities, apply thresholds, and interpret log-odds and coefficients.
Evaluate classification models using confusion matrices and metrics such as accuracy, precision, recall, and F1 score.
Apply model predictions to decision-making by selecting appropriate thresholds based on business costs and expected value.

Machine learning 1

Distinguish between supervised and unsupervised machine learning approaches and explain the generalization problem in supervised machine learning.
Describe the workflow of supervised machine learning, including feature engineering, train–test splitting, model training, cross-validation, and evaluation.
Connect conceptual machine learning procedures to Python implementations, including preprocessing, model training, and evaluation using scikit-learn. (see exercise)

Machine learning 2

Explain the core mechanics of selected algorithms, including regularization in linear models, distance-based prediction (k-NN), recursive partitioning (decision trees), ensemble learning (random forests), and margin maximization (SVMs).
Select and justify an appropriate modeling approach for a given predictive task, considering data characteristics, performance metrics, and the trade-off between interpretability and predictive power.

Big data 1

Explain the characteristics of big data and their implications for analytics workflows.
Compare data warehouse, data lake, and logical data warehouse architectures.
Describe the text analytics pipeline from preprocessing to representation and modeling.

Analytics in organizations

Explain the strategic role of data analytics in data-driven organizations.
Discuss the ethical and legal boundaries of data analytics in organizations.