The course is part of the NORA Summer Research School 2026.
Causality and Machine Learning introduces PhD candidates and researchers to foundational concepts and modern methods at the intersection of causality and machine learning. It is designed to bridge traditional causal inference, including potential outcomes, structural causal models, graphical reasoning, identification, and semiparametric estimation, with modern machine learning methodologies for flexible nuisance estimation, policy learning, and data-driven causal analysis.
A central goal of the course is to help students formulate causal questions precisely, understand the assumptions required for identification, and implement estimation methods using modern computational tools. The course emphasizes both mathematical clarity and practical implementation.
Format: Lectures, guided examples, hands-on coding sessions, group discussion, and final project.
Credits: 5
Instructors
To complete the summer school, students will be asked to develop a project in the field of causality and machine learning. Students may work individually or in pairs. The final submission should be a PDF report including the project title and the name(s) of the student(s). The main text should describe the problem, the methods used, and the results obtained.
Reports should be in PDF file of at most 8 pages, excluding the bibliography, and must be submitted by August 16, 2026 to fabio.zennaro@uib.no. Templates can be found here:
We envision three main types of projects, which may focus on one or several of the topics covered across the different days of the course.
Students may work with a publicly available benchmark dataset. The goal is to compare different causal learning or estimation methods discussed during the course. Students are encouraged to discuss practical issues such as model misspecification, nuisance estimation, overlap, finite-sample behavior, and sensitivity to tuning choices.
Benchmark data include for instance:
Students may bring their own dataset, or use a dataset from their field of interest, and formulate a causal question that can be studied using the tools from the course. The project should clearly define the learning goal. For instance, for estimation this includes specifying treatment, outcome, covariates, target population, and causal estimand. The main emphasis should be on translating a substantive question into a precise causal problem, rather than only on obtaining numerical results.
You are also welcome to define your own causal question and choose an appropriate method. Some possible directions include:
Students may choose a paper on causal graphical models, causal inference, or causal machine learning, and work on reproducing, implementing, or extending some of its main ideas. This may involve coding the proposed method, reproducing a simulation study, applying the method to a new dataset, or critically analyzing the assumptions and theoretical results. Students may also focus on a theoretical or mathematical problem inspired by the paper.
For instance, you can read and re-implement a method from the causal machine learning literature, and apply it either to toy/simulated data or to your own data. Some suggested directions are:
Each project should include a final report containing:
By the end of the course, students should be able to:
| Day | Topic | Lecturer |
|---|---|---|
| 1 | Probability theory and random variables | Prof. Johan Pensar |
| 1 | Graphs and conditional independence | Prof. Johan Pensar |
| 1 | Bayesian networks | Prof. Johan Pensar |
| 1 | d-separation and I-equivalences | Prof. Johan Pensar |
| 2 | Pearl’s causal hierarchy | Prof. Fabio Massimo Zennaro |
| 2 | Structural causal models | Prof. Fabio Massimo Zennaro |
| 2 | Identifiability of causal queries | Prof. Fabio Massimo Zennaro |
| 2 | Backdoor adjustment, confounders and selection bias | Prof. Fabio Massimo Zennaro |
| 3 | Potential outcomes and assumptions for identification | Johan de Aguas |
| 3 | Causal estimands: ATE, ATT, CATE and policy effects | Johan de Aguas |
| 3 | Classical estimation: outcome regression and IPW | Johan de Aguas |
| 3 | Semiparametric estimation: AIPW, DML and TMLE | Johan de Aguas |
| 4 | Where do causal models come from? Expert knowledge, RCTs, observational data | Prof. Pekka Parviainen |
| 4 | Constraint-based structure learning | Prof. Pekka Parviainen |
| 4 | Score-based structure learning | Prof. Pekka Parviainen |
| 4 | Learning beyond equivalence classes | Prof. Pekka Parviainen |
Here is a list of references, including papers and textbooks, covering the topics presented at the summer school. You are invited to have a look at these sources to get a foundation of the ideas we will work on during the week.
During the summer school, attendance will be registered. After the successful submission of the final project, participants in the course will be registered in the UiB system so that their credits will be available nationally through FSWeb.
For international students, a formal certificate granting the credits can be requested by contacting the organizers or UiB administration.
Questions about the course, readings, computational setup, or final project can be directed to: fabio.zennaro@uib.no