Data Science

Stochastic Processes for Data Science

Course objectives

GENERAL DESCRIPTION The goal of this course is to provide an overview of stochastic processes, with applications to Data Science in mind. Stochastic processes and probability are important in data science because they can be used to model and analyze a wide range of data sets, from financial data to sensor data. The course will cover three parts: a gentle introduction to combinatorial stochastic processes, Gaussian processes, and probabilistic causality. Programming in R, Matlab or Python is useful, but it is not essential. Programs in R will be used. SPECIFIC OBJECTIVES: 1. Knowledge and understanding: Understand the basics of combinatorial stochastic processes and Gaussian processes, and their applications in data science. Understand the fundamentals of probabilistic causality and be able to apply these concepts to real-world data science problems. 2. Application: Apply stochastic process to real-world data sets, using programming languages such as R, Matlab, or Python. 3. Autonomy of judgement: Analyze the benefits and limitations of different stochastic process models and determine the best model to use for a given data set. 4. Communication: Communicate effectively about stochastic processes, including design constraints, solutions, and potential applications. 5. Learning skills: Develop studies in the field of stochastic processes for data science, including the ability to undertake research in this area.

Channel 1

ENRICO SCALAS Lecturers' profile

Program - Frequency - Exams

Course program

This course consists of three parts. Based on the background and the interests of a cohort of students, each part can be expanded and dealt with in more detail at the expenses of other parts. Programming in R, Matlab or Python is useful, but it is not essential. Programs in R will be used. Part I: A gentle introduction to combinatorial stochastic processes Part II: Gaussian processes Part III: Probabilistic causality

Prerequisites

Basic knowledge of mathematical analysis and probability.

Books

GARIBALDI U., SCALAS E., Finitary Probabilistic Methods in Econophysics (2010), CAMBRIDGE UNIVERSITY PRESS VERSHYNIN R., High-Dimensional Probability, An Introduction with Applications in Data Science (2018), CAMBRIDGE UNIVERSITY PRESS PEARL J., Causality (Second edition) (2009), CAMBRIDGE UNIVERSITY PRESS

Frequency

Non-compulsory attendance

Exam mode

Written exam and oral exam. The written exam may be either a series of take home exercises to be sent to the lecturer within a predefined deadline or an open book classroom exam. The oral exam will be upon request.

Bibliography

Please, see the classroom site

Lesson mode

Regular Classes

Lesson code10615930
Academic year2025/2026
CourseData Science
CurriculumSingle curriculum
Year1st year
Semester2nd semester
SSDMAT/06
CFU6

Course catalogue

11/11/2025 - Recruiting Day Iconsulting - December 3 at 10:00 AM

04/11/2025 - Bando per conferimento Borse di Studio intitolate alla memoria di "Antonio Ventura" e dedicate ai laureati magistrali delle Facoltà di Ingegneria

03/11/2025 - Call for Applications for 27 Collaboration Grants for Support Activities

Stochastic Processes for Data Science

Course objectives

Program - Frequency - Exams

Course program

Prerequisites

Books

Frequency

Exam mode

Bibliography

Lesson mode

Data Science

Featured announcements

11/11/2025 - Recruiting Day Iconsulting - December 3 at 10:00 AM

04/11/2025 - Bando per conferimento Borse di Studio intitolate alla memoria di "Antonio Ventura" e dedicate ai laureati magistrali delle Facoltà di Ingegneria

03/11/2025 - Call for Applications for 27 Collaboration Grants for Support Activities

Stochastic Processes for Data Science

Course objectives

Program - Frequency - Exams

Course program

Prerequisites

Books

Frequency

Exam mode

Bibliography

Lesson mode