BIG DATA COMPUTING
Course objectives
General goals: The course is aimed at training students on fundamental algorithmic and programming techniques in big-data computing, tackling a variety of data mining problems on computational models used for managing massive information structures. Specific goals: Ability to analyze, model, and solve typical "Big Data" tasks by implementing machine learning pipelines using PySpark over distributed environments. Knowledge and understanding: At the end of the course the students will have deep understanding of programming models for distributed data analysis on large clusters of computers, as well as of advanced computational models for processing massive amounts of data (e.g., data streaming, MapReduce-style parallelism, and I/O-efficient algorithms). Applying knowledge and understanding: Students will be able to design and analyze algorithms in different big data settings, to write efficient code taking into account architectural features of modern computing platforms (including distributed systems), and to make use of good programming practices and advanced programming frameworks, such as Hadoop. Critical and judgmental skills: Students will be able to distinguish the proper settings in which to use different computational paradigms for big data analysis, to evaluate the advantages and disadvantages of each model, and to face challenges arising in the design and implementation of diverse big data applications. Communication skills: The students will be able to communicate effectively, summarizing the main ideas in the design of big data systems and algorithms clearly and presenting accurate technical information. Ability of learning: The goal for the class is to be broad and to touch upon a variety of techniques, introducing standard practices as well as cutting-edge research topics in this area, making it possible for the students to extend their knowledge independently according to technological changes and evolution.
Program - Frequency - Exams
Course program
Prerequisites
Books
Frequency
Exam mode
Bibliography
Program - Frequency - Exams
Course program
Prerequisites
Books
Frequency
Exam mode
Bibliography
Program - Frequency - Exams
Course program
Prerequisites
Books
Teaching mode
Frequency
Exam mode
Bibliography
Lesson mode
Program - Frequency - Exams
Course program
Prerequisites
Books
Teaching mode
Frequency
Exam mode
Bibliography
Lesson mode
- Lesson code1041764
- Academic year2025/2026
- CourseComputer Science
- CurriculumSingle curriculum
- Year2nd year
- Semester1st semester
- SSDINF/01
- CFU6