The broad diversity of the labs and institutions involved in the Data Science Ph.D. program is reflected in the list of courses available.
The list below is only the list of suggested courses, but the students can select their courses from the even broader list of courses available from all the institutions involved.
This list is about the courses provided in Academic Year 2017-2018. The list of courses for Academic Year 2020-2021 is available here
Agent Based Macroeconomics
Provided by: S.ANNA
Lecturers: Andrea ROVENTINI
Hours: 18
Semester: 2
Educational Goals: The course provides an overview of agent-based macroeconomics. After having discussed the limit of standard Dynamic Stochastic General Equilibrium (DSGE) models, the agent-based computational economic approach is presented, stressing how it can be employed for economic policy. Different agent-based models are then introduced (i) to stress the role of agent heterogeneity and interactions for the emergence of endogenous growth; (ii) to study how the economy self-organize after business-cycle shocks; (ii) to analyse the joint impact of monetary and macro-prudential policies; (iv) to assess how endogenous growth, catching-up and divergence emerge in an open-economy, multy-country framework. Finally, the family of the Schumpeter meeting Keynes (K+S) models is presented, stressing its capability to jointly account for endogenous growth and fluctuations and macro and micro stylized facts. The K+S model is employed as a laboratory to study the short-and long-run impact of different ensembles of innovation, industrial, monetary, fiscal, monetary, labour market climate policies. Syllabus: (1) From DSGE models to macroeconomic ABMS, (2) Exploring the role of heterogeneity and interactions with “simple” macro ABMs. (3) Topics in agent-based macroeconomics. (4) The family of Schumpeter meeting Keynes models.
Agent Based Modeling
Provided by: S.ANNA
Lecturers: Marco VALENTE
Hours: 18
Semester: 2
Educational Goals: This course is intended to serve as a broad introduction to the huge literature using agent-based computational approaches to the study of economic dynamics. It is organized in three parts. The first one (“Why?”) will discuss the roots of the critiques to the mainstream paradigm from a methodological, empirical and experimental perspective. We shall briefly review the building blocks of mainstream models (rationality, equilibrium, interactions, etc.) and shortly present some of the evidence coming from cognitive psychology and experimental economics, network theory and empirical studies, supporting the idea that bounded rationality, non-trivial interactions, non-equilibrium dynamics, heterogeneity, etc. are irreducible features of modern economies. In the second part (“What?”) we shall discuss what ACE is and what are its main tools of analysis. We will define an ABM and present many examples of classes of ABMS, from the simplest (cellular automata, evolutionary games) to the most complicated ones (micro-founded macro models). The third part (“How?”) aims at understanding how ABMs can be designed, implemented and statistically analyzed. We shall briefly present the basics of programming, by both discussing the pros and cons of using simulation platforms (Matlab, NetLogo, Swarm, LSD, etc.) vs. computer languages (Java, C++, etc.) and providing some simple “hands-on” applications to cellular automata. Finally, we will see how the outputs of ABMs simulation should be treated from a statistical point of view (e.g., Montecarlo techniques) and we will discuss two hot topics in ABM research: empirical validation and policy analysis. Syllabus: Part I. Why? Why Agent-Based Computational Economics (ACE) and Agent-Based Models (ABMs)? Empirical and theoretical underpinnings. Part II. What? The structure of ABMs; Flexibility of ABMs; Examples. Part III: How? Designing and implementing ABMS; Statistical analysis of ABMs; Applications. Part IV: Selected Topics in ACE; Empirical validation; Macroeconomic policy; Object-oriented programming.
Applied methods: Causal inference in Microeconometrics
Provided by: S.ANNA
Location: Nizza
Lecturers: Manule BAGUES, Mauro SYLOS LABINI
Hours: 20
Semester: 2
Educational Goals: The course introduces PhD students to the main empirical strategies for causal inference. It has a practical flavor; emphasis is not on proofs but on intuitions and on applications. Basic knowledge of Econometrics and Statistics is assumed. Software used is STATA. Outline: 1. Introduction, 2. Randomized Control Trials, 3. Instrumental variables, 4. Identification based on observables, 5. Differences-in- differences, 6. Regression discontinuity designs, 7. Non-standard standard error issues, 8. Introduction to Structural analysis
Basic Principles and Applications of Brain Imaging Methodologies to Neuroscience
Provided by: IMT
Lecturers: Emiliano RICCIARDI, Monica BETTA Simone ROSSI, Luca CECCHETTI
Hours: 64
Educational Goals: The course aims at introducing the fundamentals of brain metabolism and brain imaging methodologies. Neuroimaging techniques provided cognitive and social neuroscience with an unprecedented tool to investigate the neural correlates of behavior and mental functions. Here we will review the basic principles, research and clinical applications of positron emission tomography (PET), functional magnetic resonance imaging (fMRI), electroencephalography (EEG) and magnetoencephalography (MEG), non-invasive brain stimulation tools. Solid background in the concepts common to many types of neuroimaging, ranging from study design to data processing and interpretation, will be discussed to address neuroscientific questions. In particular, we will first review the basics of neurophysiology to understand the principles of brain imaging. Then, methodologies of data processing for the main brain imaging tools will be provided to the students with hands-on sessions: students will become familiar with the main pipelines for PET, fMRI and EEG data reconstruction, realignment, spatio-temporal normalization, first and second-level analyses. At the end of the course, students are expected to have general background knowledge of the basic principles, methodologies and applications of the most important brain functional techniques and to be prepared to evaluate the applicability of, and the results provided by, these methodologies for different problems in cognitive and clinical neuroscience.
Basics of databases, exploratory data analysis and programming
Provided by: UNIPI, CNR
Location: Computer Science Department, Unversity of Pisa
Lecturers: Anna MONREALE, Franco TURINI
Hours: 48
Semester: 2
Timetable: TBA
Educational Goals: The objective of the module is to align the competences of the students in computer science and base, in particular about databases, structured data analysis and programming languages.
Link: http://masterbigdata.it/en/content/alignment
Big data analytics
Provided by: UNIPI, CNR
Location: Polo Fibonacci, University of Pisa
Lecturers: Fosca GIANNOTTI
Hours: 40
Semester: 1
Timetable: https://www.di.unipi.it/en/education/mds/timetable-mds
Educational Goals: In our digital society, every human activity is mediated by information technologies. Therefore, every activity leaves digital traces behind, that can be stored in some repository. Phone call records, transaction records, web search logs, movement trajectories, social media texts and tweets, Every minute, an avalanche of “big data” is produced by humans, consciously or not, that represents a novel, accurate digital proxy of social activities at global scale. Big data provide an unprecedented “social microscope”, a novel opportunity to understand the complexity of our societies, and a paradigm shift for the social sciences. Objective of the course is twofold: an introduction to the emergent field of big data analytics and social mining, aimed at acquiring and analyzing big data from multiple sources to the purpose of discovering the patterns and models of human behavior that explain social phenomena and an introduction to the technological scenario of scalable analytics.
Prerequisites: The students are expected to be familiar with key management (financial & managerial accounting, cash flow analysis, org design, business processes) and strategy (Porter’s models, innovation management basics) concepts before starting the course. For management engineering students, the course is highly recommended at 2nd year of the MSc degree (useful complement for PSSP). Recommended for students wishing to apply for Junior Consulting projects. For data science & computer science students, EGI course is recommended.
Programme: Module 1: Foundations of competitive intelligence
- Systems thinking for management
- CI process and Key Intelligence Topics
- Sources and collection techniques
- Organizing CI in the companies
Module 2: Competitor and Market intelligence tools
- Competitive benchmarking (to assess competitive cost of operations, to analyze the true capabilities of a rival, as well as its immediate future actions)
- Blindspots
- Business ecosystems, platforms and business model innovation
Module 3: Corporate foresight tools
- Technology intelligence tools (i.e. patent analysis)
- Scenario analysis tools and techniques
- Weak signals and early warning system
Link: http://didawiki.cli.di.unipi.it/doku.php/bigdataanalytics/bda/start
Big Data and healthcare
Provided by: S.ANNA
Lecturers: Giuseppe TURCHETTI
Hours: 10
Semester: 2
Educational Goals: Obiettivo principale del corso è quello di analizzare le potenzialità dei Big Data nel settore della sanità, sia con riferimento alle attività di ricerca e sviluppo delle imprese biomedicali (pharma, medical devices, biotech) che con riferimento agli obiettivi di salute e di sostenibilità economica dei sistemi sanitari nazionali. Una attenzione particolare, inoltre, verrà dedicata a esplorare e definire modelli di servizio e di business che favoriscano, grazie ai Big Data, un maggiore engagement del paziente - e del cittadino in generale - nei confronti della aderenza alla terapia, del controllo attivo (empowerment) e della gestione (self management) della propria malattia o del proprio wellness, anche in una prospettiva di prevenzione. Infine, verranno svolte esecitazioni pratiche finalizzate a disegnare nuovi modelli organizzativi e di interazione tra diversi stakeholders nel settore della sanità (cittadini, attori del sistema sanitario, industria biomedicale, pagatori, ecc.) e modelli di stima dell'impatto di salute ed economico derivante dall'utilizzo di Big Data in sanità.
Big Data Ethics
Provided by: UNIPI, S.ANNA
Location: Computer Science Department, Unversity of Pisa
Lecturers: A. MONREALE, S. RUGGIERI, G, COMANDE'
Hours: 40
Semester: 2
Timetable: TBA
Educational Goals: The module aims to introduce ethical and legal notions of privacy, anonymity, transparency and discrimination, also referring the Directives and Regulations of the European Union and their ongoing evolution. Tho module will show Privacy-by-Design models and technologies that are useful to protect the users' rights and that allow the analysis of Big Data without harming the right to the protection of personal data, to transparency and to a fair treatment.
Link: http://masterbigdata.it/en/content/big-data-ethics
Big data in/for/from the Public Sector
Provided by: S.ANNA
Lecturers: Fabio PACINI
Hours: 10
Semester: 2
Educational Goals: The aim of the course is to provide a general understanding of Public and Constitutional law to the students, who will get a conceptual framework of the processes through which each public policy is constructed, implemented and eventually evaluated by political and administrative institutions. The first module (10 hrs.) will focus on the current and future implications of the use of Big Data within the public decision-making across the different policy areas. In order to obtain a necessary alignment of competences for students coming from different backgrounds, the first part of the module will be devoted to a concise description of political decision-making from the point of view of Public and Constitutional law, both in Italy and abroad. The course will then deal with the analysis of the current and possible uses of Big Data within these public decision-making processes, as well as the ethical and political issues that it raises. The second module (10 hrs.) will focus, with an interactive approach, on the opportunities that can be given by the exploitation of Big Data produced and made available by political and administrative institutions.
Big data sources, crowdsourcing, crowdsensing
Provided by: UNIPI, CNR
Location: Computer Science Department, Unversity of Pisa
Lecturers: Stefano CRESCI
Hours: 20
Semester: 2
Timetable: TBA
Educational Goals: This module presentes techniques and methods for acquisition of Big Data from a large sources of data available, including mobile phone data, GPS data, customer purchase data, social network data, open and administrative data, environmental and personal sensor data. We discuss also several participatory methods for crowdsourcing or crowdsensing collection of data through ad hoc campains like serious games and viral diffusion.
Link: http://masterbigdata.it/en/content/big-data-sources-crowdsourcing-crowds...
Bioinformatics
Provided by: UNIPI
Location: Polo Fibonacci, University of Pisa
Lecturers: Nadia PISANTI
Hours: 48
Semester: 2
Timetable: https://www.di.unipi.it/en/education/mcs/timetable-wif
Educational Goals: This course has the goal to give the student an overview of algorithmic methods that have been conceived for the analysis of genomic sequences. We will focus both on theoretical and combinatorial aspects as well as on practical issues such as whole genomes sequencing, sequences alignments, the inference of repeated patterns and of long approximated repetitions, the computation of genomic distances, and several biologically relevant problems for the management and investigation of genomic data. The exam has the goal to evaluate the students understanding of the problems and the methods described in the course. Moreover, the exam is additionally meant as a chance to learn how a scientific paper is like, and how to make an oral presentation on scientific/technical topics, that is designed for a specific audience.
Prerequisites: A Basic course on algorithms
Programme: A brief introduction to molecular biology: DNA, proteins, the cell, the synthesis of a protein.
Sequences Alignments: Dynamic Programming methods for local, global, and semi-local alignments. Computing the Longest Common Subsequences. Multiple Alignments.
Pattern Matching: Exact Pattern Matching: algorithms (Knuth-)Morris-Pratt, Boyer-Moore, Karp-Rabin with preprocessing of the pattern. Algorithm with preprocessing of the text: use of indexes.
Motifs Extraction: KMR Algorithm for the extracion of exact motifs and its modifications for the inference of approximate motifs.
Finding Repetitions: Algorithms for the inference of long approximate repetitions. Filters for preprocessing.
Fragment Assembly: Genomes sequencing: some history, scientific opportunities, and practical problems. Some possible approaches for the problem of assembling sequenced fragments. Link with the “Shortest common superstring” problem, the Greedy solution. Data structures for representing and searching sequencing data.
New Generation Sequencing: Applications of High Throughput Sequencing and its algorithmic problems and challenges. Investigating data types resulting from the existing biotechnologies, and the possible data structures and algorithms for their storage and analysis.
Link: http://www.di.unipi.it/scripts/PD2/corso.php?lang=en&cds=WIF-LM&anno=201...
http://didawiki.cli.di.unipi.it/doku.php/bio/start
Cloud Computing & Big-Data
Provided by: S.ANNA
Lecturers: Tommaso CUCINOTTA
Hours: 30
Semester: 1
Educational Goals: This course provides an overview of the challenges to face, and the technical solutions to embrace, when building large-scale, fault-tolerant, distributed and replicated real-time cloud services. These systems need to be capable of serving millions/billions of requests per second with industrial-grade reliability, availability and performance, and are composed of thousands of components spanning across millions of machines, worldwide. The course focuses on design, development and operations of scalable software systems, including big-data processing and analytics, where the huge volumes of data to handle mandates the use of heavily distributed algorithms. The course covers also basic concepts on architectures of data-centre/cloud infrastructures.
Prerequisites: Computer architectures and networks
Cloud Computing & Big-Data Lab
Provided by: S.ANNA
Lecturers: Tommaso CUCINOTTA
Hours: 30
Semester: 2
Educational Goals: Hands-on follow-up to the Cloud Computing & Big-Data course. This is an applied course where students will put in practice the theoretical/abstract concepts acquired in the general course on Cloud Computing & Big-Data. During the practical sessions, we'll have a deep dive on such concepts as: machine virtualization (KVM) and OS-level virtualization (LXC) on Linux; virtual networking on Linux; network programming and distributed RPC frameworks; programming abstractions for cloud and distributed computing; elasticity in practice; big-data programming frameworks; Hadoop Map-Reduce; Apache Spark. Requisites: (1) Cloud Computing & Big-Data course (both of 2016 courses on Cloud Computing and Big-Data & Analytics work) (2) Computer programming & scripting.
Prerequisites: Computer architectures and networks
Complements of Bioinformatics
Provided by: S.ANNA
Lecturers: Andrea ZUCCOLO
Hours: 20
Semester: 1
Programme: What is bioinformatics: a historical perspective from 1953 up to the next-gen sequencing;
Similarity searches: intro, basics, global vs local, dynamic programming, dot-plot, fasta, blast scoring matrix; pattern discovery
Following the journey of a genome: from reads to assembly to annotation to comparative genomics:
o Overall sequencing strategy (de novo vs reference based)
o Technical differences (sequencers, strategies: Hierarchical-BAC based, Whole Genome Shotgun, mixed strategies etc.)
o Genome assemblers: greedy graph, OLC, De Brujin's graph based
o The “validation” and “quality assessment” of a genome assembly
o Annotation: genes, pseudogenes, promoters, transcription factors, repeats (and Transposable Elements)
o Comparisons: SNPs, rearrangements, epigenomic, RNA seq
Complex Networks for Data Science
Provided by: IMT
Lecturers: Guido CALDARELLI, T. SQUARTINI, G. CIMINI
Hours: 40
Programme: Part I: Introduction to complex systems and networks
Complexity, self-similarity, scaling, self-organised criticality.
Definition of graphs, real networks and their properties.
Centrality metrics, ranking and reputation.
Models of static networks, models of network growth.
Part II: Algorithms and models
Spectral properties of graphs.
Community detection.
Bipartite and multilayer networks.
Applications: World Trade Web
Part III: Statistical mechanics of networks
Information theory, Exponential Random Graphs.
Hypothesis testing on networks and reconstruction of networks.
Null models for time series.
Generating functions formalism.
Part IV: Dynamical processes on networks
Mean field and master equations.
Percolation and epidemic models.
Contagion: the case of financial networks.
Game theory on networks, networks from time series, visibility graphs.
Cultural Heritage and Law
Provided by: IMT
Lecturers: Lorenzo CASINI
Hours: 60
Educational Goals: International Law, EU Law, and Domestic Law on Cultural Heritage. Basic elements of comparative law. Definition of Cultural Heritage. The institution of protection of cultural heritage in Italy. Fundamental principles and main public interests: protection, circulation, access. Problems and cases (Case law). - European Landscape Convention and Domestic Law on Landscape. Basic elements of comparative law. Principles and main issues: definition of landscape; levels of governance; public law instruments.Problems and cases (Case law).
Data Driven Innovation
Provided by: UNIPI, S.ANNA
Location: Computer Science Department, Unversity of Pisa
Lecturers: Alberto DI MININ, Andrea PICCALUGA
Hours: 20
Semester: 2
Timetable: TBA
Educational Goals: The module aims to show the main characteristics of the innovation processes in companies and institutions. After some basics of innovation economics, the management of the innovation processes will be presented (role of R&D, Open Innovation, etc.). The module also shows new innovation opportunities available after the last progresses in large scale data acquisition and elaboration, the basics of business models and start-ups. An exercise of business model innovation will try to explore che big data potential in opening new business opportunities.
Link: http://masterbigdata.it/en/content/data-driven-innovation
Data Journalism & Story Telling
Provided by: UNIPI, CNR
Location: Computer Science Department, Unversity of Pisa
Lecturers: Luce DE BIASE, Andrea MARCHETTI
Hours: 20
Semester: 2
Timetable: TBA
Educational Goals: The module aims to teach how to present the knowledge extracted from big data using multimedia story telling. It also shows some of the most recent and meaningful experiences of journalism and story telling based on quantitative information extracted from different data sources.
Link: http://masterbigdata.it/en/content/data-journalism-story-telling
Data Management for Business Intelligence
Provided by: UNIPI, CNR
Location: Computer Science Department, Unversity of Pisa
Lecturers: Salvatore RUGGIERI
Hours: 20
Semester: 2
Timetable: TBA
Educational Goals: The module shows technologies and systems for accessing, managing and analysing Big Data for decision support. Technologies and analysis of problems are shown using examples and case studies in lab. The student will acquire skills on the main technologies for business intelligence and big data management, including data warehouse and online analytical processing technologies.
Link: http://masterbigdata.it/en/content/data-management-business-intelligence-0
Data Mining
Provided by: UNIPI
Location: Polo Fibonacci, University of Pisa
Lecturers: Dino PEDRESCHI
Hours: 80
Timetable: https://www.di.unipi.it/en/education/mds/timetable-mds
Educational Goals: DATA MINING: FOUNDATIONS
The formidable advances in computing power, data acquisition, data storage and connectivity have created unprecedented amounts of data. Data mining, i.e., the science of extracting knowledge from these masses of data, has therefore been affirmed as an interdisciplinary branch of computer science. Data mining techniques have been applied to many industrial, scientific, and social problems, and are believed to have an ever deeper impact on society. The course objective is to provide an introduction to the basic concepts of data mining and the process of extracting knowledge, with insights into analytical models and the most common algorithms.
DATA MINING: ADVANCED ASPECTS AND APPLICATIONS
The second part of the course completes the knowledge of the first module with: a review of advanced techniques for the mining of new forms of data; a review of the main application areas and case studies.
Programme: DATA MINING: FOUNDATIONS
Fundamentals of data mining and the knowledge discovery process
Explorative Data Analysis and Visual analytics
Frequent patterns and Rules
Clustering: partition based techniques, density based techniques and hierarchical techniques
Classification: decision trees and Bayesian Methods
Analytical experiments with data mining tools
DATA MINING: advanced aspects and applications
Mining of time series and spatio-temporal data
Mining of sequential data and graphs
Advanced techniques for classification, clustering and outlier detection
Language, standard and architectures of data mining systems
Social impact of data mining
Data mining and privacy protection
Case studies in socio-economic domains (marketing and CRM, mobility and transport, public health, etc.)
Link: http://www.di.unipi.it/scripts/PD2/corso.php?lang=en&cds=WDS-LM&anno=201...
http://didawiki.cli.di.unipi.it/doku.php/dm/start
Data Mining and Machine Learning
Provided by: UNIPI, CNR
Location: Computer Science Department, Unversity of Pisa
Lecturers: Dino PEDRESCHI, Fosca GIANNOTTI
Hours: 40
Semester: 2
Timetable: TBA
Educational Goals: The module provides an introduction to base concepts of data mining and knowledge extraction process, introducing analytical models and algorithms for clustering, classification and pattern discovery, also referring Big Data sources.
Link: http://masterbigdata.it/en/content/data-mining-machine-learning
Data Science Colloqium
Provided by: ALL
Lecturers: Dino PEDRESCHI (coord.)
Hours: 40
Data Science for Quantitative Finance
Provided by: UNIPI, SNS
Location: Computer Science Department, Unversity of Pisa
Lecturers: Fabrizio LILLO
Hours: 20
Semester: 2
Timetable: TBA
Educational Goals: The course presents the main elements for understanding financial markets, their structure, and technological infrastructure. Specifically, the course provides a background on basic empirical modeling of financial time series, from low to ultrahigh frequency, identifying the key data science aspects including data storage, latency, high dimensional inference, etc. It also covers semantic analysis of texts from news feed and social networks for financial forecasting. Finally, the course introduces some elements of computational and numerical applications to financial problems, ranging from pricing to optimal execution and portfolio optimization.
Link: http://masterbigdata.it/en/content/data-science-quantitive-finance
Data Visualization & Visual analytics
Provided by: UNIPI, CNR
Location: Computer Science Department, Unversity of Pisa
Lecturers: Salvatore RINZIVILLO
Hours: 20
Semester: 2
Timetable: TBA
Educational Goals: The module aims to present the basic methods and techniques for the visualization and presentation of the information obtained from different sources: structured data (relational, hierarchical, trees), network data (social network), temporal data, spatial data and spatio-temporal data. Studying the existing methods and tools, some scenarios of visual analytics will be presented.
Link: http://masterbigdata.it/en/content/data-visualization-visual-analytics
Decision-Making in Economics and Management
Provided by: IMT
Lecturers: Massimo RICCABONI
Hours: 20
Educational Goals: The main goals of the course are: (1) to take economic theories and methodologies out into the world, applying them to interesting questions of individual behavior and societal outcomes; (2) to develop a basic understanding of human psychology and social dynamics as they apply to marketing contexts; (3) to become familiar with the major theory and research methods for analyzing consumer behavior; (4) to develop market analytics insight into consumer actions. Most of time will be devoted to close reading of research papers, including discussion of the relative merits of particular methodologies. Students will participate actively in class discussion, engage with cutting-edge research, evaluate empirical data, and write an analytical paper. The course aims at enabling students to develop and enhance their own skills and interests as applied microeconomists.
Economic Networks
Provided by: S.ANNA
Location: Nizza
Lecturers: Giorgio FAGIOLO
Hours: 18
Semester: 2
Educational Goals: This course introduces the “science of networks” for economists. The first part of the course discusses examples of real-world networks in hard and social sciences. We ask why networks are important for economists and what are the main network-related questions as far as models and empirical analyses are concerned. We then present more formally graph theory and explore network statistics. We finally move to models of network formation and present some relevant applications to economics (e.g. trade networks). Syllabus: • What is a network • Examples of networks • Why networks in economics and social sciences • Networks and graphs • Measures and metrics on networks • Distribution of metrics and measures in large networks • Models of network formation • Null statistical network models • Tutorial #1: Network data and network visualization • Tutorial #2: Network measures and metrics • Tutorial #3: Distributional analysis and network randomization • Economic applications.
Ethics and legal dimensions of data science
Provided by: S.ANNA
Lecturers: Giovani COMANDE' (coord.)
Hours: 10
Semester: 2
Educational Goals: The aim of the course is to provide a general understanding of Public and Constitutional law to the students, who will get a conceptual framework of the processes through which each public policy is constructed, implemented and eventually evaluated by political and administrative institutions. The first module (10 hrs.) will focus on the current and future implications of the use of Big Data within the public decision-making across the different policy areas. In order to obtain a necessary alignment of competences for students coming from different backgrounds, the first part of the module will be devoted to a concise description of political decision-making from the point of view of Public and Constitutional law, both in Italy and abroad. The course will then deal with the analysis of the current and possible uses of Big Data within these public decision-making processes, as well as the ethical and political issues that it raises. The second module (10 hrs.) will focus, with an interactive approach, on the opportunities that can be given by the exploitation of Big Data produced and made available by political and administrative institutions.
European Statistical System
Provided by: UNIPI
Lecturers: Monica PRATESI (coord.)
Programme: On 27 October 2017 in Room F2 from 12.15 to 13.45 and on 30 October 2017 in Room L2 from 12.15 to
13.45, Prof. Daniela Ghio (Ministero dell'Interno) will take the Seminars on "The European Statistical
System";
- On 10, 13, 17 and 20 November 2017 from 12.15 to 13.45, Prof. Luigi Biggeri (University of Florence) will
take lectures on "The estimation and computation of Income, Consumption and PPPs in the European
Statistical System" (Room F2 and L2);
- On 9 November 2017 from 12.15 to 13.45 in Room F2, Prof. Tommaso Rondinella (ISTAT Roma) will take
lectures on "Measuring Equitable and Sustainable Well-being: the Italian BES and the Sustainable
Development Goals";
- On 21st of November 2017 and the 5th of December 2017 at 16.00, Prof. Natalie Shlomo, Magna
Room, distance lectures;
- On 30 th November 2017 and 1 st December 2017 from 12.15 to 13.45 (Room F2), Prof. Ulrich Rendtel
Freie University , lectures “Issues of the analysis of socio-economic panel surveys”;
- On 27th November 2017 from 12.15 to 13,45 (Room L2 )Lecture Prof. Parthasarathi Lahiri, Maryland
Population Research Center, “Big Data, Big Promise, Big Challenge: Can Small Area Estimation Play a Role
in the Big Data Centric World?”
Financial Analysis
Provided by: SNS
Lecturers: ?
Hours: 40
Genomica avanzata
Provided by: S.ANNA, UNIPI
Lecturers: Mario Enrico PE', Andrea ZUCCOLO
Hours: 64
Semester: 2
Educational Goals: Il corso si propone di fornire conoscenze di base sulla struttura, la funzione e l’evoluzione dei genomi di procarioti e di eucarioti. Saranno considerate le diverse metodiche utilizzate per lo studio dei genomi e sarà discusso come l’adozione di approcci genomici hanno cambiato il modo di affrontare le problematiche biologiche. Il corso inoltre prevede di introdurre gli studenti all’utilizzo e alla comprensione degli strumenti bioinformatici necessari alla gestione e all’analisi dei dati provenienti da esperimenti di sequenziamento. Accanto alla presentazione delle tecniche e degli algoritmi sottostanti saranno proposte attività pratiche su dati reali relativi a genomi batterici, animali e vegetali.
High frequency finance and market microstructure
Provided by: SNS
Lecturers: Fabrizio LILLO
Hours: 40
High Performance & Scalable Analytics, NO-SQL Big Data Platforms
Provided by: UNIPI, CNR
Location: Computer Science Department, Unversity of Pisa
Lecturers: Roberto TRASARTI, Claudio LUCCHESE
Hours: 20
Semester: 2
Timetable: TBA
Educational Goals: The aim of this course is to introduce the student with the high performance Big Data management tools. The student will gain expertise in the use od NO-SQL platforms for the analysis and mining of large data volumes, thus performing tasks that would not be feasible with traditional data bases.
Link: http://masterbigdata.it/en/content/high-performance-scalable-analytics-n...
How to do research
Provided by: S.ANNA
Lecturers: Giorgio BUTTAZZO
Hours: 30
Semester: 2
Educational Goals: The course will cover the entire process needed during a research study, from the initial phase in which a new problem is addressed, formalized, and solved, up to the final phase in which the achieved results have to be communicated to the scientific community. The course is divided in six lectures of 3 hours each. The first lecture explains the meaning of research and how to approach the various steps involved in the process. The second lecture explains how to write papers, set a good structure, outline the contents, make figures, cite references, and avoid common mistakes. The third lecture explains how the scientific publication process works and how to participate in it, simulating a typical paper selection process of a conference. The fourth lecture explains how to write successful research projects. The fifth lecture will simulate a program committee meeting, where papers are discussed, evaluated, and comments are sent back to the authors for revision. The sixth lecture explains how to make presentations, good slides, and have the appropriate attitude when presenting the work. The seventh lecture part is devoted to paper presentations, simulating a small conference run by the students.
Hybrid programming for scientific computing
Provided by: SNS
Lecturers: Giordano MANCINI
Hours: 40
Identification, Analysis and Control of Dynamical Systems
Provided by: IMT
Lecturers: Alberto BEMPORAD
Hours: 20
Educational Goals: The course provides an introduction to dynamical systems, with emphasis on linear systems. After introducing the basic concepts of stability, controllability and observability, the course covers the main techniques for the synthesis of stabilizing controllers (state-feedback controllers and linear quadratic regulators) and of state estimators (Luenberger observer and Kalman filter). The course also covers data-driven approaches of parametric identification to obtain models of dynamical systems from a set of data, with emphasis on the analysis of the robustness of the estimated models w.r.t. noise on data and on the numerical implementation of the algorithms.
Informatica Chimica
Provided by: SNS
Lecturers: ?
Hours: 40
Information Retrieval
Provided by: UNIPI, CNR
Location: Computer Science Department, Unversity of Pisa
Lecturers: Paolo FERRAGINA
Hours: 40
Semester: 2
Timetable: TBA
Educational Goals: The module aims to teach the software modules used to build a modern search engine and the analysis of the performance and the computational limits of the algorithmics solutions currently used in each of them. Practical and theoretical fundmentals for the organization and the analysis of IR systems.
Link: http://masterbigdata.it/en/content/information-retrieval
Introduction to Cognitive and Social Psyschology
Provided by: IMT
Lecturers: Pietro PIETRINI, Emiliano RICCIARDI
Hours: 24
Educational Goals: This course will provide an introduction to general themes in Cognitive and Social Psychology. In the first part of the course, we will review seminal findings that had a major impact on our knowledge of cognitive processes and social interactions, as well as more recent studies that took advantage of neuroimaging, electrophysiology and brain stimulation methods to shed new light on decision-making and social behaviors. During the second part of the course, students will be asked to perform a brief presentation of a research article and to critically discuss positive aspects and limitations of the study. The course will include seminars and lectures by renowned researchers in the field and will educate PhD candidates about the influence of social aspects of the human nature on cognitive and brain functioning (and vice-versa) in an intellectually motivating manner.
Introduction to Evolutionary Biology
Provided by: S.ANNA
Lecturers: Andrea ZUCCOLO
Hours: 15
Semester: 2
Educational Goals: The main goal of the course is to introduce the basic principles underlying evolutionary biology and to present and discuss rationale and methods of phylogenetic inference. We'll discuss the process of selecting and gathering appropriate datasets for subsequent analyses and we'll explain three widely used methods of phylogenetic inference: parsimony, distance, and likelihood methods. Particular attention will be paid to the methods used to evaluate objectively the reliability and accuracy of the resulting inferences. Finally, we'll critically consider the results of phylogenetic reconstruction—how they can shed light on past evolutionary events, such as gene duplications and lateral gene transfers, as well as how they can be used for other purposes, such as predicting gene function and resolving RNA secondary structures.
Introduction to neural networks
Provided by: S.ANNA
Lecturers: Giorgio BUTTAZZO
Hours: 30
Semester: 2
Prerequisites: C programming and matrix math
Programme: Motivations of research. Main neural models. Learning paradigms.
Associative memories and Hopfield networks
Layered networks.
Kohonen networks
Reinforcement Learning
Legally Compliant Data Science
Provided by: S.ANNA, UNIPI
Lecturers: Giovanni COMANDE'
Hours: 36
Semester: 1
Machine Learning
Provided by: IMT
Lecturers: Giorgio Stefano GNECCO
Hours: 20
Educational Goals: The course provides an introduction to basic concepts in machine learning. Topics include: learning theory (bias/variance tradeoff; Vapnik-Chervonenkis dimension and Rademacher complexity, cross-validation, feature selection); supervised learning (linear regression, logistic regression, support vector machines); unsupervised learning (clustering, principal and independent component analysis); semisupervised learning (Laplacian support vector machines); online learning (perceptron algorithm); hidden Markov models.
Methodologies for the social sciences
Provided by: SNS
Lecturers: ?
Hours: 40
Microeconometria
Provided by: S.ANNA
Lecturers: Federico TAMAGNI
Hours: 30
Semester: 2
Educational Goals: Il corso presenta le principali tecniche econometriche per dati micro e panel: proprietà e problemi di inconsistenza delle stime OLS; modelli a variabile dipendente discreta; modelli panel lineari (statici e dinamici); soluzioni ai problemi di endogeneita' e selezione del campione; tecniche per l'analisi distribuzionale (stime kernel e regressioni quantiliche). Le lezioni prevedono sessioni di illustrazione ed esercitazione al computer, utilizzando il software statistico STATA.
Microeconomics
Provided by: S.ANNA
Lecturers: Laura MAGAZZINI, Federico TAMAGNI
Hours: 20
Semester: 2
Educational Goals: The course aims at providing students the econometric tools for dealing with the analysis of individual-level data on the economic behavior of individuals or firms. Regression methods for the analysis of static and dynamic panel data models and estimation of limited dependent variable models will be considered, including discrete choice models, censored and truncated regressions, and sample selection. Besides the theoretical background, students will be exposed to the discussion and the analysis of examples and empirical applications. The course requires knowledge of basic econometric tools for the analysis of linear models (regression methods) and of the method of maximum likelihood for estimation. The software STATA will be employed for the analysis. A basic working knowledge is preferred. Outline: (1). Linear panel data models: 1.1. The ‘panel’ solution to the omitted variable bias. 1.2. Estimation: fixed effects versus random effects. 1.3. Dynamic panel data models: the Nickel (1981) bias; GMM estimation. (2). Discrete choice models: 2.1. The linear probability model. 2.2. Logit and probit. 2.3. Multinomial models. (3). Count data models. (4). Truncation and censoring: 4.1. Truncated regression. 4.2. The censored regression model (Tobit). 4.3. Sample selection.
Mobility Data Analysis
Provided by: UNIPI, CNR
Location: Computer Science Department, Unversity of Pisa
Lecturers: Mirco NANNI
Hours: 20
Semester: 2
Timetable: TBA
Educational Goals: The purpose of the course is to introduce the main analysis techniques for spatio-temporal data, with a particular focus on human mobility (including vehicles), aimed to better understand the overall mobility of a territory. The presentation will be supported by several case studies developed with the SoBigData.eu laboratory.
Link: http://masterbigdata.it/en/content/mobility-data-analysis
Neurogenomics
Provided by: SNS
Lecturers: ?
Hours: 40
Peer to peer systems and blockchains
Provided by: UNIPI
Location: Polo Fibonacci, University of Pisa
Lecturers: Laura RICCI
Hours: 48
Semester: 2
Timetable: https://www.di.unipi.it/en/education/mcs/timetable-wif
Educational Goals: This course introduces the basic principles and tools to define and develop a peer to peer (P2P) system, with a focus on Distributed Hash Tables and the distributed technology of blockchains and on the cryptocurencies. The first part of the course introduces the general concepts underlying any P2P system (topology, information diffusion,...). This part also present several case studies (Bittorrent, the KAD network,..).Cryptocurrencies and, more in general, blockchains, are a recent ”killer application” in the area of P2P systems. The second part of the course presents and discusses the blockchain technology and the decentralized digital currencies (cryptocurrencies) such as Bitcoin. The course introduces both the theory and principles at the basis of cryptocurrencies operations and practical examples of their use. This part introduces the cryptocurrency ecosystem and discusses the existing and potential interaction of cryptocurrencies with the banking, financial, legal and regulatory environment. Lastly the course details how innovative applications exploit blockchain technology
Prerequisites: A basic course in networking
Programme: P2P Topologies
– Peer to Peer (P2P) systems: general concepts
– Unstructured Overlays: Flooding, Random Walks, Epidemic Diffusion
– Structured Overlays: Distributed Hash Tables (DHT), Routing on a DHT
– Case Studies:
∗ Bittorrent as a Content Distribution Network: KAD implementation of
the Kademlia DHT, game-based cooperation
• Complex Network for the analysis of P2P systems
– Network models
∗ Random Graphs and Small Worlds
∗ Small World navigability: Watts Strogatz and Kleinberg.
∗ Complex networks navigability
• Cryptocurrencies and Blockchains
– basic concepts:
∗ review of basic cryptographic tools (digital signatures, cryptographic hash,
Merkle trees.,..)
∗ blockchains: definitions
∗ distributed consensus: definitions
– the Bitcoin blockchains
∗ Nakamoto consensus
∗ Bitcoin mining mechanism
∗ pseudoanonymity: traceability and mixing
∗ the Bitcoin P2P Network
∗ Bitcoin ecosystem
∗ scalability issues
∗ Bitcoin Extensions/alternatives: altcoins, sidechains, the StellarConsensus
Protocol
– Applications of blockchains
∗ Ethereum: programming smart contracts
∗ Blockchain 1.0: cryptocurrencies
∗ Blockchain 2.0: financial instruments built on cryptocurrencies
∗ Blockchain 3.0: applications beyond cryptocurrencies (DNS, lotteries, vot-
ing, IoT...)
Link: https://elearning.di.unipi.it/course/view.php?id=89
PhD+: Research valorization, innovatiom entrepreneural mindset
Provided by: UNIPI
Lecturers: (vari)
Semester: 2
Timetable: https://www.unipi.it/index.php/phd/item/11226-programme-2017
Educational Goals: PhD+ is a unique programme aimed at fostering innovation and entrepreneurial mind-set in students and graduates of the University of Pisa, PhD students and PhDs of all Superior Graduate Schools in Tuscany, and academics. PhD+ consists of a series of interactive and engaging lectures combined with coaching and mentoring activities, given by top-level experts in innovation and technology transfer.
PhD+ is one of the best practice of training in research valorisation, innovation and entrepreneurship, also recognized by the Network of Design for Resilient Entrepreneurship, within the ENDuRE European project. To date, PhD+ has received many national and international awards, also thanks to the successes of its multi-awarded spin-offs.
From this year onwards the PhD+ will represent the qualified training offer within the Contamination Lab project.
The 2018 edition will take place from 6th February to 8th March and will present new features:
• more topics related to research valorisation and European funding opportunities will be discussed;
• • this edition of the course will be more internationally-oriented, thanks to the collaboration with the Brasilian Federal University of Pernanbuco, allowing the students of these Universities to attend the lectures in video streaming by means of the Unipi Mediateca platform;
• it will be cutting edge on new technological trends.
The past PhD+ edition lectures and seminars are available in video streaming on the Mediateca platform of the University of Pisa.
Link: https://www.unipi.it/index.php/research/itemlist/category/590
Principles of Brain Anatomy and Physiology
Provided by: IMT
Lecturers: Luca CECCHETTI, Michele EMDIN
Hours: 36
Educational Goals: The course aims at introducing the fundamentals of brain anatomy and physiology. In the first part of the course we will revise the basics of neuron structure and function, as well as synaptic mechanisms and cytoarchitectonic properties of the cortical mantle, with particular regards to visual, auditory, somatosensory and motor systems. Moving from this fine-grained description of the human brain, we will focus on gross neuroanatomy: through the use of in-vivo state-of-the-art techniques, such as structural MRI and diffusion weighted imaging, we will review gyri and sulci of the cortex, subcortical structures, brainstem nuclei and major white matter fasciculi. The second part of the course will be devoted to the study of functional neuroanatomy, with insights on the relationship between specific brain structures and human cognition, collected using functional, metabolic and receptors mapping, as well as lesion studies. In particular, the the following topics will be covered: central and peripheral nervous systems, occipital parietal frontal temporal and limbic areas, subcortical nuclei and white matter fasciculi, cerebellum, methodologies of structural brain imaging: VBM, cortical thickness and folding, VLSM, Diffusion Weighted Imaging and Tractography (theory and methodologies of data processing, hands-on sessions). The last part of the course will instead cover topics related to peripheral and autonomous nervous system.
Principles of Concurrent and Distributed Programming
Provided by: IMT
Lecturers: Rocco DE NICOLA
Hours: 20
Educational Goals: The course objective is to introduce the basics of concurrent programming problems through an illustration of the concepts and techniques related to modeling systems in which there are more components that are simultaneously active and need to coordinate and compete for the use of shared resources. At the end of the course the student will have a good understanding of the constructs for concurrent programming and be able to use them to write and analyze concurrent programs.
Programming for data science
Provided by: UNIPI
Location: Polo Fibonacci, University of Pisa
Lecturers: Salvatore RUGGIERI, Giuseppe PRENCIPE
Hours: 80
Semester: 1
Timetable: https://www.di.unipi.it/en/education/mds/timetable-mds
Educational Goals: This is an introductory course to computer programming and related mathematical/logic background for students without a Bachelor in Computer Science or in Computer Engineering. The objective is to smoothly introduce the student to the programming concepts and tools needed for typical data processing and data analysis tasks. The course consists of lectures and practice in computer labs. The student will be able to use computer programming languages and related mathematical notions for problem reasoning and solving. The student will be able to separate apart the problem constraint and solutions from the actual coding in a specific computer programming language. Computational thinking is the expected ability at the end of the course.
Prerequisites: Basic mathematical notions as given in most of Bachelor programs.
Programme: Sets, relations, functions, combinatorics, grammars, automata.
Propositional and first order logic.
Induction and recurrence relations.
Imperative programming.
Object oriented programming.
Programming stack and development tools.
Python programming.
C programming.
Link: http://www.di.unipi.it/scripts/PD2/corso.php?lang=en&cds=WDS-LM&anno=201...
http://didawiki.cli.di.unipi.it/doku.php/mds/pds/start
Python Programming for Complex Networks
Provided by: IMT
Lecturers: Guido CALDARELLI
Hours: 20
Educational Goals: We present in this course the latest results in the field of complex networks. They can solve immediate problems when studying the Internet and the WWW, and they can help sort information on a variety of other systems. This course is structured in such a way as to start with a specific problem and then present the theoretical tools needed to model or to sort out the most relevant information. In the course we shall follow the various students from the setup of the software in their pc (a basic in Python will be provided in another course) to the application of existing software and the writing of specific one for the personal line of research of the students.
Quantum Information I
Provided by: SNS
Lecturers: ?
Hours: 40
Quantum information II
Provided by: SNS
Lecturers: ?
Hours: 40
Social network analysis
Provided by: UNIPI
Location: Polo Fibonacci, University of Pisa
Lecturers: Dino PEDRESCHI
Hours: 40
Semester: 2
Timetable: https://www.di.unipi.it/en/education/mds/timetable-mds
Educational Goals: Over the past decade there has been a growing public fascination with the complex “connectedness” of modern society. This connectedness is found in many contexts: in the rapid growth of the Internet and the Web, in the ease with which global communication now takes place, and in the ability of news and information as well as epidemics and financial crises to spread around the world with surprising speed and intensity. These are phenomena that involve networks and the aggregate behavior of groups of people; they are based on the links that connect us and the ways in which each of our decisions can have subtle consequences for the outcomes of everyone else. This course is an introduction to the analysis of complex networks, with a special focus on social networks and the Web - their structure and function, and how it can be exploited to search for information. Drawing on ideas from computing and information science, applied mathematics, economics and sociology, the course describes the emerging field of study that is growing at the interface of all these areas, addressing fundamental questions about how the social, economic, and technological worlds are connected. Data-driven analysis of complex networks using a variety of models and software tools.
Programme: Big graph data and social, information, biological and technological networks
The architecture of complexity and how real networks differ from random networks: node degree and long tails, social distance and small worlds, clustering and triadic closure. Comparing real networks and random graphs. The main models of network science: small world and preferential attachment.
Strong and weak ties, community structure and long-range bridges. Robustness of networks to failures and attacks. Cascades and spreading. Network models for diffusion and epidemics. The strength of weak ties for the diffusion of information. The strength of strong ties for the diffusion of innovation.
Practical network analytics with Cytoscape and Gephi. Simulation of network processes with NetLogo.
Link: http://www.di.unipi.it/scripts/PD2/corso.php?lang=en&cds=WDS-LM&anno=201...
http://didawiki.cli.di.unipi.it/doku.php/wma/start
Socio-Economic Networks
Provided by: IMT
Lecturers: Massimo RICCABONI
Hours: 20
Educational Goals: The topic of the course will be the analysis of socio-economic networks. The course will consist of two parts: (1) micro level networks of individuals and firms, (2) macro-level networks of sectors and countries. The first part will focus on social networks and the division of (innovative) labor within and across firm boundaries. The second part on the empirics of macro networks in economics will have a strong focus on international trade, investments and human mobility. Both parts will give you a brief overview on the literature, which predominantly adopted an econometric approach to the analysis of networks.
Statistical methods for data science
Provided by: UNIPI, SNS
Location: Polo Fibonacci, University of Pisa
Lecturers: Fabrizio LILLO, Salvatore RUGGIERI
Hours: 40
Semester: 2
Timetable: https://www.di.unipi.it/en/education/mds/timetable-mds
Educational Goals: The student who completes successfully the course will have a solid knowledge on the main concepts and tools of statistical analysis, including the definition of a statistical model, the inference of its parameters with confidence intervals, the use of hypothesis testing. and some basic knowledge of the statistics of linear time series. Finally the student will be able to use the language R for performing statistical analyses. The student will be able to understand the main concept of statistical analysis and to choose and apply the appropriate tool to the case under study. The student will also be able to use the language R for performing statistical analyses.
Prerequisites: Basic knowledge of calculus. Basic knowledge of probability might be useful even if not indispensable.
Programme: The program covers the basic methodologies, techniques and tools of statistical analysis. This includes basic knowledge of probability theory, random variables, convergence theorems, statistical models, estimation theory, and hypothesis testing. Other topics covered include bootstrap, expectation-maximization, and basic knowledge of time series analysis. Finally the program covers the use of the language R for statistical analysis.
Link: http://www.di.unipi.it/scripts/PD2/corso.php?lang=en&cds=WDS-LM&anno=201...
http://didawiki.cli.di.unipi.it/doku.php/mds/smd/start
Statistical Methods for Large, Complex Data
Provided by: S.ANNA
Lecturers: Francesca CHIAROMONTE
Hours: 10
Semester: 2
Educational Goals: Outline: Lecture 1: Computational Assessment of Statistical Procedures. Resampling (e.g. Jacknife, Boostrap), Cross-Validation schemes and their uses. Lecture 2: High Dimensional Supervised Problems. Linear and Generalized Linear Models (review, including traditional feature selection), Shrinkage and Sparsification (Ridge, LASSO and other developments). Lecture 3: Ultra-High Dimensional Supervised Problems. Feature Screening algorithms for linear and generalized linear models, and model-free. Lecture 4: Ultra-High Sample Sizes. Significance and Effect Sizes, Subsampling strategies for Big Data.
Stochastic processes and partial differential equations
Provided by: SNS
Lecturers: ?
Hours: 40
Stochastic Processes and Stochastic Calculus
Provided by: IMT
Lecturers: Irene CRIMALDI
Hours: 30
Educational Goals: This course aims at introducing some important stochastic processes and Ito stochastic calculus. Some proofs are sketched or omitted in order to have more time for examples, applications and exercises. In particular, the course deals with the following topics: - Markov chains (definitions and basic properties, classification of states, invariant measure, stationary distribution, some convergence results and applications, passage problems, random walks, urn models, introduction to the Markov chain Monte Carlo method), - conditional expectation and conditional variance, - martingales (definitions and basic properties, Burkholder transform, stopping theorem and some applications, predictable compensator and Doob decomposition, some convergence results, game theory, random walks, urn models), - Poisson process, Birth-Death processes, - Wiener process (definitions, some properties, Donsker theorem, Kolmogorov-Smirnov test) and Ito calculus (Ito stochastic integral, Ito processes and stochastic differential, Ito formula, stochastic differential equations, Ornstein-Uhlenbeck process, Geometric Brownian motion, Feynman-Kac representation formula).
Prerequisites: Matrix Algebra + Foundations of Probability and Statistical Inference
Strategic and competitive intelligence
Provided by: UNIPI
Location: Polo Fibonacci, University of Pisa
Lecturers: Antonelle MARTINI
Hours: 40
Semester: 1
Timetable: https://www.di.unipi.it/en/education/mds/timetable-mds
Educational Goals: CI programs have goals such as proactively detecting business opportunities or threats, eliminating or reducing blind- spots, risks and/or surprises; and reducing reaction time to competitor and marketplace changes. The end product of any worthwhile CI activity is what practitioners term ‘actionable intelligence’ – i.e. intelligence that management can act upon; perspective. It is more than analysing competitors: it is a process for gathering information, converting it into intelligence (about products, customers, competitors, and any aspect of the environment) and then using it in decision making. In this sense, big data brings big change to CI. The course is very interactive and includes also in-class seminars with experts on emerging topics, defined each year (i.e. patent analysis, due diligence, social network analysis for business). It provides many tools and techniques; HBS cases are used. Students will apply these tools in groups when analysing a preselected case company. They are expected to present early stage versions of their CI reports and, in the final workshop, they will present the results of their CI analysis, which is then discussed in plenary.
Studying society with big and digital data
Provided by: SNS
Location: Institute of Humanities and Social Sciences, Scuola Normale Superiore, Piazza degli Strozzi 1, Firenze
Lecturers: Elena PAVAN
Hours: 20
Semester: 2
Timetable: 2 may 2018 15:00 - 17.30
3 may 2018 15:00 - 17.30
9 may 2018 15:00 - 17.30
11 may 2018 10:00 - 12.30
16 may 2018 15:00 - 17.30
17 may 2018 10:00 - 12.30
22 may 2018 15:00 - 17.30
23 may 2018 15:00 - 17.30
Educational Goals: This introductory course aims to provide students with an overview on the nexus between big/digital data and the study of social and political dynamics. The course leans on a conceptualization of Big Data as a complex set of cultural, political and scientific knowledge practices that challenge the traditional modes in which research questions are posed and framed, analyses are performed, as well as the ways in which results are communicated to the public and thus affect public discourse and debates.
Programme: Consistently, it is articulated in 4 blocks, each dealing with a specific aspect of doing social research in and through big and
digital data:
Block 1. Big and Digital Data: an Epistemological Shift for Social Sciences?: this block provides an overview of the implications of having increasingly large-scale datasets available to study social and political dynamics. This block will also discuss big/digital data not only as a tool for social research but also as an object of study in its own right.
Block 2. Datafied Research: practices, implications, and ethical considerations: this block deals with the methodological steps that underpin social and political research activities through big/digital data – from the identification of adequate digital/datafied objects, to query design, data collection, organization and analysis, and the communication of research results. Particular attention will be given to ethical considerations that should imbue the research process throughout.
Block 3. Studying political participation through big and digital data: this block focuses specifically on the issue of political participation and how to study it through big and digital data. Topics included in this block comprise social movements and collective action dynamics, public opinion, electoral campaigning and fake news.
Block 4. Social media and digital objects scraping: this block discusses the meanings, the potentials, and the limitations of different digital objects – e.g., Facebook posts, tweets, websites, images, Wikipedia pages, etc. – for social and political research. Different objects will be discussed in connection with the affordances of the different platforms from which they originate.
Summer school on Data Science
Provided by: CNR
Lecturers: (vari) TBD
Semester: 2
Survey Methods
Provided by: UNIPI
Lecturers: Monica PRATESI (coord.)
Technologies for web marketing
Provided by: UNIPI
Location: Polo Fibonacci, University of Pisa
Lecturers: Salvatore RUGGIERI
Hours: 40
Semester: 2
Timetable: https://www.di.unipi.it/en/education/mds/timetable-mds
Educational Goals: The student who completes the course successfully will have a solid knowledge about information technologies for marketing decisions in the web, on how to market effectively, on how to being truly connected with customers, on how to improve the customer experience on a web site, on how to invest available resources, and on how measure success using web marketing technologies. The student will be able to understand and classify the large number of problems that arise in the application field of web marketing. The student will have be aware of the many privacy and legal issues related to web tracking, user profiling, and to the application of advertising, personalization and social media marketing strategies.
Prerequisites: Some knowledge of how the Internet as a network, and some Internet programming (HTML, Javascript). Students must be fluent in English (the course is part of a Master degree held in English).
Link: http://www.di.unipi.it/scripts/PD2/corso.php?lang=en&cds=WBI-LM&anno=201...
Text Analytics and Opinion Mining
Provided by: UNIPI, CNR
Location: Computer Science Department, Unversity of Pisa
Lecturers: Andrea ESULI
Hours: 20
Semester: 2
Timetable: TBA
Educational Goals: This module introduces the main methods of analysis and mining of opinions and personal evaluations for users based on Big Data generated on the web or other sources. Emphasis will be put on text mining method applied to text originated on social media. Lessons will be supported by case studies developed in the SoBigData.eu lab.
Link: http://masterbigdata.it/en/content/text-analytics-and-opinion-mining
Time Series Analysis
Provided by: SNS
Lecturers: ?
Hours: 40
Topics in Statistical Learning
Provided by: S.ANNA
Lecturers: Francesca CHIAROMONTE
Hours: 30
Semester: 2
Educational Goals: This course will introduce the students to a number of topics in contemporary Statistical Learning selected from the following list: (i) Unsupervised classification; Clustering methods. (ii) Unsupervised dimension reduction; Principal Components Analysis and related techniques. (iii) Supervised classification methods. (iv) Linear and generalized linear models. (v) Non-parametric regression methods. (vi) Resampling methods, Cross Validation, the Bootstrap and permutation-based techniques. (vii) Feature selection and penalized fitting techniques for (generalized) linear models (viii) Supervised dimension reduction; Sufficient Dimension Reduction and related techniques. Compared to traditional courses on regression, linear models and multivariate statistics, the focus will be on analyzing actual datasets of interest to the students through group projects. We will also attempt to utilize a so-called active learning approach, leveraging publicly available MOOC materials.
Prerequisites: A working knowledge of basic statistical inference procedures (point estimation, confidence intervals, testing) and regression modeling.
Visual analytics
Provided by: UNIPI, CNR
Location: Polo Fibonacci, University of Pisa
Lecturers: Salvatore RINZIVILLO
Hours: 20
Semester: 2
Timetable: https://www.di.unipi.it/en/education/mds/timetable-mds
Link: http://didawiki.cli.di.unipi.it/doku.php/magistraleinformaticaeconomia/v...
Web Mining
Provided by: UNIPI, CNR
Location: Computer Science Department, Unversity of Pisa
Lecturers: Raffaele PEREGO, Franco Maria NARDINI
Hours: 20
Semester: 2
Timetable: TBA
Educational Goals: This module presents how to analyse traces that users leave from querying Web search engines (query log). It presents the main applications of Web mining including: i) how to profile the interests/activities of users, ii) how to use information from query logs for forecasting social indicators and optimizing Web search engines. Teaching activities will be supported by several case studies developed in the SoBigData.eu laboratory.