Soheil Jamshidi
Empirical analysis of defects in R Markdown.
Rel. Marco Torchiano. Politecnico di Torino, Corso di laurea magistrale in Ingegneria Informatica (Computer Engineering), 2026
|
Preview |
PDF (Tesi_di_laurea)
- Tesi
Licenza: Creative Commons Attribution Non-commercial No Derivatives. Download (1MB) | Preview |
Abstract
Executable documents such as R Markdown combine narrative text, executable code, and rendered outputs in a single source artifact. This approach supports reproducible research and repeatable reporting, but it also creates a wider failure surface than conventional scripts: defects may arise from code logic, data handling, chunk options and execution order, YAML metadata, package dependencies, environment configuration, and the rendering/conversion toolchain. Despite the popularity of R Markdown in data analysis and scientific communication, there is limited empirical evidence on which defect types dominate when code and documents co-evolve. This thesis provides an empirical analysis of defects in R and R Markdown projects by mining GitHub repositories and classifying bug-fix commits using an executable-document–aware taxonomy.
Candidate bug-fix commits are identified through keyword-based heuristics applied to commit history
Relatori
Anno Accademico
Tipo di pubblicazione
Numero di pagine
Corso di laurea
Classe di laurea
URI
![]() |
Modifica (riservato agli operatori) |
