Reviews
- SLR: structured/systematic literature review
- SMS: structured mapping study. More output, it analyzes the quality of the papers
- SR:
It is recommended to report the review methodology in a flowchart

SMS map chart

SMS assesses the quantity of the papers but does not tells too much about the quality. That’s what an SLR is helpful for
Meta-analysis: rare in SE, common in medicine
- analyzing other reviews to spot trends
CASP (Critical Appraisals Skill Program) is a checklist to assess rigor, credibility, and relevance in empirical research
Do not use RegEx in query strings in papers (it is not reliable)
https://dl.acm.org/search/advanced : select “the ACM guide to computing literature” (it will fetch material even from Springer and other relevant sources)
MSR (Mining software repository) Analyzing data stored in software repositories
- mining: extracting knowledge (meaningful patterns, trends)
- combine SE and data science to get relevant information
Types of repo
- version control systems
- issue trackers (Jira, Bugzilla)
- code review systems (Gerrit, Phabricator)
- communication logs (mailing lists, Slack)
- knowledge websites
Mining tools:
- PyDriller
- repositories APIs
Clean your data that can skew the results
Replicability
- you store your results (so everyone can replicate your analysis on the dataset)
- you define commits intervals so that everyone can fetch the same repo at the same state
Exercise
- a proposal of MSR
- define data source to mine
- clarify inclusion criteria
- describe how would you analyze the data
- discuss threats and limitations
- describe how would you present the results