Course overview
General information
- Course: Markup languages and reproducible programming in statistics (202000010)
- Course coordinator: Pablo Mosteiro (p.mosteiro@uu.nl)
- Teachers: Pablo Mosteiro (p.mosteiro@uu.nl) & Hanne Oberman (h.i.oberman@uu.nl)
- Schedule: MyTimetable
Course description
This course gives an overview of the state-of-the-art in statistical markup, reproducible programming and scientific digital representation. Students will get to know the professional field of statistical markup and its innovations and challenges. It consists of meetings in which students will learn about markup languages (LaTeX and Markdown), learn efficient programming with R Markdown, experience developing Shiny web apps, get to know version control with Git and will create and maintain their own data archive repository and personal (business card) page through GitHub. Combining these lectures, the students get acquainted with different viewpoints on marking up statistical manuscripts, areas of innovation, and challenges that people face when working with, analyzing and reporting (simulated) data. Knowledge obtained from this course will help students face multidimensional problems during their professional career.
Support
We will not use Brightspace. Instead, all course materials can be found here on the course website, https://mlarpis.github.io/markup.
- For handing in assignments and deliverables, you can fork this template repository following the flow outlined in What to hand in.
- If you regularly update your fork, we can keep track of your progress.
- Please post your content-related questions as an issue in the course repository.
Questions regarding personal circumstances may be shared through e-mail with the teaching team. Please don’t use e-mail for other questions unless it cannot be avoided - we’d prefer if you ask us in class or open an issue in the course repo.
Syllabus
This course website serves as syllabus. Background information, e.g. about course aims, can be found on Osiris
Teaching methods
The learning activities in this course include lectures/seminars, supervised labs with exercises/tutorials, and throughout the course students will work on a portfolio with deliverables.
There are six topics discussed in this course (see the course landing page). Each topic includes a lecture with tutorial elements (or a seminar) and one or more exercises. Some topics have an associated deliverable (graded assignment).
For example, for the topic of reproducible research, the lecture slides discuss the definition of reproducibility, what reproducibility can look like in the context of statistical research, and the requirements for reproducible research compendiums. The exercise is a tutorial on developing research compendiums with R and Quarto. And the deliverable is a complete and reproducible research compendium for a statistical research project.
Lectures
Depending on the topic, the lecture contains some interactive elements, or will have a showcase/walkthrough of a development workflow (e.g. R package development). The lectures contribute to learning objective a: ‘Students develop fundamental knowledge and understanding in the state of the art in statistical markup languages and reproducible programming and development’ (see the Osiris course text).
Exercises
The exercises associated with each topic allow students to gain experience with the skills in this course. The exercises are not assessed summatively (i.e., with a grade), but they may provide formative feedback to the students (i.e., self-test). After completing the exercises, students are equipped to start working on the course deliverables (graded assignments). The exercises are aimed for students to practice learning objectives c and d: ‘They can efficiently organize a reproducible programming and development process’ and ‘They can produce repositories up to the standards of international programming and coding conventions and initiatives’ (see the Osiris course text).
Deliverables
The deliverables are graded assignments, collected in a portfolio. Students start working on the deliverables during the course, and can request feedback from the instructors and their peers. An overview of the deliverables is given in the section Assessment and feedback. The deliverables are handed in after the course ends, collected in a zip folder with their complete portfolio. During the process of working on deliverables, students will progress towards learning objectives b and e: ‘They can determine the most effective markup strategies to address a typesetting problem’ and ‘They can produce publications up to the typesetting standards of international peer- reviewed journals’ (see the Osiris course text). The requirements for the portfolio can be found under Assessment and feedback.
Procedure
The lectures in this course have mandatory attendance. After each lecture, students make exercises, which are handed in week-by-week. Throughout the course, starting after some of the topics, students work on deliverables, which are handed in in a portfolio after the course ends. Students combine all exercises and deliverables into a portfolio, which is handed in after course ends. Students are allowed to make use of generative AI in this course, see the GenAI rules.