Week 1

Welcome to GES668

weeks
Published

August 28, 2024

Summary
Review the course syllabus and structure. Get started with spatial data using {sf} and the tidyverse.

Overview

Welcome to Building Spatial Datasets!

Our first session starts with a general introduction to the course and a quick introduction to using R for working with spatial data.

Key Objectives

  • Provide an overview of the scope and schedule of the course
  • Introduce the concept of reproducibility and key reproducible practices for data management, programming, collaborating with colleagues, organizing projects, and tracking work
  • Configure RStudio and create a RStudio project
  • Introduce the concept of simple features and how the concept is implemented by the {sf} package
  • Create a Quarto document to practice using the {tidyverse} and {sf} package

Key Skills

  • Creating self-contained projects with RStudio
  • Installing packages with install.packages()
  • Loading packages with library()
  • Using the ? operator to look up function documentation
  • Using str() or View() to look at an object
  • Using an assignment operator (<- or =) to save an output to a new object
  • Creating and editing Quarto documents with RStudio
  • Using sf::st_read() to read a sf object from a file path or URL

Prepare

Install R and RStudio on your computer. If you already have R installed, please check to make sure your installation is up to date. For more details, check out the Prerequisites section from R for Data Science (one of a few books we’ll use in our class).

Install Git on your computer. The easiest option is to install the GitHub Desktop application (which installs Git at the same time). Alternatively, Happy Git and GitHub for the useR details a few different ways to install Git on Windows, OSX, or Linux.

Sign up for a GitHub user account. I use GitHub to distribute assignments and you will use GitHub to submit assignments. GitHub makes it easier to use Git for version control and is the most popular platform for collaborative development work. People use GitHub for everything from writing R packages to textbooks!

Join our course Discord. I use Discord to share class announcements and encourage students to use the Discord to share links and questions throughout the semester.

Complete the student survey. This is how you share your GitHub username. I’d also love to know more about what experiences you’re bringing to the class and what you hope to learn.

If you have not done any programming before, I’d recommend taking some time to review The Very Basics and Packages and Help Pages (section 2 and 3 from Hands-On Programming with R).

Required readings

  • Ch. 1 Introduction in Robin Lovelace, Jakub Nowosad, and Jannes Muenchow Geocomputation with R, 2nd ed. (CRC Press, 2025), https://r.geocompx.org.
  • Ch. 2 Geographic data in R in Lovelace, Nowosad, and Muenchow Geocomputation with R.
  • Chris Brunsdon and Alexis Comber “Opening Practice: Supporting Reproducibility and Critical Spatial Data Science,” Journal of Geographical Systems 23, no. 4 (October 1, 2021): 477–496, doi:10.1007/s10109-020-00334-2.
  • Greg Wilson et al. “Good Enough Practices in Scientific Computing,” PLOS Computational Biology 13, no. 6 (June 22, 2017): e1005510, doi:10.1371/journal.pcbi.1005510.
  • Ch. 3 Workflow: basics in Hadley Wickham, Garrett Grolemund, and Mine Çetinkaya-Rundel R for Data Science: Import, Tidy, Transform, Visualize, and Model Data, 2nd edition. (Sebastopol, CA: O’Reilly Media, 2023), https://r4ds.hadley.nz/.

Optional readings

Additional readings on working with spatial data in R:

Additional readings on reproducible practices:

Participate

🖥️ Welcome to GES 668

🖥️ Introduction to Spatial Data with R

Practice

🛠️️ Exercise 01