Midterm Project Workflow

STAT 218 - Week 6, Lecture 1

February 12th, 2024

Midterm Project Guideline

The aim of the project

  • The overarching goal of the projects in this course is to empower life science students by cultivating a profound sense of agency.
  • This involves
    • actively engaging with the learning process,
    • making informed decisions, and
    • taking ownership of your academic journey.
  • Through the selection of your unique data set, you will have the opportunity to construct
    • a personalized case study,
    • formulate research questions,
    • employ diverse statistical procedures, and
    • effectively communicate your findings.

Important Deadlines

  • Your midterm project deadline is extended to Feb 29, but you can upload earlier if you’d like (please refresh your course webpage).

  • Important Deadlines:

    • Choosing a Dataset (due: Feb 18)
    • Determining Variables (due: Feb 20)
    • Meeting with Dr. Demirci at least once, preferably during office hours (by appointment between Feb 19- Feb 28)
    • Uploading the midterm project (due: Feb 29)

Workflow of This Week

Choosing Your Data Set for the Midterm Project - I

  • Midterm template and necessary files are available on Canvas as a separate module.

  • For the midterm project, you will select your own dataset from OpenIntro Data Sets or another suitable source.

  • If your group members share the same major, it’s recommended to choose a dataset related to your field of study.

Important

You should decide on your dataset this week and upload it to Canvas by Feb 18.

STEPS TO DECIDE YOUR DATA SET

Choosing Your Data Set for the Midterm Project - II

Follow these steps FOR THIS WEEK (DUE FEB 18).

If your data is coming from OpenIntro Data Sets:

library(tidyverse)
library(openintro)
data(datasetname) # Put your dataset name here
glimpse(datasetname) # Put your dataset name here

If your data is coming from anywhere else on the web:

  • Download your data as a CSV file.
  • Load your data as follows:
library(tidyverse)
library(openintro)
midtermdata <- read_csv("data-set-name.csv")

UPLOAD THIS PART OF YOUR PROJECT BY THE END OF THIS WEEK!

Choosing Your Variables


Consider the following criteria while selecting your data set:

  • Include at least 2 Numeric Variables.
  • Have at least 2 Categorical Variables with only 2 groups each.
  • Include at least 1 Categorical Variable with more than 3 groups.