ds assessment report

Question assignments

Team member TODOs
Mine Çetinkaya-Rundel
Elijah Meyer
Maria Tackett
Mine Dogucu
Matt Beckman
Andy Zieffler
Chelsey Legacy

Data analysis

Pilot data from the Spring 2025 academic calendar from Duke University

Data

There are 226 participants that opted into our study. All who opted in finished.

Below are the percentages of missing observations by question. Questions that have at least 0.8 percent missing are highlighted in red.

question pct_missing
covid_map_4 0.01
movie_budgets_1b 0.00
att_check 0.00
he_said_she_said_2 0.00
website_testing_1_3 0.00
time_spent 0.00
storm_paths 0.00
movie_budgets_1 0.00
movie_budgets_1c 0.00
shop 0.00
banana_conclusions 0.00
covid_map_1 0.00
covid_map_2 0.00
covid_map_3 0.00
covid_map_5 0.00
he_said_she_said_1 0.00
he_said_she_said_3 0.00
build_a_plot_1 0.00
build_a_plot_2 0.00
blocks_1 0.00
blocks_2 0.00
realty_tree_1 0.00
realty_tree_2 0.00
image_recognition1_1 0.00
data_confidentiali1_1 0.00
activity_journal_1_1 0.00
movie_wrangling_1_6 0.00
movie_wrangling_2_6 0.00
movie_wrangling_3_6 0.00

Below is a table that shares the percentage of correct responses across students.

Note: movie_wrangling_3_6 should be answer 0, but answer 0 was not presented on assessment.

START_WITH(the Movies table) then

   KEEP_ROWS_WHERE(the season value is Fall) then

    COUNT(the number of rows) WHERE( best_picture value is Yes)
question pct_correct
att_check 1.00
covid_map_1 0.96
build_a_plot_2 0.95
he_said_she_said_2 0.94
he_said_she_said_1 0.92
movie_wrangling_2_6 0.90
banana_conclusions 0.89
movie_budgets_1c 0.86
realty_tree_1 0.85
build_a_plot_1 0.82
storm_paths 0.75
movie_budgets_1 0.74
data_confidentiali1_1 0.63
image_recognition1_1 0.59
movie_budgets_1b 0.55
covid_map_2 0.43
covid_map_5 0.38
blocks_1 0.38
blocks_2 0.38
he_said_she_said_3 0.32
covid_map_3 0.28
movie_wrangling_1_6 0.24
shop 0.24
covid_map_4 0.21
website_testing_1_3 0.20
activity_journal_1_1 0.08
realty_tree_2 0.08
movie_wrangling_3_6 0.00
time_spent 0.00

Time spent

Here are common student words for the question “How much time did you spend on this assessment?”

# A tibble: 10 × 2
   word              n
   <chr>         <int>
 1 30               77
 2 45               39
 3 40               38
 4 20               18
 5 35               16
 6 25               14
 7 hour             12
 8 50                9
 9 approximately     7
10 time              7

Key

question correct
storm_paths City a
movie_budgets_1 Plot C
movie_budgets_1b Plot A
movie_budgets_1c Plot C
shop Shop 1
banana_conclusions The headline makes a causal claim, but only a relationship can be concluded from this analysis.
covid_map_1 South
covid_map_2 CA
covid_map_3 FALSE
covid_map_4 FALSE
att_check Option 2
covid_map_5 No, we can not conclude a difference in total number of COVID cases. There is not enough information given on this graph to make the comparison.
he_said_she_said_1 FALSE
he_said_she_said_2 TRUE
he_said_she_said_3 Need additional information to determine this
build_a_plot_1 FALSE
build_a_plot_2 FALSE
blocks_1 E
blocks_2 False negative rate is larger
realty_tree_1 $151,424
realty_tree_2 $501,876
website_testing_1_3 There is evidence that the red version of the website will generate more clicks on the “Store” link if there are 50 users visiting the website.
image_recognition1_1 professory in the photos from the sciences are primarily white and male in lab coats, which is not representative of science professory today.
data_confidentiali1_1 Student’s class year
activity_journal_1_1 One of the columns will be exercise type; a categorical variable with levels such as Pilates, Weights, Walk, etc.
movie_wrangling_1_6 Use if-else statements
movie_wrangling_2_6 2
movie_wrangling_3_6 0

Distribution of responses

plot

table