Question assignments

Team member	TODOs
Mine Çetinkaya-Rundel
Elijah Meyer
Maria Tackett
Mine Dogucu
Matt Beckman
Andy Zieffler
Chelsey Legacy

Data analysis

Pilot data collected at Duke University during the Spring 2025 term.

Data

A total of 226 participants opted into our study, and all of them completed the assessment.

Missing observations

Below is the proportion of missing observations for each question. Questions with at least 0.8 percent (0.008) missing are highlighted in red.

question	pct_missing
covid_map_4	0.01
movie_budgets_1b	0.00
att_check	0.00
he_said_she_said_2	0.00
website_testing_1_3	0.00
time_spent	0.00
storm_paths	0.00
movie_budgets_1	0.00
movie_budgets_1c	0.00
shop	0.00
banana_conclusions	0.00
covid_map_1	0.00
covid_map_2	0.00
covid_map_3	0.00
covid_map_5	0.00
he_said_she_said_1	0.00
he_said_she_said_3	0.00
build_a_plot_1	0.00
build_a_plot_2	0.00
blocks_1	0.00
blocks_2	0.00
realty_tree_1	0.00
realty_tree_2	0.00
image_recognition1_1	0.00
data_confidentiali1_1	0.00
activity_journal_1_1	0.00
movie_wrangling_1_6	0.00
movie_wrangling_2_6	0.00
movie_wrangling_3_6	0.00
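
The proportions above can be computed by counting empty responses per question and dividing by the number of participants. The following is a minimal Python sketch, not the code behind this report: the `responses` records are invented, `None` is assumed to mark a missing answer, and the 0.008 cutoff is the 0.8 percent threshold expressed as a proportion.

```python
# Hypothetical responses: one dict per participant, None marks a missing answer.
responses = [
    {"covid_map_4": "FALSE", "shop": "Shop 1"},
    {"covid_map_4": None, "shop": "Shop 2"},
    {"covid_map_4": "TRUE", "shop": "Shop 1"},
    {"covid_map_4": "FALSE", "shop": "Shop 1"},
]

FLAG_THRESHOLD = 0.008  # 0.8 percent, expressed as a proportion


def proportion_missing(rows):
    """Return {question: share of rows whose answer is None}."""
    questions = rows[0].keys()
    return {
        q: sum(row[q] is None for row in rows) / len(rows)
        for q in questions
    }


pct_missing = proportion_missing(responses)
flagged = [q for q, p in pct_missing.items() if p >= FLAG_THRESHOLD]

# Print questions from most to least missing, marking the flagged ones.
for question, p in sorted(pct_missing.items(), key=lambda kv: -kv[1]):
    marker = " (flagged)" if question in flagged else ""
    print(f"{question}\t{p:.2f}{marker}")
```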

Percent correct - multiple choice

The table below shows the proportion of students who answered each question correctly.

Note: the correct answer to movie_wrangling_3_6 is 0, but 0 was not offered as an answer choice on the assessment. The question's pseudocode was:

START_WITH(the Movies table) then
    KEEP_ROWS_WHERE(the season value is Fall) then
    COUNT(the number of rows) WHERE(best_picture value is Yes)
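
Read as ordinary code, the pseudocode filters the table and then counts the matching rows. A rough Python equivalent is sketched below. The `movies` rows are invented for illustration (the assessment's actual Movies table is not reproduced here); only the column names `season` and `best_picture` come from the pseudocode.

```python
# Invented stand-in for the Movies table; not the assessment's data.
movies = [
    {"title": "A", "season": "Fall", "best_picture": "No"},
    {"title": "B", "season": "Summer", "best_picture": "Yes"},
    {"title": "C", "season": "Fall", "best_picture": "No"},
    {"title": "D", "season": "Spring", "best_picture": "No"},
]

# KEEP_ROWS_WHERE(the season value is Fall)
fall_movies = [row for row in movies if row["season"] == "Fall"]

# COUNT(the number of rows) WHERE(best_picture value is Yes)
n_best_picture = sum(row["best_picture"] == "Yes" for row in fall_movies)

print(n_best_picture)
```

With these invented rows, no Fall movie has `best_picture` equal to Yes, so the count is 0. This shows how 0 can be the correct answer to such a question.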

question	pct_correct
att_check	1.00
covid_map_1	0.96
build_a_plot_2	0.95
he_said_she_said_2	0.94
he_said_she_said_1	0.92
movie_wrangling_2_6	0.90
banana_conclusions	0.89
movie_budgets_1c	0.86
realty_tree_1	0.85
build_a_plot_1	0.82
storm_paths	0.75
movie_budgets_1	0.74
data_confidentiali1_1	0.63
image_recognition1_1	0.59
movie_budgets_1b	0.55
covid_map_2	0.43
covid_map_5	0.38
blocks_1	0.38
blocks_2	0.38
he_said_she_said_3	0.32
covid_map_3	0.28
movie_wrangling_1_6	0.24
shop	0.24
covid_map_4	0.21
website_testing_1_3	0.20
activity_journal_1_1	0.08
realty_tree_2	0.08
movie_wrangling_3_6	0.00
time_spent	0.00
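
One way to obtain these proportions is to compare each response with the answer key and average the matches per question. The sketch below assumes exact string matching and treats missing answers as incorrect. Both choices are assumptions, as is the sample data, and the report's actual scoring rules may differ.

```python
# Two key entries copied from the Key section; the responses are invented.
answer_key = {"covid_map_1": "South", "shop": "Shop 1"}

responses = [
    {"covid_map_1": "South", "shop": "Shop 2"},
    {"covid_map_1": "South", "shop": "Shop 1"},
    {"covid_map_1": "West", "shop": "Shop 2"},
    {"covid_map_1": "South", "shop": None},
]


def proportion_correct(rows, key):
    """Share of rows whose answer exactly matches the key, per question."""
    return {
        question: sum(row.get(question) == correct for row in rows) / len(rows)
        for question, correct in key.items()
    }


pct_correct = proportion_correct(responses, answer_key)

# List questions from easiest to hardest, as in the table above.
for question, p in sorted(pct_correct.items(), key=lambda kv: -kv[1]):
    print(f"{question}\t{p:.2f}")
```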

Time spent

Here are the most common words in student responses to the question “How much time did you spend on this assessment?”

# A tibble: 10 × 2
   word              n
   <chr>         <int>
 1 30               77
 2 45               39
 3 40               38
 4 20               18
 5 35               16
 6 25               14
 7 hour             12
 8 50                9
 9 approximately     7
10 time              7
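
Counts like these come from splitting each free-text answer into lowercase words and tallying them. The sketch below does this with Python's standard library. The sample answers and the small stop-word list are invented, and the report's own tokenization and stop-word handling may differ.

```python
import re
from collections import Counter

# Invented free-text answers, for illustration only.
answers = [
    "About 30 minutes",
    "30 min",
    "approximately 45 minutes",
    "1 hour",
    "45",
]

# Assumed stop words; the real analysis may use a different list.
STOP_WORDS = {"about", "minutes", "min"}


def word_counts(texts, stop_words):
    """Count lowercase alphanumeric tokens across texts, skipping stop words."""
    counts = Counter()
    for text in texts:
        tokens = re.findall(r"[a-z0-9]+", text.lower())
        counts.update(t for t in tokens if t not in stop_words)
    return counts


counts = word_counts(answers, STOP_WORDS)

# Show the most frequent words first.
for word, n in counts.most_common(3):
    print(f"{word}\t{n}")
```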

Key

question	correct
storm_paths	City a
movie_budgets_1	Plot C
movie_budgets_1b	Plot A
movie_budgets_1c	Plot C
shop	Shop 1
banana_conclusions	The headline makes a causal claim, but only a relationship can be concluded from this analysis.
covid_map_1	South
covid_map_2	CA
covid_map_3	FALSE
covid_map_4	FALSE
att_check	Option 2
covid_map_5	No, we can not conclude a difference in total number of COVID cases. There is not enough information given on this graph to make the comparison.
he_said_she_said_1	FALSE
he_said_she_said_2	TRUE
he_said_she_said_3	Need additional information to determine this
build_a_plot_1	FALSE
build_a_plot_2	FALSE
blocks_1	E
blocks_2	False negative rate is larger
realty_tree_1	$151,424
realty_tree_2	$501,876
website_testing_1_3	There is evidence that the red version of the website will generate more clicks on the “Store” link if there are 50 users visiting the website.
image_recognition1_1	Professors in the photos from the sciences are primarily white and male in lab coats, which is not representative of science professors today.
data_confidentiali1_1	Student’s class year
activity_journal_1_1	One of the columns will be exercise type; a categorical variable with levels such as Pilates, Weights, Walk, etc.
movie_wrangling_1_6	Use if-else statements
movie_wrangling_2_6	2
movie_wrangling_3_6	0

ds assessment report
