Following growing awareness of replication concerns in research and the rise of Open Research (OR), psychologists have begun investigating what contributes to Questionable Research Practices (QRPs) amongst academics but less so amongst students. Undergraduates represent the next generation of psychologists, and recent changes in the undergraduate psychology curricula across UK universities have enhanced and modernised the provision of research methods and statistics teaching. Still, students continue to struggle with statistics, and how students perceive OR practices may be associated with their experiences learning statistics. In this two-study UK-wide project, we employed a series of online questionnaires to investigate the relationships between various factors including statistics understanding and attitudes and undergraduate students’ perceptions of OR and QRPs. In Study 1 (N = 267), we found that, in final-year psychology students, awareness of OR was related to perceptions of QRPs as well as statistics confidence and grades. In Study 2 (N = 695), in psychology students across all years, we found that attitudes toward statistics were associated with statistics confidence, competence, and attitudes toward learning about OR. We also found that statistics interpretation aptitude and statistics attitudes predicted perceptions of OR. Our findings have important implications for the teaching of OR, science, and statistics, and for incorporating OR into higher education curricula.
Following the report by the Open Science Collaboration (OSC, 2015), in which only 36% of studies were successfully reproduced and often with effect sizes three times smaller than the original studies, awareness of a ‘replication crisis’ has grown within psychology, as well as other fields including economics, political sciences, computer sciences, medical sciences, and even cancer biology (Cockburn et al., 2020; Coiera et al., 2018; Dreber & Johannesson, 2019; Errington et al., 2021; Freese & Peterson, 2017; Kovyliaeva, 2019). Consistent failures to replicate have given rise to concern about the reliability and transparency of research, with some suggesting the need for a ‘scientific revolution’ to combat these issues (Bartling & Friesike, 2014; Munafò et al., 2017; Wiggins & Chrisopherson, 2019).
The bias inherent in the publishing process is one factor that may contribute to the lack of replication of research. Negative aspects of academic culture, such as the ‘publish or perish’ mentality among researchers and a focus on ‘quantity over quality,’ hinder advancements of scientific knowledge (Doyle & Cuthill, 2015; Pennington, 2023; Smaldino & McElreath, 2016). Journals are more inclined to publish novel and positive or statistically significant findings, creating a ‘culture of significance’ (referred to as ‘publication bias’; Bakker et al., 2012; Easterbrook et al., 1991; Smaldino & McElreath, 2016), and this may exacerbate the ‘file drawer effect,’ where nonsignificant results are left unpublished and/or undocumented and therefore do not contribute to the scientific literature (Franco et al., 2014; Rothstein et al., 2006).
Another driver of replication failures may be the use of Questionable Research Practices (QRPs): behaviours that allow researchers to intentionally or unintentionally exploit the ‘degrees of freedom’ they have when conducting research and engage in confirmation bias (Banks et al., 2016; Dreber & Johannesson, 2019). For example, both p-hacking (massaging data until finding a significant result) and HARKing (Hypothesising After Results are Known; Kerr, 1998) are considered QRPs because they involve selectively reporting data in order to influence perceptions of publishability or to provide a statistically significant outcome (Andrade, 2021; Baker, 2016). Meta-research suggests that QRPs are commonly employed by researchers and academics; however, due to their historical ethical ambiguity, they have not always been recognised as problematic (John et al., 2012). Some researchers have argued that QRPs are defensible in particular instances and that more common QRPs, such as p-hacking and not reporting all outcome variables, are seen as more justifiable by many academics (John et al., 2012; Moran et al., 2022), although these perceptions are changing (see Pennington, 2023).
As a response to the replication crisis, new practices to improve scientific rigour, reproducibility, and transparency are being integrated into scientific processes. Open research (OR) practices, such as study preregistration, require researchers to outline their hypotheses, methods, and analyses in advance of data collection and/or analysis (Christensen et al., 2020; Parsons et al., 2022) to facilitate transparency and reduce researcher degrees of freedom during data analysis. Additional practices including the implementation of Registered Reports and open data, code, and materials (Chambers & Tzavella, 2022; Crüwell et al., 2018) also facilitate reproducibility and make research processes more robust. Collectively, employing OR practices can enrich science by ensuring that research is more transparent and accountable (Gigerenzer, 2018; see Korbmacher et al., 2023 for a review).
With the rapid development and adoption of new OR methods, some researchers have begun to investigate awareness, use, and perceptions of these practices, with much of the focus on academics and established researchers (Janke et al., 2019; Norris et al., 2022; Pennington et al., 2024). However, there has also been some work exploring perceptions of OR and QRPs within student populations (e.g., Krishna & Peter, 2018; Kvetnaya et al., 2019; Pownall, Pennington, et al., 2023). Undergraduate students have varied views and understandings of OR contingent on their research training; they also represent an important population to study as they will become the researchers of the future and are first exposed to research during their undergraduate degrees.
Like with established academics, QRPs are prevalent amongst undergraduate students as well. 40-90% of students report having engaged in QRPs or witnessed others using them (Krishna & Peter, 2018; Kvetnaya et al., 2019; Moran et al., 2022), with p-hacking and selectively reporting variables being especially common. Students’ motivations for using QRPs, however, may be quite different from those of academics: most students are not influenced by pressure to publish to the same extent as early-career and established researchers and have limited research experience (Bruton et al., 2020). Instead, students may be driven to use QRPs because of their perceptions of how significant results may impact their grades, their understanding of and attitudes toward statistics, and their approach to research projects in terms of motivations and stress associated with the work. Additionally, students may be particularly influenced by their lecturers’ views on OR and QRPs and how these ideas are presented in the curriculum (Krishna & Peter, 2018).
Although the issues associated with QRPs are common to many disciplines, psychology students in the UK are a useful population to study because, unlike other social science students, they conduct an empirical project to complete their degree programme, as required by the British Psychological Society. The high prevalence of QRPs among students suggests a need for comprehensive teaching of the replication crisis and OR within psychology programmes (Pennington, 2023), but OR practices are not yet a mandatory component of the undergraduate curriculum, and there are differing views on the importance of including OR in undergraduate teaching (Thibault et al., 2024). Therefore, the extent to which students at different (UK) universities are exposed to OR principles is likely to vary according to the perspectives of their lecturers and may play a large role in influencing their own proclivity for QRPs.
In universities in which OR is taught, it is typically incorporated into the teaching of research methods and statistics (Kvetnaya et al., 2019), and research suggests there is also a link between QRPs and understanding of statistics (e.g., Colling & Szűcs, 2021), as there are ‘delusions’ about statistics believed by both academic lecturers and students in psychology (Gigerenzer, 2018). For example, the ‘illusion of certainty’ is the belief that statistical significance alone can provide evidence that an effect exists, and this may contribute to the ‘culture of significance’ within psychology. Some researchers also incorrectly believe that p-values indicate the likelihood of replication, such that a p-value of 0.05 would have a 95% chance of successful replication. Such misguided ideas are pervasive, with 39% of professors, 20% of teachers of statistics classes, and 66% of undergraduates believing one or more of these statistical ‘delusions’ (Gigerenzer, 2018). Better understanding of different modes of statistical inference and how this relates to scientific inference is essential for quality research (Colling and Szűcs, 2021) and may play a protective role against the use of QRPs.
With so many misunderstandings about statistics even amongst seasoned researchers, it is perhaps not surprising that psychology students often find statistics a complex and difficult subject (Coetzee & Merwe, 2010) and consider statistics their most anxiety-provoking module (Chew & Dillon, 2014). Many students experience what is called ‘statistics anxiety’ (a lasting feeling of anxiety towards statistics which causes these individuals to always feel anxious when doing statistics or maths) at some point during their degree (Onwuegbuzie & Wilson, 2003), and statistics anxiety has been linked to increased procrastination among students and a reduction in their academic buoyancy and performance (Dunn, 2014; Onwuegbuzie, 2004; Paechter et al., 2017; Putwain & Daly, 2013). This anxiety may present as a barrier for students who are in the process of learning statistics by reducing their ability to grasp key concepts (Slootmaeckers et al., 2014) and may interfere with their understanding of scientific inference and thus their views toward OR and QRPs.
Researchers have previously explored relationships between the understanding and perceptions of OR and QRPS (e.g., Gilligan-Lee et al., under review; Krishna & Peter, 2018; Kvetnaya et al., 2019), and, separately, statistics anxiety and attitudes (e.g., Baloğlu, 2007; Chew & Dillon, 2014; Nesbit & Bourne, 2018; Perepiczka et al., 2011); however, no studies to date have assessed the relationships between perceptions of OR and perceptions of statistics in undergraduate students. It is also unclear how individual differences in statistics anxiety and understanding may interact with differences in perceptions of OR and QRPs and what implications this has for teaching statistics and OR at the undergraduate level and beyond.
In the current study, we explored the relationship between student perceptions of OR and their knowledge of, confidence with, and anxieties toward statistics in two separate samples of UK undergraduate psychology students. This project was divided into two parts: Study 1 was an exploratory investigation of data collected from final-year psychology undergraduate students at ten UK universities to examine the relationships between perceptions of OR and QRPs and statistics understanding and confidence; we also considered the influence of related factors such as their overall performance, experience with the project in terms of their supervisor and stress levels, and individual motivations. Study 2 was preregistered and conducted in collaboration with another project (STORM; Gilligan-Lee et al., under review). Here, we expanded upon Study 1 with a conceptual replication of key findings, the addition of measures related to statistics anxiety, and the inclusion of first- and second-year undergraduate students. We hypothesised that OR and QRP perceptions would be negatively associated with statistics anxiety and positively associated with statistics attitudes, knowledge, confidence, and ability to interpret statistical findings, as well as student grades.
Study 1
All materials and data are available on the Open Science Framework: https://osf.io/ume9k/.
Method
Participants
Participants were 267 final-year undergraduate psychology students (232 women, 34 men, and 1 nonbinary) with a mean age of 21.63 years (SD = 4.0) from ten UK universities. Participants were recruited through their universities’ student participant pools or by emails from their programme leaders advertising the study. All participants provided informed consent and received participant pool credit or a chance to win a £20 Amazon voucher. The study was approved by the ethics committees at University of the West of England and University of Cambridge.
Procedure
Participants followed a link to the survey on Qualtrics. They completed basic demographic questions and then answered a series of questionnaires, which were presented in a randomised order. The survey took approximately 15 minutes to complete, and the data were collected between December 2019 and January 2020.
Measures
We collected data on student awareness of OR, student self and lecturer perceptions of QRPs, statistics knowledge and confidence, student statistics and overall grades, student endorsement of QRPs, motivations, general and project-related stress in final year, opinions on significant findings in the final year project, and the quality of their supervisor relationship. The full list of questionnaires is available here: https://osf.io/4qpwb, and details of the measures are described below:
Self and Lecturer Perceptions of QRPs. These questionnaires were adapted from Krishna and Peter (2018) to measure participants’ perceptions and their assumptions about their lecturer’s perceptions of QRPs. Self and Lecturer Perceptions were measured by participants indicating whether they perceived various research practices as “sensible” or “problematic”. From the total of 15 research practices used by Krishna and Peter (2018), such as “Falsifying data” and “Reporting effect sizes”, a subset of ten were utilised, excluding the positive or neutral practices since these were not included in the analysis of the original study. One question pertaining to claiming that results were unaffected by demographic variables was unintentionally omitted. Scores were reversed and summed to create a total Self-Perception score and a total Lecturer-Perception score (each ranging from 10 to 70), with higher scores indicating greater endorsement of QRPs as sensible.
OR Awareness. The OR Awareness questionnaire was also adapted from Krishna and Peter (2018) and consisted of a series of 12 concepts related to OR (e.g., “Replication Crisis.”). Participants indicated whether they recalled learning the specific OR principle during their course by ticking a box or leaving it unticked. Responses were summed to create a total OR Awareness score ranging from 0-12, with higher scores indicating higher awareness of OR concepts.
Statistics Knowledge and Confidence. Participants answered six multiple-choice questions designed by the researchers to assess Statistics Knowledge and Statistics Confidence (e.g., “What does a small p-value indicate? Select all answers that you believe to be TRUE.”). Participants chose one or more options from a number of answers (option choices ranging from three to nine). A Statistics Knowledge score was computed as follows: for each answer choice, the response (or lack of response) was coded as correct or incorrect (correct = correctly ticked or correctly left unticked; incorrect = incorrectly ticked or incorrectly left empty). Each question had a different number of options so, to weight all questions equally, an average mark was calculated for each question (as percentage correct) which was then averaged to create a total knowledge score (ranging from 0 to 1), where higher scores indicated higher levels of statistical knowledge. After each knowledge question, participants were also asked to rate their confidence in their response on a 5-point Likert scale (1 = “very unconfident”, 5 = “very confident”). A Statistics Confidence score was computed by summing participant answers to each question (ranging from scores of 6 to 30), with higher scores indicating higher levels of participants’ confidence in their own answers.
Average Grades, Statistics Grades, and Related Information. Participants were asked questions about their Statistics Marks in their compulsory statistics modules and their Overall/Average Marks in their first and second years by indicating the range most appropriate for their marks (e.g., “40-49%”, “80-100%”; note that on the UK grading scale, a 40% is a pass mark or ‘D’, 50-59% is a ‘C’, 60-69% is a ‘B’, 70-79% is an ‘A’, and 80+% is considered exceptional) in their statistics modules and all modules overall. They were also asked whether they had taken any optional statistics modules; a positive response to this question prompted a question about their marks in these modules or additional statistics training. They were also asked about the methodology of their dissertation project (quantitative, qualitative, or mixed methods).
QRP Endorsement. Four scenarios involving instances of QRPs (data peeking, HARKing, p-hacking, and selective reporting) were displayed to participants to assess their Endorsement of QRPs. An example scenario is as follows: “Your ethics approval has taken longer than expected and you only have a limited amount of time for data collection before the final hand in date for your 3rd-year research project. Therefore, you decide to check data while they are being collected and terminate the data collection as soon as a significant result is found. How would you best categorise this decision?” Participants rated their views on a 7-point Likert scale (1 = “extremely inappropriate”, 7 = “extremely appropriate”) which was then summed to create a total QRP Endorsement score (ranging from 4 to 28), with higher scores indicating greater QRP endorsement.
Student Motivations. The Development-Demonstration Achievement Goal Questionnaire (Korn & Elliott, 2016) was used to assess participants’ motivations about their projects. Participants answered 12 statements about their Motivations for their final-year projects and rated them on a 7-point Likert scale (1 = “not at all true of me”, 7 = “Extremely true of me”). The items included 6 positive motivations, (e.g., “My focus is to develop my knowledge”), and 6 negative motivations, (e.g., “My aim is to avoid showing incompetence”). These were then summed to compute two total motivation scores (Positive Motivation and Negative Motivation), both ranging from 6 to 42. Higher Positive Motivation scores indicate stronger motivations to develop and demonstrate knowledge and competence, whereas higher Negative Motivation scores indicate stronger motivations to avoid demonstrating a lack of knowledge or competence.
General Stress and Project Stress. The General Stress questionnaire consisted of 12 items adapted from stress-related questions on the NEO-PI-R (Costa & McCrae, 2008), Barratt Impulsiveness Scale (BIS-11; Patton et al., 1995), and California Psychological Inventory (Boer et al., 2008) scales. This scale measured participants’ overall levels of stress experienced during their final year (e.g., “get stressed out easily”). Participants responded using a 7-point Likert scale (1 = “strongly applies to me”, 7 = “strongly does not apply to me”). Negative statements were reverse scored, and scores were summed to create a total General Stress score (ranging from 12 to 84), with higher scores indicating higher levels of stress. Project Stress assessed participants’ perceived stress relating to their project on a 5-point Likert scale (1 = “very stressful”, 5 = “not stressful at all”, then reverse scored). This involved four statements, (e.g. “Choosing the most appropriate statistical analysis method to test your hypothesis”), from which a total Project Stress score was then computed (ranging from 4 to 20) with higher scores indicating greater stress associated with aspects of the project work such as statistical decisions.
Hoping for Significance. This scale assessed to what degree participants believed that their project success was dependent upon finding significant results. Participants were presented with 5 statements, (e.g., “If my results are significant, I will get a higher mark (as write-up will be easier/study is stronger”), which they rated on a 7-point Likert scale (1 = “strongly agree”, 7 = “strongly disagree”, then reverse scored). A total Hoping for Significance score was computed from participants’ answers (ranging from 5 to 35), with higher scores indicating stronger desire to obtain significant results.
Supervisor Relationship. Participants rated their perceived Student-Supervisor Relationship by indicating their agreement with 8 statements, e.g. “I am satisfied with the support provided by my supervisor”, on a 7-point Likert scale (1 = “completely disagree”, 7 = “completely agree”, then reverse scored), to create a total Supervisor Relationship score ranging from 8 to 56, with higher scores indicating a better student/supervisor relationship. Data was also collected about whether they had regular meetings with their supervisor, about their supervisor’s academic rank, and if they had received additional supervision, and if so, by whom.
Results
Data Exclusions
The total number of survey entries was 329. Of these, 13 were test entries, 15 entries contained no data, 7 did not report a valid university name, and 27 entered demographic information but did not complete any full questionnaires. These 62 entries were excluded from all analyses, and the number of remaining participants was 267. Participants who did not complete a given questionnaire were excluded from the analyses for that questionnaire but included for questionnaires they did complete, and no data from partially completed questionnaires were included. Because the questionnaire order was randomised for each participant, slightly different numbers of participants completed each questionnaire.
Internal Reliability
Internal reliability of the scales used in the survey (excluding Statistics Marks, Average Grades, and OR Awareness, which consisted of only binary response items) was investigated. Since Cronbach’s alpha is best used to assess multi-item rating scales pertaining to a particular construct (Vaske et al., 2017), the internal reliability of Statistics Knowledge was not assessed since this scale does not measure a construct and instead gives a score similar to a percentage on an exam. All questionnaires met the criteria for high internal reliability (i.e. Cronbach’s α greater than 0.7, Peterson, 1994) except for Hoping for Significance at 0.58, but this was still included in the analyses.
Variable Cronbach's | α |
Hoping for Significance | 0.58 |
QRP Endorsement | 0.74 |
Lecturer Perceptions | 0.75 |
Self Perceptions | 0.76 |
General Stress | 0.77 |
Project Stress | 0.80 |
Statistics Confidence | 0.80 |
Negative Motivation | 0.87 |
Positive Motivation | 0.91 |
Supervisor Relationship | 0.94 |
Variable Cronbach's | α |
Hoping for Significance | 0.58 |
QRP Endorsement | 0.74 |
Lecturer Perceptions | 0.75 |
Self Perceptions | 0.76 |
General Stress | 0.77 |
Project Stress | 0.80 |
Statistics Confidence | 0.80 |
Negative Motivation | 0.87 |
Positive Motivation | 0.91 |
Supervisor Relationship | 0.94 |
Correlations
Exploratory analyses were conducted to assess the relationships between all variables using Pearson’s r correlations for normally distributed continuous data and Spearman’s 𝞺 for ordinal data such as student grades, which were reported in grade bands rather than numerically. Statistics are presented in terms of both traditional frequentist p-values as well as Bayes factors to evaluate strength of evidence (Dienes, 2011). There were no relevant previous data to guide an informed prior, and thus Bayesian analyses were computed using the default Jeffreys-Zellner-Siow (JZS) prior with a Cauchy distribution (r = 0.707; Rouder et al., 2009) in JASP (JASP Team, 2020). The JZS prior is a non-informative default and objective prior designed to minimise assumptions about the expected effect size. The alpha for our correlations was set at p < 0.001 as a conservative p-value, and effect sizes less than 0.2 were dismissed as noise (see Orben & Lakens, 2020, for a discussion of the “crud” factor). We considered BF10 > 6 as evidence for the alternative hypotheses in comparison to the null hypotheses and BF10 < 0.17 as evidence for the null hypotheses in comparison to the alternative hypotheses (e.g., Etz & Vandekerckhove, 2016). Results are shown in Figure 1, and statistically significant (p < 0.001, r > 0.2, and BF10 > 6) relationships are outlined below; p-values are not reported in text as all p’s < 0.001. In the event of disagreements between the p-values and Bayes factors, we only considered the results meaningful if both statistics exceeded the specified thresholds.
Note. Continuous variables for which Pearson’s correlations were computed are depicted here. Spearman’s 𝞺 were computed for ordinal data, and relationships of note are reported in text.
Note. Continuous variables for which Pearson’s correlations were computed are depicted here. Spearman’s 𝞺 were computed for ordinal data, and relationships of note are reported in text.
OR Awareness and QRPs. Greater OR awareness was moderately associated with higher statistics confidence (r = 0.37, BF10 = 980,945.88) and average grades (𝞺 = 0.26, BF10 = 444.54). OR awareness also predicted lower levels of student endorsement of QRPs (r = -0.33, BF10 = 32,636.88), less tendency to hope for significance (r = 0.29, BF10 = 2,790.78), and poorer self and lecturer QRP perceptions of QRPs (r = -0.28, BF10 = 1,111.64; r = -0.32, BF10 = 26,378.96). Likewise, higher student endorsement of QRPs was strongly associated with more positive self (r = 0.58, BF10 = 1.06×1018) and lecturer perceptions (r = 0.56, BF10 = 1.21×1017) of QRPs and also related to an increased hope for significant findings in their final project (r = 0.28, BF10 = 540.94). There was a strong positive relationship between self and lecturer perceptions of QRPs (r = 0.78, BF10 = 8.26×1042), and higher student self perceptions of QRPs were associated with a higher tendency to hope for significance (r = 0.27, BF10 = 354.70) and negatively related to their positive motivation (r = -0.34, BF10 = 57,603.00).
Bayes factors provided evidence for no relationship between OR awareness and negative motivation (r = 0.04, BF10 = 0.10), general stress (r = -0.08, BF10 = 0.16), and supervisor relationship (r = 0.07, BF10 = 0.14). There was also no relationship between QRP endorsement and statistics grades (𝞺 = -0.01, BF10 = 0.08), negative motivation (r = -0.04, BF10 = 0.10), general stress (r = 0,05, BF10 = 0.11), and project stress (r = 0.09, BF10 = 0.10). Both lecturer perceptions and self perceptions had no relationship with statistics grades (𝞺 = -0.04, BF10 = 0.12; 𝞺 = -0.08, BF10 = 0.13), general stress (r = 0.06, BF10 = 0.13; r = 0.06, BF10 = 0.15), and project stress (r = 0.05, BF10 = 0.12; r = 0.78, BF10 = 0.16).
Statistics Knowledge and Confidence. Statistics knowledge did not have any significant relationships with any other variables, and Bayes factors evidenced no relationship with general stress (r = 0.06, BF10 = 0.13) and supervisor relationship (r = 0.07, BF10 = 0.14). However, statistics confidence was negatively correlated with project stress (r = -0.29, BF10 = 840.47). Bayes factors also provided evidence for no relationship between statistics confidence and negative motivation (r = -0.002, BF10 = 0.09) or supervisor relationship (r = -0.01, BF10 = 0.10).
Student Grades, Motivations, and Stress. Student statistics grades were strongly positively correlated with their average grades (𝞺 = 0.70, BF10 = 6.12×1029). Students’ positive and negative motivations were related to each other (r = 0.54, BF10 = 2.09×1016), and general stress and project stress were positively associated with each other (r = 0.30, BF10 = 3,294.24). Additionally, lower general stress (r = 0.24, BF10 = 57.57) was correlated with a better supervisor relationship.
Bayes factors provided evidence for no relationship between both statistics/average grades and positive motivation (𝞺 = 0.05, BF10 = 0.10; 𝞺 = 0.003, BF10 = 0.10) as well as negative motivation (𝞺 = -0.04, BF10 = 0.11; 𝞺 = -0.06, BF10 = 0.10). There was also no relationship between statistics grades and general stress (𝞺 = -0.03, BF10 = 0.12), positive motivation and general stress (r = -0.07, BF10 = 0.15), or positive motivation and project stress (r = -0.07, BF10 = 0.13).
Study 1 Interim Summary
The results of Study 1 revealed strong relationships between several variables measuring undergraduate students’ perceptions of QRPs. Our measure examining QRP endorsement had a strong relationship with how students perceive QRPs and how they believe their lecturers to perceive QRPs, such that lower levels of QRP endorsement were associated with believing QRPs to be problematic and believing that one’s lecturers see QRPs as problematic as well. More awareness of OR (i.e., exposure to OR in their studies) was also moderately associated with less favourable perceptions of QRPs in both students and lecturers, decreased endorsement of QRPs, and a reduced likelihood of hoping for significant findings.
There were further relationships between awareness of OR and academic performance such that higher levels of OR awareness had a moderate relationship with higher grades on the course overall and even participants’ confidence in their own statistics aptitude. Higher levels of statistics confidence were associated with reduced project stress, and reduced general stress was related to better student-supervisor relationships. Higher positive motivation was also associated with less favourable views of QRPs within the students as well as their perceptions of their lecturers.
This exploratory study allowed for a broad survey of final-year students’ experiences with research and identified several interesting relationships between awareness of OR, endorsement of QRPs, academic performance, and statistics confidence. In Study 2, we aimed to conceptually replicate these relationships in a broader sample including psychology students on all years of their programme. We also aimed to further interrogate the relationship between these factors and other experiences of learning statistics, including statistics attitudes and anxiety. To do so, we partnered with the multi-year “Study of Open Research Methods” (STORM; Gilligan-Lee et al., under review), which aims to assess the attitudes, understanding, and awareness of OR practices in university students in successive years. In Study 2, we collected data on students’ statistics anxiety and understanding in a single year alongside the STORM questionnaire.
Study 2
Method
Preregistration and Open Materials
The planned analyses were preregistered (https://osf.io/3yx7j), and all materials and data are available online (https://osf.io/ume9k/).
Participants
A series of a priori power analyses were conducted using G*Power 3.0 (Faul et al., 2007) to determine the minimum sample size required. To obtain 80% power with an alpha of 0.05, 90 participants would be required to achieve equivalent effect sizes to Krishna and Peter (2018; f² = 0.19), who investigated similar constructs to our proposed hypotheses. For the planned hierarchical multiple regressions, we followed Tabachnick and Fidell’s (2014) suggestion of N ≥ 50 + 8K, where K is the number of predictors, to yield a required sample of 122. For the correlations, to achieve a small effect size (r = 0.20) with 80% power and our adjusted 0.001 alpha, we required at least 419 participants. We used the same recruitment methods as STORM (Gilligan-Lee et al., under review) had used in prior years, and as STORM typically recruited approximately 900 participants, we anticipated that we would reach our requirement of at least 419 participants.
Participants were recruited from the psychology participant pools at University of the West of England, University of Bristol, Aston University, and University of Surrey and received course credits for completing the study. Participants were also recruited from other UK universities by contacting programme leaders of all UK undergraduate psychology programmes and encouraging them to advertise the study to their students for a chance to win a £20 Amazon voucher for their participation. Given these recruitment methods, we had little control over the number of respondents, and thus we based our stopping rule on date: per our preregistration, we collected data until June 30, 2022. A total of 695 participants from 28 universities completed the questionnaire. The sample consisted of 573 women, 108 men, 11 nonbinary, and 3 people of another gender, with a mean age of 20.87 years (SD = 5.37). There were 334 participants in the first year of their studies, 194 in the second year, 30 in the third year, and 137 did not indicate their year of study. All participants provided informed consent, and the project was approved by the Ethics Committees at University of the West of England.
Procedure
Participants followed a link to the survey on Qualtrics. They completed basic demographic questions and then a series of questionnaires, which included measures related to students’ perceptions of OR and QRPs as well as their anxiety, attitudes, and understandings related to statistics. First, the scales measuring conceptual perceptions, situational perceptions, awareness and experience with OR (STORM questionnaire measures), self perceptions of QRPs, and general anxiety were presented in a random order, and then the remaining questionnaires were presented in a random order. The questionnaires took approximately 30 minutes to complete. The data were collected between March 2022 and June 2022.
Measures
The questionnaires administered in Study 2 consisted of several scales previously administered in Study 1: Self Perceptions of QRPs (Krishna & Peter, 2018), Statistics Knowledge, Statistics Confidence, and Student Grades (Overall Grades and Statistics Grades). We also included four questionnaires on perceptions of OR developed as part of STORM (Gilligan-Lee et al., under review), established questionnaires about statistics anxiety and attitudes (STARS [Cruise et al., 1985]; SATS-28 [Schau et al., 1995]), a questionnaire about data interpretation (from the science section of the American College Testing [ACT]), a questionnaire about generalised anxiety (STAI-T), and questions about students’ attitudes toward learning. The details of these measures are described below
Situational Perceptions of OR. Similar to the QRP Endorsement scale from Study 1, the Situational Perceptions of OR scale (Gilligan-Lee et al., under review) assessed participants’ views about the appropriate decisions in particular research scenarios. An example scenario is as follows: “Sadia completed a study showing that eating cabbage made people less happy. She wrote a paper and made it available online, before it was reviewed by a journal. She also put her raw data online (anonymised) and anyone could access it. A) Sadia should not have shared her paper until an academic journal had reviewed it. B) It is ok for Sadia to make her data available online.” A series of four scenarios was presented, after which participants rated two statements on a 5-point Likert scale (0 = “strongly disagree”, 4 = “strongly agree”). Items articulating a QRP were reverse-scored, and participants’ answers were summed to create a total Situational Perception score (ranging from 0 to 32), with higher scores indicating greater agreement with responsible research practices.
Conceptual Perceptions of OR. A second scale developed by Gilligan-Lee et al. (under review) was used to assess participants’ perceptions of 8 key concepts of OR, including replication, statistical power, preregistration, preprints, open research materials, open education materials, open access journals, and open data. Participants were presented with 16 statements (two for each concept, e.g., “Before they start to collect data, researchers should state their plans for the study in a time-stamped file online”) and rated them on a 5-point Likert scale (-2 = “strongly disagree”, 2 = “strongly agree”). Items articulating a QRP were reverse-scored, and participants’ responses were summed to create a total Conceptual Perception score (ranging from -32 to +32), with higher scores indicating more positive views of OR practices.
Awareness of OR. This scale was similar to the OR Awareness scale administered in Study 1 (Krishna & Peter, 2018) and was developed by Gilligan-Lee et al. (under review). As in Study 1, participants indicated whether they had heard of specific OR practices (previously stated in Conceptual Perceptions of OR) at any time during their degree, but there were only 8 practices rather than 12 as in Study 1. Positive responses were summed to give participants a total OR Awareness score ranging from 0 to 8, with a higher score indicating a greater level of OR awareness.
Experience with OR. Using the same eight practices from the Awareness of OR scale above, participants’ experience with OR was measured using an 8-item scale (Gilligan-Lee et al., under review) where participants indicated whether or not they had used OR practices. Responses were summed for a total OR Experience score ranging from 0 to 8, where a larger score suggested experience having used more OR practices.
Self Perceptions of QRPs. As in Study 1, questions adapted by Krishna and Peter (2018) were used to assess student Self Perceptions of QRPs. One item (claiming results are unaffected by demographic variables) was unintentionally omitted from this scale in Study 1 but was included here. Additionally, in Study 1, participants rated the items on a 1-7 scale to be in line with the other scales, but here, we used a 1-5 scale to mirror the original Krishna and Peter (2018) questionnaire. There were 11 negative practices in which participants rated on a 5-point Likert scale (1 = “sensible”, 5 = “problematic”); as in the original Krishna and Peter (2018) study, the positive and neutral practices (four items) were excluded from the analyses, and scores were reversed. The Self Perception of QRPs score ranged from 11 to 55, with higher scores indicating greater endorsement of QRPs as sensible.
Statistics Anxiety and Attitudes (STARS Section A and B). The Statistics Anxiety Rating Scale (STARS) questionnaire developed by Cruise et al. (1985) was used to measure Statistics Anxiety and Attitudes. This questionnaire consists of 51 items split into two subscales: the first 23 questions measure Anxiety, and the remaining 28 statements measure Attitudes. Both subscales were rated on a 5-point Likert scale (Section A: 1 = “no anxiety”, 5 = “a great deal of anxiety”; Section B: 1 = “strongly disagree”, 5 = “strongly agree”). Some items in this scale were adapted by Terry and Field (2021) from the original items by Cruise et al. (1985) to better represent more modern experiences of students; for example “Asking someone in the computer centre for help in understanding a printout” was changed to “Asking someone in the computer lab for help in understanding statistical output”. Total scores were computed for Statistics Anxiety (STARS-A; ranging from 23 to 115) and Statistics Attitude (STARS-B; ranging from 28 to 140), with higher scores indicating greater anxiety levels or more positive attitudes.
Statistics Attitudes (SATS-28). The Survey of Attitudes Toward Statistics (SATS-28; Schau et al., 1995) was also used to assess student’s attitudes towards statistics. This questionnaire included 28 statements, for example, “I like statistics” and “I am scared by statistics.”, which participants rated on a 7-point Likert scale (1 = “completely disagree”, 7 = “completely agree”, where the negative items were reverse scored). A total Statistics Attitude score ranging from 28 to 196 was computed, with higher scores indicating more positive attitudes towards statistics.
Statistics Knowledge and Confidence. Statistics Knowledge and Statistics Confidence were measured using the questionnaires outlined in Study 1.
Statistics Interpretation. Questions from the science section of the ACT, a test in the US designed to measure student readiness to attend university, was used to assess student interpretation of statistics and science. Participants were asked to read a passage that included two tables of information and a graph and answer a number of multiple-choice questions using the information provided. There were five multiple-choice questions, each with four possible answers where only one was correct. Correct answers were summed to create a total Statistics Interpretation score ranging from 0 to 5, with higher marks indicating better statistics interpretation abilities.
Statistics Grades, Average Grades, and Statistics Competence. Participant grades were also assessed for their compulsory statistics modules and average overall grades during each year of their course (years one to three and also year-four for Scottish students). (Note that this differs from Study 1, in which only final-year students were included; here, all undergraduate levels were eligible to participate, and participants reported on their marks obtained thus far.) Student Grades were measured using the same questions outlined in Study 1. The questions were displayed in relation to the years of study completed and appropriate for the country’s system as indicated by the demographic questions. Participants also answered an additional question that asked them to rate their self-perceived Statistics Competence compared to the rest of their cohort on a Likert scale (1 = “below average”, 7 = “above average”).
Generalised Anxiety. The Trait component of the STAI-T (Spielberger, 1983) was used to assess participants’ trait anxiety to control for Generalised Anxiety in the STARS questionnaire. This is a 20-item scale on which participants rated how much they perceived statements (e.g., “I feel nervous and restless;” “I feel secure”) to be similar to themselves on a 4-point Likert scale (1 = “not at all”, 4 = “very much so”, with the positive items reverse scored), giving a total General Anxiety score ranging from 20 to 80, with higher scores indicating greater levels of anxiety.
Student Attitudes Toward Learning. Questions developed in collaboration with the Framework for Open and Reproducible Research Training Community (FORRT; https://forrt.org) measured Students’ Attitudes Toward Learning about OR. These included a number of questions (e.g., “For me, it is more useful to learn the facts of science than how science is made”), presented on a 5-point scale (strongly disagree to strongly agree). Items demonstrating a lack of interest in learning about OR were reverse scored, and a composite Attitudes Toward Learning Score was computed, with higher scores indicating more positive attitudes towards learning about OR (ranging from 19 to 101).
Results
Deviations from Preregistration
Data collection and analyses deviated from plans outlined in the preregistration as follows:
Data were also collected from the participant pool at Aston University, as well as the other university participant pools mentioned in the preregistration. This has no bearing on the results or interpretations.
The preregistered plan was that participants who spent less than 12 minutes completing the questionnaire would be excluded. Participants who completed the full questionnaire in less than 12 minutes were excluded; however, participants who only partially completed the questionnaires were not excluded based on the same time restriction. We were less concerned that participants who did not complete all questionnaires would be speeding through the questionnaires (without engaging with the content) for a participation credit, and examination of their participation durations appeared reasonably in line with the amount of content they completed; therefore, data from incomplete participants’ fully completed questionnaires were included, regardless of their participation duration, as per Study 1.
Statistics Knowledge scores were computed differently than outlined in the preregistration but were calculated the same for both Study 1 and 2. This was done to equally weight the questions because they had different numbers of answer options. An average of participants’ percentage of correct answers for each question was used to compute a total Statistics Knowledge score ranging from 0 (where no questions were answered correctly) to 1 (where all questions were answered correctly).
Our planned regression models included a third step in which we would examine interactions between OR Awareness, General Anxiety, and our other predictor variables; however, this was based on the assumption that OR Awareness and General Anxiety would significantly predict our outcome variables. In the first regression model, neither predicted our outcome variable, so we did not pursue the third step with the interactions. In the latter two regression models, only OR Awareness predicted our outcome variables, but the interactions between OR Awareness and the other variables were not significant predictors of our outcome variables. Therefore, there were no relevant analyses to include for our planned third step, and this has been omitted from all regression models.
Data Exclusions
Twenty-five participants were excluded from analyses because they took less than 12 minutes to complete the survey, which was the minimum time cap as outlined in the preregistration. Additionally, 66 participants were excluded for failing to complete any questionnaires, and four participants were excluded as they did not provide a university name to indicate they were a current psychology student. In total, 95 entries were removed, and the number of remaining participants included in the analysis was 619, thus exceeding the planned sample size target. (All participants who completed any given questionnaire were included in the relevant analyses, but no data from partially completed questionnaires were included. The questionnaire order was randomised for each participant, so slightly different numbers of participants completed each questionnaire.)
Internal Reliability
As outlined in Study 1, the internal reliability of some scales (Statistics Knowledge, Awareness of OR, Experience with OR, Statistics Marks, Average Marks, Statistics Competence, and Statistics Interpretation) was not assessed in cases where they consisted of only binary response items or did not measure a construct but rather provide a score similar to a percentage on an exam. For the remaining questionnaires, all but one questionnaire met the criteria for high internal reliability (i.e. Cronbach’s α greater than 0.7, [Peterson, 1994]; see Table 2). One scale, Situational Perceptions of OR, was below the acceptable threshold (0.56), perhaps because this questionnaire can be considered more similar to an exam score rather than measuring a construct; however, this scale was still utilised in the analyses.
Variable | Cronbach's α |
Situational Perceptions of OR | 0.56 |
Conceptual Perceptions of OR | 0.71 |
Self Perceptions of QRPs | 0.71 |
Attitudes Toward Learning | 0.75 |
Statistics Confidence | 0.80 |
Statistics Attitudes (SAT-28) | 0.91 |
Generalised Anxiety | 0.92 |
Statistics Anxiety (STARS-A) | 0.95 |
Statistics Attitudes (STARS-B) | 0.95 |
Variable | Cronbach's α |
Situational Perceptions of OR | 0.56 |
Conceptual Perceptions of OR | 0.71 |
Self Perceptions of QRPs | 0.71 |
Attitudes Toward Learning | 0.75 |
Statistics Confidence | 0.80 |
Statistics Attitudes (SAT-28) | 0.91 |
Generalised Anxiety | 0.92 |
Statistics Anxiety (STARS-A) | 0.95 |
Statistics Attitudes (STARS-B) | 0.95 |
Correlations
Pearson’s r correlations were used to assess the relationship between normally distributed continuous data, and Spearman’s 𝞺 rank correlations were used for ordinal data including students’ grades, which were reported in grade bands rather than numerically. As in Study 1, both p-values and Bayesian statistics are reported. As per our preregistration, the alpha for correlations was set at p < 0.001 with effect sizes less than 0.2 dismissed as noise, BF10 > 6 were regarded as evidence for the alternative hypotheses in comparison to the null hypotheses, and BF10 < 0.17 were considered as evidence for the null hypothesis in comparison to the alternative hypothesis (e.g. Etz & Vandekerckhove, 2016). Results are displayed in Figure 2. Significant correlations (p < 0.001, , r > 0.2, and BF10 > 6) between variables are outlined below; p-values are not reported in text as all p’s < 0.001. In the event of disagreements between the p-values and Bayes factors, we only considered the results meaningful if both figures exceeded the specified thresholds.
Note. Continuous variables for which Pearson’s correlations were computed are depicted here. Spearman’s 𝞺 were computed for ordinal data, and relationships of note are reported in text.
Note. Continuous variables for which Pearson’s correlations were computed are depicted here. Spearman’s 𝞺 were computed for ordinal data, and relationships of note are reported in text.
Perceptions and Awareness of OR and QRPs. Conceptual and situational perceptions of OR were strongly positively correlated with each other (r = 0.59, BF10 = 8.96×1054), and more positive conceptual perceptions of OR were also moderately related to less of a tendency to endorse QRPs (r = -0.31, BF10 = 1.97×1011). Positive situational perceptions of OR were weakly predicted better statistics attitudes (r = 0.21, BF10 = 25070.62), improved statistics interpretation (r = 0.22, BF10 = 21579.99), and more negative self perceptions of QRPs (r = -0.31, BF10 = 1.97×1011). Student self-endorsement of QRPs was also associated with worse attitudes toward statistics (r = -0.22, BF10 = 34755.63) and learning science (r = -0.22, BF10 = 41966.64).
Bayes Factors provided evidence for no relationship between conceptual OR perceptions and OR awareness (r = 0.06, BF10 = 0.16), general anxiety (r = 0.004, BF10 = 0.05), statistics grades (𝞺 = 0.07, BF10 = 0.13), and average grades(𝞺 = 0.02, BF10 = 0.05), as well as between situational perceptions and general anxiety (r = 0.01, BF10 = 0.05) and average grades(𝞺 = 0.05, BF10 = 0.10), and between self-perceptions of QRPs and experience with OR (r = 0.03, BF10 = 0.07), general anxiety (r = -0.003, BF10 = 0.05), statistics anxiety (r = 0.03, BF10 = 0.07), statistics attitudes (r = -0.06, BF10 = 0.015), statistics confidence (r = -0.05, BF10 = 0.12), statistics grades (𝞺 = -0.07, BF10 = 0.12), average grades (𝞺 = -0.002, BF10 = 0.06), and statistics competence (r = 0.04, BF10 = 0.09). Additionally, no relationship was evidenced between awareness of OR and experience with OR (r = -2.37×10-4, BF10 = 0.05), general anxiety (r = -0.02, BF10 = 0.06), statistics anxiety (r = -0.001, BF10 = 0.05), statistics attitudes (r = 0.05, BF10 = 0.09; r = 0.02, BF10 = 0.06), statistics interpretation (r = 0.05, BF10 = 0.10), statistics competence (r = -0.01, BF10 = 0.05), and between experience of OR and general anxiety (r = -0.03, BF10 = 0.07), statistics anxiety (r = 0.001, BF10 = 0.06), statistics attitudes (r = -0.04, BF10 = 0.08; r = -0.02, BF10 = 0.06), statistics interpretation (r = -0.01, BF10 = 0.05), and student attitudes of learning (r = 0.04, BF10 = 0.08).
Statistics Anxiety and Attitudes. Statistics anxiety was moderately positively correlated with general anxiety (r = 0.42, BF10 = 3.31×1022) and was associated with worse statistics attitudes (r = -0.48, BF10 = 1.211×1031; r = -0.54; BF10 = 2.04×1040), confidence (r = -0.39, BF10 = 8.67×1017), student self-rated competence (r = -0.44, BF10 = 2.21×1024), and attitudes toward learning (r = -0.21, BF10 = 18447.53). General anxiety also predicted poorer statistics attitudes (r = -0.30, BF10 = 9.97×109), confidence (r = -0.21, BF10 = 15758.86), and student self-rated competence (r = -0.31, BF10 = 1.78×1011). Additionally, better student attitudes of statistics were correlated with higher self-competence rating (r = 0.39, BF10 = 5.69×1017; r = 0.47, BF10 = 2.21×1024), more positive attitudes toward learning (r = 0.39, BF10 = 6.10×1017; 0.36, BF10 = 6.63×1014), more statistics confidence (r = 0.31, BF10 = 1.20×1011; r = 0.34, BF10 = 1.06×1013) and a higher aptitude for statistics interpretation (r = 0.25, BF10 = 3.17×106). Bayes Factors provide evidence for no relationship between statistics anxiety and statistics grades (𝞺 = -0.09, BF10 = 0.13) and average grades (𝞺 = -0.02, BF10 = 0.05), and for statistics attitudes and average grades (r = 0.02, BF10 = 0.05; r = 0.01, BF10 = 0.05). General anxiety also was shown to have no relationship with statistics interpretation (r = -0.02, BF10 = 0.06) and statistics grades (r = -0.09, BF10 = 0.16).
Statistics Understanding and Confidence. Statistics knowledge did not have any significant relationships with the other variables, and Bayes factors provide evidence for no relationship with general stress (r = -0.04, BF10 = 0.08), student self-competence (r = 0.05, BF10 = 0.11), and self- and lecturer-perceptions (r = 0.01, BF10 = 0.05; r = 0.02, BF10 = 0.06). Higher levels of statistics confidence were associated with better statistics interpretation (r = 0.22, BF10 = 14235.83), higher statistics grades (𝞺 = 0.24, BF10 = 33329.31), more positive statistics attitudes (r = 0.31, BF10 = 1.20×1015; r = 0.34; BF10 = 1.06×1011), increased self-competence (r = 0.35, BF10 = 8.28×1013), and lower levels of statistics anxiety (r = -0.39, BF10 = 8.67×101) and general anxiety (r = -0.21, BF10 = 15758.86).
Student Grades and Competence. Student statistics grades were strongly positively correlated with their average grades (𝞺 = 0.77, BF10 = 2.91×10112). Bayes factors also provide evidence for no relationship between average grades and statistics competence (𝞺 = 0.03, BF10 = 0.06).
Regressions
We used hierarchical regressions to investigate whether statistics anxiety, attitudes, knowledge, confidence, and interpretation as well as students’ average and statistics grades predict perceptions of OR and QRPs (as measured by Conceptual Perceptions of OR, Situational Perceptions of OR and Self Perception scales). As per our preregistration, three regression models were used, one for each outcome variable, each model containing the same steps. During step one, Awareness of OR and Generalised Anxiety were entered to control for knowledge of OR and general anxiety (rather than statistics anxiety specifically). Then in step two, the statistics anxiety, attitudes, knowledge, confidence, and interpretation, and student marks scales were entered. For all regression models, our alpha was set at the p < 0.05 level.
Step . | Predictor . | R2 . | β . | p . | t . | F . |
---|---|---|---|---|---|---|
1 | Awareness of OR | 0.001 | 0.03 | 0.50 | 0.68 | 0.24 |
Generalised Anxiety | 0.003 | 0.94 | 0.07 | |||
2 | Awareness of OR | 0.07 | 0.003 | 0.95 | 0.07 | 3.67 |
Generalised Anxiety | 0.03 | 0.53 | 0.63 | |||
Statistics Anxiety (STARS-A) | 0.04 | 0.46 | 0.74 | |||
Statistics Attitudes (SAT-28) | -0.007 | 0.92 | -0.10 | |||
Statistics Attitudes (STARS-B) | 0.15 | 0.02* | 2.31 | |||
Statistics Knowledge | 0.07 | 0.10 | 1.63 | |||
Statistics Confidence | 0.08 | 0.08 | 1.75 | |||
Statistics Interpretation | 0.10 | 0.03* | 2.17 | |||
Statistics Grades | 0.08 | 0.27 | 1.11 | |||
Average Grades | -0.07 | 0.30 | -1.04 |
Step . | Predictor . | R2 . | β . | p . | t . | F . |
---|---|---|---|---|---|---|
1 | Awareness of OR | 0.001 | 0.03 | 0.50 | 0.68 | 0.24 |
Generalised Anxiety | 0.003 | 0.94 | 0.07 | |||
2 | Awareness of OR | 0.07 | 0.003 | 0.95 | 0.07 | 3.67 |
Generalised Anxiety | 0.03 | 0.53 | 0.63 | |||
Statistics Anxiety (STARS-A) | 0.04 | 0.46 | 0.74 | |||
Statistics Attitudes (SAT-28) | -0.007 | 0.92 | -0.10 | |||
Statistics Attitudes (STARS-B) | 0.15 | 0.02* | 2.31 | |||
Statistics Knowledge | 0.07 | 0.10 | 1.63 | |||
Statistics Confidence | 0.08 | 0.08 | 1.75 | |||
Statistics Interpretation | 0.10 | 0.03* | 2.17 | |||
Statistics Grades | 0.08 | 0.27 | 1.11 | |||
Average Grades | -0.07 | 0.30 | -1.04 |
Note. *p < 0.05.
Step . | Predictor . | R2 . | β . | p . | t . | F . |
---|---|---|---|---|---|---|
1 | Statistics Attitudes (STARS-B) | 0.5 | 0.15 | < 0.001* | 3.53 | 13.68 |
Statistics Interpretation | 0.12 | 0.005* | 2.84 |
Step . | Predictor . | R2 . | β . | p . | t . | F . |
---|---|---|---|---|---|---|
1 | Statistics Attitudes (STARS-B) | 0.5 | 0.15 | < 0.001* | 3.53 | 13.68 |
Statistics Interpretation | 0.12 | 0.005* | 2.84 |
Note. *p < 0.05.
Regression Model Two: Situational Perceptions of Open Research. All predictors entered can be seen in Table 5. In the final model, OR Awareness was included in Step 1 and accounted for 3% of the variability in Situational Perceptions of OR. Between Step 1 and 2, ΔR2 = 0.09, and significant predictors included Statistics Knowledge and Statistics Interpretation; see Table 6.
Step . | Predictor . | R2 . | β . | p . | t . | F . |
---|---|---|---|---|---|---|
1 | Awareness of OR | 0.03 | 0.17 | < 0.001* | 4.12 | 8.62 |
Generalised Anxiety | 0.03 | 0.53 | 0.62 | |||
2 | Awareness of OR | 0.13 | 0.15 | < 0.001* | 3.50 | 7.94 |
Generalised Anxiety | 0.08 | 0.07 | 1.81 | |||
Statistics Anxiety (STARS-A) | 0.003 | 0.96 | 0.05 | |||
Statistics Attitudes (SAT-28) | 0.06 | 0.40 | 0.85 | |||
Statistics Attitudes (STARS-B) | 0.11 | 0.07 | 1.82 | |||
Statistics Knowledge | 0.11 | 0.01* | 2.55 | |||
Statistics Confidence | 0.07 | 0.15 | 1.46 | |||
Statistics Interpretation | 0.13 | 0.002* | 3.07 | |||
Statistics Grades | 0.11 | 0.10 | 1.63 | |||
Average Grades | -0.08 | 0.23 | -1.21 |
Step . | Predictor . | R2 . | β . | p . | t . | F . |
---|---|---|---|---|---|---|
1 | Awareness of OR | 0.03 | 0.17 | < 0.001* | 4.12 | 8.62 |
Generalised Anxiety | 0.03 | 0.53 | 0.62 | |||
2 | Awareness of OR | 0.13 | 0.15 | < 0.001* | 3.50 | 7.94 |
Generalised Anxiety | 0.08 | 0.07 | 1.81 | |||
Statistics Anxiety (STARS-A) | 0.003 | 0.96 | 0.05 | |||
Statistics Attitudes (SAT-28) | 0.06 | 0.40 | 0.85 | |||
Statistics Attitudes (STARS-B) | 0.11 | 0.07 | 1.82 | |||
Statistics Knowledge | 0.11 | 0.01* | 2.55 | |||
Statistics Confidence | 0.07 | 0.15 | 1.46 | |||
Statistics Interpretation | 0.13 | 0.002* | 3.07 | |||
Statistics Grades | 0.11 | 0.10 | 1.63 | |||
Average Grades | -0.08 | 0.23 | -1.21 |
Note. *p < 0.05.
Step . | Predictor . | R2 . | β . | p . | t . | F . |
---|---|---|---|---|---|---|
1 | Awareness of OR | 0.03 | 0.18 | < 0.001* | 4.14 | 17.16 |
2 | Awareness of OR | 0.09 | 0.17 | < 0.001* | 4.02 | 18.12 |
Statistics Knowledge | 0.13 | 0.002* | 3.08 | |||
Statistics Interpretation | 0.19 | < 0.001* | 4.53 |
Step . | Predictor . | R2 . | β . | p . | t . | F . |
---|---|---|---|---|---|---|
1 | Awareness of OR | 0.03 | 0.18 | < 0.001* | 4.14 | 17.16 |
2 | Awareness of OR | 0.09 | 0.17 | < 0.001* | 4.02 | 18.12 |
Statistics Knowledge | 0.13 | 0.002* | 3.08 | |||
Statistics Interpretation | 0.19 | < 0.001* | 4.53 |
Note. *p < 0.05.
Regression Model Three: Self Perceptions of QRPs. All predictors entered can be seen in Table 7. In the final model, OR Awareness was included in Step 1 and accounted for 1% of the variance in Self Perceptions of QRPs. Between Step 1 and 2, ΔR2 = 0.08, and significant predictors included both measures of Statistics Attitudes (SAT-28 and STARS-B); see Table 8.
Step . | Predictor . | R2 . | β . | p . | t . | F . |
---|---|---|---|---|---|---|
1 | Awareness of OR | 0.01 | -0.10 | 0.02* | -2.44 | 3.19 |
Generalised Anxiety | -0.03 | 0.47 | -0.72 | |||
2 | Awareness of OR | 0.10 | -0.08 | 0.06 | -1.91 | 5.70 |
Generalised Anxiety | -0.01 | 0.78 | -0.28 | |||
Statistics Anxiety (STARS-A) | -0.06 | 0.30 | -1.04 | |||
Statistics Attitudes (SAT-28) | 0.18 | 0.01* | 2.76 | |||
Statistics Attitudes (STARS-B) | -0.34 | < 0.001* | -5.28 | |||
Statistics Knowledge | -0.08 | 0.07 | -1.80 | |||
Statistics Confidence | 0.01 | 0.82 | 0.23 | |||
Statistics Interpretation | 0.80 | 0.07 | -1.82 | |||
Statistics Grades | -0.12 | 0.09 | -1.70 | |||
Average Grades | 0.11 | 0.09 | 1.70 |
Step . | Predictor . | R2 . | β . | p . | t . | F . |
---|---|---|---|---|---|---|
1 | Awareness of OR | 0.01 | -0.10 | 0.02* | -2.44 | 3.19 |
Generalised Anxiety | -0.03 | 0.47 | -0.72 | |||
2 | Awareness of OR | 0.10 | -0.08 | 0.06 | -1.91 | 5.70 |
Generalised Anxiety | -0.01 | 0.78 | -0.28 | |||
Statistics Anxiety (STARS-A) | -0.06 | 0.30 | -1.04 | |||
Statistics Attitudes (SAT-28) | 0.18 | 0.01* | 2.76 | |||
Statistics Attitudes (STARS-B) | -0.34 | < 0.001* | -5.28 | |||
Statistics Knowledge | -0.08 | 0.07 | -1.80 | |||
Statistics Confidence | 0.01 | 0.82 | 0.23 | |||
Statistics Interpretation | 0.80 | 0.07 | -1.82 | |||
Statistics Grades | -0.12 | 0.09 | -1.70 | |||
Average Grades | 0.11 | 0.09 | 1.70 |
Note. *p < 0.05.
Step . | Predictor . | R2 . | β . | p . | t . | F . |
---|---|---|---|---|---|---|
1 | Awareness of OR | 0.01 | -0.10 | 0.02* | -2.35 | 5.52 |
2 | Awareness of OR | 0.08 | -0.08 | 0.04* | -2.05 | 15.17 |
Statistics Attitudes (SAT-28) | 0.20 | < 0.001* | 3.31 | |||
Statistics Attitudes (STARS-B) | -0.37 | < 0.001* | -6.05 |
Step . | Predictor . | R2 . | β . | p . | t . | F . |
---|---|---|---|---|---|---|
1 | Awareness of OR | 0.01 | -0.10 | 0.02* | -2.35 | 5.52 |
2 | Awareness of OR | 0.08 | -0.08 | 0.04* | -2.05 | 15.17 |
Statistics Attitudes (SAT-28) | 0.20 | < 0.001* | 3.31 | |||
Statistics Attitudes (STARS-B) | -0.37 | < 0.001* | -6.05 |
Note. *p < 0.05.
Study 2 Interim Summary
Conceptual Replication of Study 1
There was overlap between several of our measures between Study 1 and Study 2, and Study 2 served as a conceptual replication for these measures but with a full undergraduate population rather than only final-year students as in Study 1. We considered the findings from Study 2 to conceptually replicate those from Study 1 if the relationships met our inference criteria and were in the same direction.
The Situational Perceptions of OR and Conceptual Perceptions of OR scales used in Study 2 were similar to the QRP Endorsement scale from Study 1 in that they measured participants’ views on research decisions. (Note that higher scores on Situational/Conceptual Perceptions of OR indicate higher endorsement of OR whereas higher scores on QRP Endorsement from Study 1 indicate the opposite – higher scores indicate higher endorsement of QRPs). As in Study 1, higher endorsement of OR (akin to lower endorsement of QRPs) in research decisions was associated with less favourable views of QRPs. We also found a similar relationship between statistics confidence and grades, such that higher levels of confidence were associated with better academic performance. However, we did not replicate the Study 1 relationships between OR awareness and views on QRPs, potentially because we sampled a wider selection of students in Study 2, many in their first year of study, with limited exposure to OR on their programmes thus far.
Additional Findings
In Study 2, we also identified relationships between perceptions of OR, attitudes toward statistics, and statistics interpretation abilities, such that more positive views of OR predicted more positive statistics attitudes and abilities. Furthermore, higher levels of statistics anxiety were associated with worse attitudes toward statistics and lower levels of statistics confidence and competence. Levels of general anxiety were tied to levels of statistics anxiety and overlapped with these other factors. Students’ attitudes toward learning about OR concepts were negatively associated with their perceptions of QRPs and their levels of statistics anxiety but positively associated with their attitudes toward statistics. Finally, we found that statistics attitudes and statistics interpretation predicted participants’ conceptual perceptions of OR. When controlling for OR awareness, statistics knowledge and interpretation predicted participants’ situational perceptions of OR, and statistics attitudes predicted participants’ perceptions of QRPs.
Discussion
We aimed to investigate how perceptions of OR and QRPs are related to psychology students’ attitudes toward and understanding of statistics and experiences with research. Among final-year psychology students in Study 1, awareness of OR practices was associated with higher levels of confidence and better academic performance. Greater OR awareness also predicted less favourable views or QRPs, and students’ own views on research practices were also aligned with the views they perceived their lecturers to hold. In the expanded sample of all years of psychology students in Study 2, we conceptually replicated some of the relationships in Study 1 and identified additional relationships between perceptions of OR and statistics such that positive views of OR were associated with better attitudes toward statistics and statistics abilities. Regression analyses indicated that conceptual perceptions of OR were predicted by statistics attitudes and interpretation, that situational perceptions of OR were predicted by OR awareness and statistics knowledge and interpretation, and that perceptions of QRPs were predicted by statistics attitudes.
Research has predominantly focused on established academics and the factors that drive them to use QRPs, such as publication bias, an incentive culture focused on statistical significance, quantity over quality, and career incentives (Bruton et al., 2020; Doyle & Cuthill, 2015; Janke et al., 2019; Pennington, 2023; Smaldino & McElreath, 2016). However, students do not encounter these same influences as professional researchers to the same extent because they are largely unaffected by publication pressures and incentives to bolster their careers, and they often have limited research experience (Bruton et al., 2020; Krishna & Peter, 2018). Therefore, other factors may be related to students’ use of QRPs, such as their understanding of OR and attitudes toward statistics, confidence, academic success, and views of their lecturers’ perceptions of research, which we explored here.
Like Krishna and Peter (2018), we found a strong positive relationship between student perceptions and their views of their lecturer’s perceptions of QRPs. Students are first exposed to research during their undergraduate degree, and their individual learning environments are likely to impact their approach to research (Gigerenzer, 2018). Whether they are taught about OR on their course influences their own approach to research (Pownall, Azevedo, et al., 2023), and we found that, in final-year students, OR awareness is related to their perceptions of QRPs, their academic performance, and their statistics confidence. We did not find the same relationship with OR awareness in our expanded sample of all undergraduate students; in Study 2, only 4% of our sample were final-year students, and Bayesian analysis indicated evidence for no relationships between OR awareness and many of our factors of interest. These primarily first- and second-year undergraduate students perhaps did not yet have enough exposure to OR or research in general for this relationship to be apparent.
Stress in students is well documented and linked to mental health problems and decreased academic performance (Bekkouche et al., 2022; Khan et al., 2013), and it is possible that students who are particularly stressed about their project may see QRPs as shortcuts to achieving a successful outcome (e.g., grades). Our findings that believing significant results would be seen as more favourable (Hoping for Significance measure) was associated with more positive views of QRPs is in line with the idea that some students may engage with QRPs in the hopes of achieving significant results (see Pownall, Pennington, et al., 2023).
Student motivations in their studies are not yet well understood, although some research suggests students are particularly goal oriented, seeing their education as a way to reach their aims rather than internally rewarding in itself (Nilsson & Warrén Stomberg, 2008; Ryan & Deci, 2020). Students may still be pressured by academic culture to endorse and use QRPs, but their motivations likely differ; academics may engage in QRPs to aid the publication of their research whereas students may be instead driven by their desires to achieve good grades (Moran et al., 2022). However, we also found that higher student grades were associated with more positive perceptions of OR, so an understanding of responsible research conduct rather than engaging in QRPs may be a better route to academic success. If the pressures associated with a ‘culture of significance’ were no longer reinforced in academia, student perceptions of QRPs and OR may change (Moran et al., 2022; Stürmer et al., 2017). This could also be achieved through the teaching of the replication crisis and the associated scientific reforms of open research (see Pennington, 2023), perhaps as a mandatory component of undergraduate education (see Pownall, Azevedo, et al., 2023 for a review, but also see Thibault et al., 2023 for alternative views on whether open research is a critical part of undergraduate education and the challenges associated with fitting in the required content).
The pressures of academic achievement and how they are related to students’ views of QRPs may also be seen in their approaches to studying research methods and statistics and the anxiety they experience about statistics. Statistics anxiety has many detrimental effects on students, such as loss of academic buoyancy and performance and an increase in their procrastination (Dunn, 2014; Onwuegbuzie, 2004; Paechter et al., 2017; Putwain & Daly, 2013). As such, we predicted that statistics anxiety would be negatively associated with perceptions of OR, since it may increase the likelihood of students feeling pressured during their project and make them more likely to endorse QRPs. In our wider survey of undergraduate psychology students across all years (Study 2), our results show that neither perceptions of OR nor QRPs were associated with statistics anxiety as we had hypothesised, and Bayesian analysis revealed evidence for there being no relationship between statistics anxiety and QRP endorsement.
However, we did find that students’ interest in and openness to exploring OR research concepts (Attitudes Toward Learning) were negatively associated with their perceptions of QRPs and statistics anxiety but positively associated with better attitudes toward statistics. Students’ anxiety and attitudes toward statistics were inversely related to each other such that higher levels of statistics anxiety were associated with worse statistics attitudes, and these measures were also linked with a number of other factors we investigated. Statistics anxiety was negatively associated with statistics confidence and competence, whereas better attitudes toward statistics were related to higher levels of confidence and competence. Statistics attitudes were also moderately negatively associated with perceptions of QRPs and positively correlated with situational perceptions, as well as with statistics interpretation. Because ‘delusions’ and false beliefs of statistics might influence students’ use of QRPs (Gigerenzer, 2018), good statistics knowledge and competence act as a protective factor, and this in turn may foster higher endorsement of OR practices.
Perhaps surprisingly, however, there were no relationships between statistics knowledge and our other variables measures of interest, with Bayesian analyses indicating evidence for there being no relationship between statistics knowledge and OR awareness, statistics competence, and attitudes toward learning. This may be driven by the nature of the questions in the statistics knowledge questionnaire: the questions required a rather nuanced understanding of statistical concepts, such as the meaning of a p-value, which many professional researchers get wrong themselves (Gigerenzer, 2018). Given that the students in the sample may have been taught incorrect ideas about these concepts, their performance on this questionnaire may not be predictive of other factors if strong students were simply responding with the incorrect information they have been taught. Our measure of statistics interpretation may have allowed for a better assessment of students’ statistical prowess, and this measure was associated with students’ perceptions of OR as well as confidence and attitudes toward statistics.
Finally, we examined how our various measures predict views of OR and QRPs when controlling for awareness of OR and general anxiety. Because different psychology programmes contain different amounts of OR-related content, some of our participants may have had very limited exposure to OR concepts. Therefore, we controlled for OR awareness as well as for general anxiety to examine the impacts of statistics attitudes, knowledge, and anxiety-related factors on student perceptions of OR. OR awareness did not predict conceptual perceptions of OR but did predict situational perceptions of OR and perceptions of QRPs. Statistics interpretation was a significant predictor for both conceptual perceptions of OR and situational perceptions of OR, and statistics attitudes predicted both situational perceptions of OR and perceptions of QRPs. Conceptual perceptions of OR and situational perceptions of OR are similar constructs that both tap into how students view various QRP- or OR-related scenarios, where conceptual perceptions are presented in terms of a general concept (e.g., should researchers share their data?) and situational perceptions are presented in terms of a particular scenario (e.g., Elaine has put her data online; should she have done this before her manuscript was reviewed?). The situational perceptions questionnaire could, in effect, be considered more nuanced and require a more advanced understanding of OR, and thus factors such as awareness of OR and statistics knowledge also predicted participants’ situational perceptions of OR.
Our results highlight important relationships between OR and student success and academic experiences. Research is starting to explore the benefits of student education in OR, and competence and experience in academic research can be improved by incorporating OR into students’ teaching (Pennington, 2023; Pownall, Pennington, et al., 2023), and although our work is correlational, our results are in line with related research that has identified a causal link between teaching OR and academic understanding (e.g., Pownall, Azevedo, et al., 2023). Students may benefit from the chance to gain experience in OR methods during their degree courses, by incorporating practice of OR into student assignments and dissertations, for example, by setting write-ups of preregistrations in assessments (Blincoe & Buchert, 2020; Parsons et al., 2022), running replications or encouraging the use of preregistrations within final year projects (Button et al., 2020; Pownall, Pennington, et al., 2023; Strand & Brown, 2019). There are also many resources for educators wanting to embed the teaching of open research within their curriculum, such as FORRT (2023), Pennington (2023), Pennington & Pownall (2024), Button et al. (2020), Chopik et al. (2018), and Pownall et al. (2021).
Due to the high prevalence of QRPs and statistics anxiety among students (Krishna & Peter, 2018; Onwuegbuzie, 2004), our results also have implications for teaching OR and statistics. Our results suggest many links between statistics anxiety and attitudes and other variables, including student confidence and interpretation as well as OR perceptions. Students may benefit from university lecturers fostering positive attitudes towards statistics, decreasing their anxieties and boosting confidence in their abilities (Kvetnaya et al., 2019), but further work is required to confirm the causal link between these factors. Gigerenzer (2018) also argued that statistics teachers should teach a statistical ‘toolbox’ to their students and that null-hypothesis testing can encourage students to see significance as their aim rather than good scientific practice, as we found in Study 1. Perhaps wider teachings of the history and methods of statistics, including both frequentist and Bayesian techniques, could help students understand more deeply the methods and decisions made during analysis. This could steer them away from ritualistic behaviours, especially since an awareness of the problems caused by QRPs and statistics rituals can help students make more informed decisions to not engage in them (Gigerenzer, 2018; Moran et al., 2022).
Our work has identified a host of important relationships between students’ understanding of and attitudes toward statistics and OR, but future research should aim to further disentangle the relationships and possible factors related to both OR and statistics in student populations. It may also be important to investigate student perceptions about the importance of OR and research processes, both in their studies and the scientific process as a whole, to help adequately inform students about the real development of scientific knowledge and assist their understanding of the scientific process, and also establish this across multiple research disciplines, beyond psychology students. Additionally, researchers should explore the effectiveness of different integrations of OR and statistics into degree programmes to determine how best to educate students in these key areas and encourage practices of scientific integrity.
Conclusions
Since the future integrity of science is dependent upon its practicing researchers, investigating students’ OR and QRPs perceptions is important because they will form the next generation of researchers. In this two-study UK-wide project, we found important relationships between factors relating to statistics and perceptions of OR and QRPs amongst psychology undergraduate students. The understanding of OR has clear links to students’ academic experiences, attitudes, and success, and reducing students’ desires to attain significant results in their research might not only improve scientific rigour but also help reduce student stress. Our findings have important implications for teaching OR to students in terms of the potential benefits not only regarding research understanding but also for the sake of students’ wellbeing and the reduction of academic-related stress. Future research should strive to further understand the relationship between statistics and OR and to explore students’ perceptions of the importance of OR and its practices in order to enhance scientific reform.
Author Contributions
Conceptualization: K.C., A.L., K.G.-L., L.d.-W., and N.E.S.-S.; Data curation: K.C. and A.L; Formal analysis: K.C. and A.L..; Funding acquisition: K.C.; Investigation: K.C., A.L., K.G.-L., L.d.-W., N.E.S.-S., and C.R.P.; Methodology: K.C., A.L., K.G.-L., L.d.-W., N.E.S.-S., and F.A.; Project administration: K.C., A.L., and K.G.-L.; Supervision: K.C.; Visualization: K.C.; Writing - original draft: K.C. and A.L.; Writing - review & editing: K.C., A.L., K.G.-L., L.d.-W., N.E.S.-S., F.A., and C.R.P.
Acknowledgements
We thank Caroline Elliott for her work in the early stages of this project, and we thank Andy Eastwood and P Taint for their thoughtful comments.
Funding Information
This work was partially supported by funding from the Psychological Sciences Research Group at the University of the West of England obtained by K.C.
Competing Interests
The authors have no competing interest to declare.
Data Availability
All materials, code, and raw data are available via the Open Science Framework: https://osf.io/ume9k/