Skip to content

Instantly share code, notes, and snippets.

@JoKeyser
Last active September 21, 2025 11:34
Show Gist options
  • Select an option

  • Save JoKeyser/3506f3087bc68dda89f32f56ed9c283c to your computer and use it in GitHub Desktop.

Select an option

Save JoKeyser/3506f3087bc68dda89f32f56ed9c283c to your computer and use it in GitHub Desktop.
OSF Preregistration template in Markdown

OSF Preregistration form

Important

This is a text-only Markdown version of the OSF Preregistration form. Upon completion, you can copy/paste your answers into the OSF web form to publish your preregistration.

Why a Markdown version? The OSF web form is unwieldy for writing and editing long texts, much less for coordinating between multiple authors. This Markdown version may help you prepare your preregistration, in a format that is easier to edit and version control. To coordinate contributions from multiple authors, you can use Git and GitLab, for example. Also, you can convert this Markdown file to a polished PDF using Pandoc or Quarto.

How to use this form? To use the Markdown version, follow the instructions below. At the places indicated, replace the placeholder text with your own text. Ultimately, you should also delete or comment out all instructional text (i.e., any line leading with >), so that only your own text remains.

Instructions

Note

Explanation by COS: General information about preregistration is available at https://cos.io/prereg and you can reach out to prereg@cos.io. A preprint of this template is available at https://osf.io/preprints/metaarxiv/epgjd/.

This document includes the questions that will be when completing this registration template on OSF. Make a copy of this document and use it to plan and prepare for submitting your registration.

Questions with an asterisk (✱) are required.

Questions will offer one of the following input options:

Symbol Name What to do (in the web form)
🔘 Radio button You will be provided with a series of options and may select only one. (Note that Markdown does not support radio buttons; thus below, all radio buttons are replaced with check boxes.)
Check box You will be provided with a series of options and may select as many as necessary.
✎▭ Text box (short or long) You will type in your response.
📤➕ File upload widget You can upload a file as a response to this question. You may attach up to 5 files and cannot total over 5 GB in size.

Metadata

Title ✱

Note

✎▭ Short text box; replace this note with your title.

Description ✱

Note

✎▭ Short text box; replace this note with your description.

Contributors ✱

Note

✎▭ Short text box; replace this note with a list of all contributors, each with their full name and email address.

License ✱

Note

🔘 Select one, i.e., enter an x to mark your choice,

  • like this.

You can read more about licenses in our help guides.

  • No license
  • GNU Lesser General Public License (LGPL) 3.0
  • BSD 3-Clause "New"/"Revised" License
  • BSD 2-Clause "Simplified" License
  • GNU Lesser General Public License (LGPL) 2.1
  • CC-By Attribution 4.0 International
  • Artistic License 2.0
  • CC0 1.0 Universal
  • Apache License 2.0
  • Mozilla Public License 2.0
  • Academic Free License (AFL) 3.0
  • Eclipse Public License 1.0
  • MIT License
  • GNU General Public License (GPL) 3.0
  • GNU General Public License (GPL) 2.0
  • CC-By Attribution-ShareAlike 4.0 International
  • CC-By Attribution-NonCommercial-NoDerivatives 4.0 International

Subject ✱

Note

Explanation by COS: Our system uses the bepress taxonomy. Please select as many subjects as you please. Note, the more detailed and inclusive you are in your response makes it easier for others to find your work.

Note

✎▭ Short text box; replace this note with your subject(s).

Tags

Note

✎▭ Short text box; replace this note with your chosen tags.

Study Information

Hypotheses ✱

Note

Explanation by COS: List specific, concise, and testable hypotheses. Please state if the hypotheses are directional or non-directional. If directional, state the direction. A predicted effect is also appropriate here. If a specific interaction or moderation is important to your research, you can list that as a separate hypothesis.

Example by COS
If taste affects preference, then mean preference indices will be higher with higher concentrations of sugar.

Note

✎▭ Long text box; replace this note with your hypotheses.

Design Plan

Note

Explanation by COS: In this section, you will be asked to describe the overall design of your study. Remember that this research plan is designed to register a single study, so if you have multiple experimental designs, please complete a separate preregistration.

Study type ✱

Note

🔘 Please select one of the following statements, i.e., enter an x to mark your choice,

  • like this.
  • Experiment - A researcher randomly assigns treatments to study subjects, this includes field or lab experiments. This is also known as an intervention experiment and includes randomized controlled trials.
  • Observational Study - Data is collected from study subjects that are not randomly assigned to a treatment. This includes surveys, "natural experiments," and regression discontinuity designs.
  • Meta-Analysis - A systematic review of published studies.
  • Other

Blinding ✱

Note

Blinding describes who is aware of the experimental manipulations within a study. ☒ Mark all that apply, i.e., enter an x for each choice,

  • like this.
  • No blinding is involved in this study.
  • For studies that involve human subjects, they will not know the treatment group to which they have been assigned.
  • Personnel who interact directly with the study subjects (either human or non-human subjects) will not be aware of the assigned treatments. (Commonly known as "double blind".)
  • Personnel who analyze the data collected from the study are not aware of the treatment applied to any given group.

Is there any additional blinding in this study?

Note

✎▭ Long text box; replace this note with your text about additional blinding. If you have no additional blinding, you can write "None".

Study design ✱

Note

Explanation by COS: Describe your study design. The key is to be as detailed as is necessary given the specific parameters of the design. There may be some overlap between this question and the following questions. That is OK, as long as sufficient detail is given in one of the areas to provide all of the requested information. Examples include two-group, factorial, randomized block, and repeated measures. Is it a between (unpaired), within-subject (paired), or mixed design? Describe any counterbalancing required.

Example by COS
We have a between subjects design with 1 factor (sugar by mass) with 4 levels.

Tip

More info by COS: This question has a variety of possible answers. The key is for a researcher to be as detailed as is necessary given the specifics of their design. Be careful to determine if every parameter has been specified in the description of the study design. There may be some overlap between this question and the following questions. That is OK, as long as sufficient detail is given in one of the areas to provide all of the requested information. For example, if the study design describes a complete factorial, 2×3 design and the treatments and levels are specified previously, you do not have to repeat that information.

Note

✎▭ Long text box; replace this note with your study design.

Note

📤➕ In the OSF web form, you can upload files here.

Randomization

Note

Explanation by COS: If you are doing a randomized study, state how you will randomize, and at what level. Typical randomization techniques include: simple, block, stratified, and adaptive covariate randomization. If randomization is required for the study, the method should be specified here, not simply the source of random numbers.

Example by COS
We will use block randomization, where each participant will be randomly assigned to one of the four equally sized, predetermined blocks. The random number list used to create these four blocks will be created using the web applications available at http://random.org.

Tip

More info by COS: Typical randomization techniques include: simple, block, stratified, and adaptive covariate randomization. If randomization is required for the study, the method should be specified here, not simply the source of random numbers.

Note

✎▭ Long text box; replace this note with your randomization. If you have no randomization, you can write "None".

Sampling Plan

Note

Explanation by COS: In this section we'll ask you to describe how you plan to collect samples, as well as the number of samples you plan to collect and your rationale for this decision. Please keep in mind that the data described in this section should be the actual data used for analysis, so if you are using a subset of a larger dataset, please describe the subset that will actually be used in your study.

Existing data✱

Note

Explanation by COS: Preregistration is designed to make clear the distinction between confirmatory tests, specified prior to seeing the data, and exploratory analyses conducted after observing the data. Therefore, creating a research plan in which existing data will be used presents unique challenges. Please select the description that best describes your situation. See https://cos.io/prereg for more information.

  • Registration prior to creation of data: As of the date of submission of this research plan for preregistration, the data have not yet been collected, created, or realized.
  • Registration prior to any human observation of the data: As of the date of submission, the data exist but have not yet been quantified, constructed, observed, or reported by anyone — including individuals that are not associated with the proposed study. Examples include museum specimens that have not been measured and data that have been collected by non-human collectors and are inaccessible.
  • Registration prior to accessing the data: As of the date of submission, the data exist, but have not been accessed by you or your collaborators. Commonly, this includes data that has been collected by another researcher or institution.
  • Registration prior to analysis of the data: As of the date of submission, the data exist and you have accessed it, though no analysis has been conducted related to the research plan (including calculation of summary statistics). A common situation for this scenario when a large dataset exists that is used for many different studies over time, or when a data set is randomly split into a sample for exploratory analyses, and the other section of data is reserved for later confirmatory data analysis.
  • Registration following analysis of the data: As of the date of submission, you have accessed and analyzed some of the data relevant to the research plan. This includes preliminary analysis of variables, calculation of descriptive statistics, and observation of data distributions.

Explanation of existing data

Note

Explanation by COS: If you indicate that you will be using some data that already exist in this study, please describe the steps you have taken to assure that you are unaware of any patterns or summary statistics in the data. This may include an explanation of how access to the data has been limited, who has observed the data, or how you have avoided observing any analysis of the specific data you will use in your study.

Example by COS
An appropriate instance of using existing data would be collecting a sample size much larger than is required for the study, using a small portion of it to conduct exploratory analysis, and then registering one particular analysis that showed promising results. After registration, conduct the specified analysis on that part of the dataset that had not been investigated by the researcher up to that point.

Tip

More info by COS: An appropriate instance of using existing data would be collecting a sample size much larger than is required for the study, using a small portion of it to conduct exploratory analysis, and then registering one particular analysis that showed promising results. After registration, conduct the specified analysis on that part of the dataset that had not been investigated by the researcher up to that point.

Note

✎▭ Long text box; describe your existing data here, if you have any.

Data collection procedures ✱

Note

Explanation by COS: Please describe the process by which you will collect your data and your inclusion and exclusion criteria. If you are using human subjects, this should include the population from which you obtain subjects, recruitment efforts, payment for participation, how subjects will be selected for eligibility from the initial pool, and your study timeline. For studies that don't include human subjects, include information about how you will collect samples, duration of data gathering efforts, source or location of samples, or batch numbers you will use.

Example by COS
Participants will be recruited through advertisements at local pastry shops. Participants will be paid $10 for agreeing to participate (raised to $30 if our sample size is not reached within 15 days of beginning recruitment). Participants must be at least 18 years old and be able to eat the ingredients of the pastries.

Tip

More information by COS: The answer to this question requires a specific set of instructions so that another person could repeat the data collection procedures and recreate the study population. Alternatively, if the study population would be unable to be reproduced because it relies on a specific set of circumstances unlikely to be recreated (e.g., a community of people from a specific time and location), the criteria and methods for creating the group and the rationale for this unique set of subjects should be clear.

Note

✎▭ Long text box; replace this note with your data collection procedures.

Note

📤➕ In the OSF web form, you can upload files here.

Sample size ✱

Note

Explanation by COS: Describe the sample size of your study. How many units will be analyzed in the study? This could be the number of people, birds, classrooms, plots, or countries included. If the units are not individuals, then describe the size requirements for each unit. If you are using a clustered or multilevel design, describe how many units are you collecting at each level of the analysis. This might be the number of samples or a range, minimum, or maximum.

Example by COS
Our target sample size is 280 participants. We will attempt to recruit up to 320, assuming that not all will complete the total task.

[!TIP] More information by COS: For some studies, this will simply be the number of samples or the number of clusters. For others, this could be an expected range, minimum, or maximum number.

Note

✎▭ Long text box; replace this note with your sample size.

Sample size rationale

Note

Explanation by COS: This could include a power analysis or an arbitrary constraint such as time, money, or personnel.

Example by COS
We used the software program G*Power to conduct a power analysis. Our goal was to obtain .95 power to detect a medium effect size of .25 at the standard .05 alpha error probability.

Tip

More information by COS: This gives you an opportunity to specifically state how the sample size will be determined. A wide range of possible answers is acceptable; remember that transparency is more important than principled justifications. If you state any reason for a sample size upfront, it is better than stating no reason and leaving the reader to "fill in the blanks." Acceptable rationales include: a power analysis, an arbitrary number of subjects, or a number based on time or monetary constraints.

Note

✎▭ Long text box; replace this note with your sample size rationale.

Stopping rule

Note

Explanation by COS: If your data collection procedures do not give you full control over your exact sample size, specify how you will decide when to terminate your data collection. If you are using sequential analysis, include your pre-specified thresholds.

Example by COS
We will post participant sign-up slots by week on the preceding Friday night, with 20 spots posted per week. We will post 20 new slots each week if, on that Friday night, we are below 320 participants.

Tip

More information by COS: You may specify a stopping rule based on p-values only in the specific case of sequential analyses with pre-specified checkpoints, alphas levels, and stopping rules. Unacceptable rationales include stopping based on p-values if checkpoints and stopping rules are not specified. If you have control over your sample size, then including a stopping rule is not necessary, though it must be clear in this question or a previous question how an exact sample size is attained.

Note

✎▭ Long text box; replace this note with your stopping rule(s).

Variables

Note

Explanation by COS: In this section you can describe all variables (both manipulated and measured variables) that will later be used in your confirmatory analysis plan. In your analysis plan, you will have the opportunity to describe how each variable will be used. If you have variables which you are measuring for exploratory analyses, you are not required to list them, though you are permitted to do so.

Manipulated variables

Note

Explanation by COS: Precisely define all variables you plan to manipulate and the levels or treatment arms of each variable. This is not applicable to any observational study.

Example by COS
We manipulated the percentage of sugar by mass added to brownies. The four levels of this categorical variable are: 15%, 20%, 25%, or 40% cane sugar by mass.

Tip

More information by COS: For any experimental manipulation, you should give a precise definition of each manipulated variable. This must include a precise description of the levels at which each variable will be set, or a specific definition for each categorical treatment. For example, "loud or quiet," should instead give either a precise decibel level or a means of recreating each level. 'Presence/absence' or 'positive/negative' is an acceptable description if the variable is precisely described.

Note

✎▭ Long text box; replace this note with your manipulated variable(s). If you have no manipulated variable, you can write "None".

Note

📤➕ In the OSF web form, you can upload files here.

Measured variables ✱

Note

Explanation by COS: Precisely define each variable that you will measure. This will include outcome measures, as well as any measured predictors or covariates.

Example by COS
The single outcome variable will be the perceived tastiness of the single brownie each participant will eat. We will measure this by asking participants 'How much did you enjoy eating the brownie' (on a scale of 1-7, 1 being 'not at all', 7 being 'a great deal') and 'How good did the brownie taste' (on a scale of 1-7, 1 being 'very bad', 7 being 'very good').

Tip

More information by COS: Observational studies and meta-analyses will include only measured variables. As with the previous questions, the answers here must be precise. For example, 'intelligence,' 'accuracy,' 'aggression,' and 'color' are too vague. Acceptable alternatives could be 'IQ as measured by Wechsler Adult Intelligence Scale' 'percent correct,' 'number of threat displays,' and 'percent reflectance at 400 nm.'

Note

✎▭ Long text box; replace this note with your measured variable(s).

Note

📤➕ In the OSF web form, you can upload files here.

Indices

Note

Explanation by COS: If applicable, please define how measures will be combined into an index (or even a mean) and what measures will be used. Include either a formula or a precise description of the method. If you are using a more complicated statistical method to combine measures (e.g., a factor analysis), please note that here but describe the exact method in the analysis plan section.

Example by COS
We will take the mean of the two questions above to create a single measure of 'brownie enjoyment.'

Tip

More information by COS: If you are using multiple pieces of data to construct a single variable, how will this occur? Both the data that are included and the formula or weights for each measure must be specified. Standard summary statistics, such as "means" do not require a formula, though more complicated indices require either the exact formula or, if it is an established index in the field, the index must be unambiguously defined. For example, "biodiversity index" is too broad, whereas "Shannon's biodiversity index" is appropriate.

Note

✎▭ Long text box; replace this note with your indices. If you have no indices, you can write "None".

Note

📤➕ In the OSF web form, you can upload files here.

Analysis Plan

Note

Explanation by COS: In this section, you will describe one or more confirmatory analysis. Please remember that all analyses specified below must be reported in the final article, and any additional analyses must be noted as exploratory or hypothesis-generating. A confirmatory analysis plan must state up front which variables are predictors (independent) and which are the outcomes (dependent).

Statistical models ✱

Note

Explanation by COS: What statistical model will you use to test each hypothesis? Please include the type of model (e.g., ANOVA, RMANOVA, MANOVA, multiple regression, SEM, etc.) and the specification of the model. This includes each variable that will be included, all interactions, subgroup analyses, pairwise or complex contrasts, and any follow-up tests from omnibus tests. If you plan on using any positive controls, negative controls, or manipulation checks you may mention that here. Provide enough detail so that another person could run the same analysis with the information provided. Remember that in your final article any test not included here must be noted as exploratory and that you must report the results of all tests.

Example by COS
We will use a 2×3 repeated measures ANOVA (RMANOVA) with both factors within subjects to analyze our results.

Tip

More information by COS: This is perhaps the most important and most complicated question within the preregistration. As with all of the other questions, the key is to provide a specific recipe for analyzing the collected data. Ask yourself: is enough detail provided to run the same analysis again with the information provided by the user? Be aware for instances where the statistical models appear specific, but actually leave openings for the precise test.

See the following examples:

  • If someone specifies a 2x3 ANOVA with both factors within subjects, there is still flexibility with the various types of ANOVAs that could be run. Either a repeated measures ANOVA (RMANOVA) or a multivariate ANOVA (MANOVA) could be used for that design, which are two different tests.
  • If you are going to perform a sequential analysis and check after 50, 100, and 150 samples, you must also specify the p-values you'll test against at those three points.

Note

✎▭ Long text box; replace this note with your statistical models.

Note

📤➕ In the OSF web form, you can upload files here.

Transformations

Note

Explanation by COS: If you plan on transforming, centering, recoding the data, or will require a coding scheme for categorical variables, please describe that process.

Example by COS
The "Effect of sugar on brownie tastiness" does not require any additional transformations.

However, if it were using a regression analysis and each level of sweet had been categorically described (e.g., not sweet, somewhat sweet, sweet, and very sweet), 'sweet' could be dummy coded with 'not sweet' as the reference category.

Tip

More information by COS: If any categorical predictors are included in a regression, indicate how those variables will be coded (e.g., dummy coding, summation coding, etc.) and what the reference category will be.

Note

✎▭ Long text box; replace this note with your transformations. If you have no transformations, you can write "None".

Inference criteria

Note

Explanation by COS: What criteria will you use to make inferences? Please describe the information you'll use (e.g., specify the p-values, Bayes factors, specific model fit indices), as well as cut-off criterion, where appropriate. Will you be using one or two tailed tests for each of your analyses? If you are comparing multiple conditions or testing multiple hypotheses, will you account for this?

Example by COS
We will use the standard p<.05 criteria for determining if the ANOVA and the post hoc test suggest that the results are significantly different from those expected if the null hypothesis were correct. The post-hoc Tukey-Kramer test adjusts for multiple comparisons.

Tip

More information by COS: P-values, confidence intervals, and effect sizes are standard means for making an inference, and any level is acceptable, though some criteria must be specified in this or previous fields. Bayesian analyses should specify a Bayes factor or a credible interval. If you are selecting models, then how will you determine the relative quality of each? In regards to multiple comparisons, this is a question with few "wrong" answers. In other words, transparency is more important than any specific method of controlling the false discovery rate or false error rate. One may state an intention to report all tests conducted or one may conduct a specific correction procedure; either strategy is acceptable.

Note

✎▭ Long text box; replace this note with your inference criteria.

Data exclusion

[!NOTE] Explanation by COS How will you determine what data or samples, if any, to exclude from your analyses? How will outliers be handled? Will you use any awareness check?

Example by COS
We will verify that each subject answered each of the three tastiness indices. Outliers will be included in the analysis.

Tip

More information by COS: Any rule for excluding a particular set of data is acceptable. One may describe rules for excluding a participant or for identifying outlier data.

Note

✎▭ Long text box; replace this note with your data exclusion rule(s).

Missing data

Note

Explanation by COS: How will you deal with incomplete or missing data?

Example by COS
If a subject does not complete any of the three indices of tastiness, that subject will not be included in the analysis.

Tip

More information by COS: Any relevant explanation is acceptable. As a final reminder, remember that the final analysis must follow the specified plan, and deviations must be either strongly justified or included as a separate, exploratory analysis.

Note

✎▭ Long text box; replace this note with your missing data handling.

Exploratory analysis

Note

Explanation by COS: If you plan to explore your data to look for unspecified differences or relationships, you may include those plans here. If you list an exploratory test here, you are not obligated to report its results. But if you do report it you are obligated to describe it as an exploratory result.

Example by COS
We expect that certain demographic traits may be related to taste preferences. Therefore, we will look for relationships between demographic variables (age, gender, income, and marital status) and the primary outcome measures of taste preferences.

Note

✎▭ Long text box; replace this note with your exploratory analysis plan(s).

Other

Other

Note

Explanation by COS: If there is any additional information that you feel needs to be included in your preregistration, please enter it here. Literature cited, disclosures of any related work such as replications or work that uses the same data, or other context that will be helpful for future readers would be appropriate here.

Note

✎▭ Long text box; replace this note with your other information.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment