Training for Jira-based workflow

For pre-publication verification, we use a Jira-based workflow similar to the post-publication processes described in the Wiki.

Pre-publication verification is a priority and should be completed within a week of being assigned.

Typically these replications involve interacting with openICPSR repositories where code and data are stored.
There is a specific entry questionnaire for these replications separate from the one used for post-publication replications.

Scope

Your supervisor will assign you to this workflow if needed. This workflow covers code and data, even when data may not be accessible. Supervisors: see the separate document for details.

This workflow DOES NOT cover assessment of data citations. This is covered by a different training.

Overview

The following table illustrates the flow and transitions. The Transition field identifies the button in the interface that you click to move an issue from the From state to the To state. The Condition field identifies which form fields must be filled out before the transition can be made. Blocked is always an option, and leads to a "waiting state" until a resolution can be found.

From	Transition	→ To	Condition
Open	Start task	→ In Progress
In Progress	Download code	→ Code	`Code provenance` has been filled out, `Journal` has been identified.
Code	Access data	→ Data	`Git working location` has been filled out.
Data	Prepare preliminary report	→ Write Preliminary Report
Write Preliminary Report	Data is accessible	→ Verification	`Location of data` has been filled out.
.	Data not available	→ Code review	`Reason for non-accessibility of data` has been filled out.
Verification, Code review	Prepare report	→ Report
Report	Submit for review	→ Under Review	`Report URL` has been filled out.
Under Review	Approve	→ Approved	Can only be done by approvers.
.	Incomplete	→ Incomplete	n.a.
Approved	Done	→ Done	n.a.
Multiple	Need information	→ Incomplete	when information is missing
Incomplete	Restart	→ Code review
.	Restart verification	→ Verification
.	Restart task	→ In Progress
Blocked	Reopen	→ Open	n.a.
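The table can also be read as a small state machine. The following is a minimal sketch, not Jira's API: the names `TRANSITIONS` and `next_state` are hypothetical, the rows marked "." are assumed to share the From state of the row above them, and the "Multiple" row, the Blocked option, and the approver restriction are left out.

```python
# Illustrative model of the workflow table above. This is not how Jira
# stores workflows; it only mirrors the From / Transition / To / Condition rows.
# Assumption: rows marked "." reuse the From state of the preceding row.
TRANSITIONS = {
    ("Open", "Start task"): ("In Progress", []),
    ("In Progress", "Download code"): ("Code", ["Code provenance", "Journal"]),
    ("Code", "Access data"): ("Data", ["Git working location"]),
    ("Data", "Prepare preliminary report"): ("Write Preliminary Report", []),
    ("Write Preliminary Report", "Data is accessible"): (
        "Verification", ["Location of data"]),
    ("Write Preliminary Report", "Data not available"): (
        "Code review", ["Reason for non-accessibility of data"]),
    ("Verification", "Prepare report"): ("Report", []),
    ("Code review", "Prepare report"): ("Report", []),
    ("Report", "Submit for review"): ("Under Review", ["Report URL"]),
    ("Under Review", "Approve"): ("Approved", []),
    ("Under Review", "Incomplete"): ("Incomplete", []),
    ("Approved", "Done"): ("Done", []),
    ("Incomplete", "Restart"): ("Code review", []),
    ("Incomplete", "Restart verification"): ("Verification", []),
    ("Incomplete", "Restart task"): ("In Progress", []),
    ("Blocked", "Reopen"): ("Open", []),
}


def next_state(state, transition, fields):
    """Return the target state, or raise ValueError naming what is missing."""
    try:
        target, required = TRANSITIONS[(state, transition)]
    except KeyError:
        raise ValueError(
            f"No transition {transition!r} from state {state!r}") from None
    # A field counts as filled out only if it has a non-empty value.
    missing = [name for name in required if not fields.get(name)]
    if missing:
        raise ValueError("Fill out first: " + ", ".join(missing))
    return target
```

For example, `next_state("Report", "Submit for review", {})` raises an error naming `Report URL`, which matches the condition in the table.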

Notes

In the Issue form, please also fill out other fields, as noted.
If code and/or data are provided by email, Code provenance should be filled out with "email", otherwise with a URL.
There are no drop-down menus for Software, but once a value has been entered, it becomes available for future use. E.g., once Stata has been entered in software, it becomes a choice for future entries, and should be re-used.
All code should be stored on Bitbucket Git repositories.
- The root repository should contain only our files (i.e., REPLICATION.md, etc.)
- The paper's files should be in a subdirectory (e.g., paper_archive). Often this is created by the author-provided ZIP file - re-use it.
When committing, always use Smart Commits, e.g.

JRA-34 #comment corrected indent issue

Data should be stored locally (currently) / in Git LFS (soon)
- This is why it is important to identify the exact URL to download the data from (in Jira) - if somebody else needs to check what you are doing, they need to re-download the data
- When data directories contain ONLY data (no README, no code), then you MUST create a "README.md" indicating where the data is stored - otherwise, git will not preserve the directory structure.
Use Jira to communicate with your supervisor as issues arise, including when code takes a long time to run.
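The README.md requirement for data-only directories can be scripted. The sketch below is one possible helper, not part of the official template: the function name `write_data_readme`, its parameters, and the README wording are all hypothetical. It never overwrites an existing README.md.

```python
from pathlib import Path


def write_data_readme(data_dir, source_url, stored_at):
    """Create README.md in data_dir unless one already exists; return its path.

    source_url: where the data was downloaded from (as recorded in Jira).
    stored_at:  where the data is actually kept (e.g. "laptop", "CISER").
    """
    directory = Path(data_dir)
    directory.mkdir(parents=True, exist_ok=True)
    readme = directory / "README.md"
    if not readme.exists():
        # Hypothetical wording; adapt it to what your supervisor expects.
        readme.write_text(
            "# Data location\n\n"
            f"- Downloaded from: {source_url}\n"
            f"- Stored at: {stored_at}\n",
            encoding="utf-8",
        )
    return readme
```

After running it for a directory such as `paper_archive/data` (a hypothetical path), `git add` the new README.md so that the directory appears in the repository even though the data files themselves are not committed.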

Details

Additional details for each of the key stages are provided here.

In Progress

At this stage, you are collecting information.

Start by creating a repository
- The repository name should be the name of the Jira issue (e.g., AEJPOLICY-5)
- Populate the repository with the template
- Delete unused files from the template! Then git add those that you keep
Establish a list of the Tables and Figures in the paper, and use this to guide you in REPLICATION.md.
Fill out the Entry Questionnaire (see the Jira project for the link)
Then fill out the Jira form, in particular the following fields
- Entry questionnaire - date the Entry Questionnaire was filled out
- Code provenance - location of the code (programs) package - this can be email, or a particular location
- Journal - the journal the replication is for
- Manuscript Central identifier - optional, note it if available

You can now change the status to Code.

Code

In this stage, download the code or the entire replication package, and populate the Bitbucket repository.

See the Notes above for how to handle data.
Fill out the form with the location of the repository (e.g. https://bitbucket.org/aeaverification/aearep-2/src/master/)
From the README of the replication package, or the article itself, establish a list of Datasets used. You will use this to guide you when filling out the Data Citation and Information report.
Add the list of datasets to the repository, git add it, and commit.

Commit!

You can proceed to the next stage.

Data

Referring back to your list of datasets, assess whether at least part of the code can be run.

If nothing can be run, fill out Reason for non-accessibility of data and continue to Code review.
If at least some of the code can be run, identify the Data provenance (where you got the data from: email, URL, Github, Dropbox, etc.) and Location of data (where you put the data, which can be CISER, laptop, or Git LFS, or somewhere else).

Proceed to Verification.

Verification

In this stage, you are verifying the code, either using the provided data, or by inspecting the completeness of the source code. The REPLICATION.md is the report.

Keep a log of what you do, what you find, and what does not work, in the REPLICATION.md.

You should commit your report with intermediate results as you have them. Do not wait until you have all the results finished. Commit (using Smart Commits) frequently!

Commit!
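Because every commit should carry a Smart Commit message, it can help to build the message programmatically before passing it to `git commit -m`. This is a minimal sketch covering only the `#comment` form shown in the Notes; the helper name `smart_commit_message` is hypothetical, and the issue-key pattern (uppercase project key, hyphen, number) is an assumption about how the Jira project is configured.

```python
import re

# Assumed shape: "<ISSUE-KEY> #comment <text>", e.g. "JRA-34 #comment fixed typo".
# The key pattern is an assumption; Jira project keys can be configured differently.
SMART_COMMIT = re.compile(
    r"^(?P<key>[A-Z][A-Z0-9]*-\d+)\s+#comment\s+(?P<text>\S.*)$"
)


def smart_commit_message(issue_key, comment):
    """Return a single-line '<issue_key> #comment <comment>' message.

    Raises ValueError if the key does not look like an issue key, or if the
    comment is empty or spans several lines.
    """
    message = f"{issue_key} #comment {comment.strip()}"
    if not SMART_COMMIT.match(message):
        raise ValueError(f"Not a valid Smart Commit message: {message!r}")
    return message
```

The returned string can be used as the commit message, for instance `git commit -m "JRA-34 #comment corrected indent issue"`.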

Once you are done with the verification, prepare the report.

Prepare Report

To complete this stage, enter the direct URL of the report in the relevant repository into the Report URL field, for example:

https://bitbucket.org/aeaverification/aearep-2/src/master/REPLICATION.md

At this time, you can submit the report for review.

Be sure you fill in all of the metadata!

Depending on the view you chose for Jira, it may look different (see below), but always fill out all the fields that you can.
In particular, fill in the field for Journal, MC number, and Report URL (pointing to Bitbucket) in EVERY single case.

View 1

View 2

Updating information

When receiving updated files from authors, do NOT create "update" or "new" directories. The current state of the repository should always correspond to the author's structure. Overwrite files, delete files. The previous state is preserved in Git. This will also tell you what files have changed.
When running a second replication on the same archive, make sure the "REPLICATION.md" you commit is accurate. Do not let it contain holdover content from a previous replication attempt, as this can lead to confusion.
