-
Notifications
You must be signed in to change notification settings - Fork 25
Synchronizing Study Data
If you add or edit data directly to your Terra (FireCloud) workspace outside of the portal, and would like to 'sync' the study with the workspace, you may do so in the portal. This will have several effects on your study in the portal:
-
Any files found in the workspace that do not have a matching entry in your study will be added to your study (you will have a chance to choose which files to include and provide details about each one before saving).
-
Your study permissions (i.e. people you have shared your study with) will be made to match the permissions of the Terra (FireCloud) workspace. In other words, if you have shared your workspace with someone outside of the portal, a matching share will be added in the portal for your study. The converse is also true - if you remove a user's sharing of your workspace in Terra (FireCloud), then their share of you study in the portal will be removed as well.
After logging in, select '+ Add a study' from the profile menu.
After you enter all of the normal information for a study (name, description, embargo date, public/private), select 'Yes' for 'Use an existing workspace?' and enter the name of your Terra (FireCloud) workspace in the text field.
NOTE: The workspace must be in the 'single-cell-portal' FireCloud project to be used with the portal. Please email scp-support@broadinstitute.zendesk.com for help creating a workspace in this project.
Click 'Create Study' to save your study.
After logging in, select 'My Studies' from the profile dropdown.
Click 'Sync Workspace' for the corresponding study.
You will see a dialog while the portal is synchronizing your study. Depending on how many files there are in your workspace, this make take a little while.
You will see a list of all files that were found in the workspace that the portal does not have a record of.
For every file found in the workspace, you will be prompted to provide details for each one. Depending on what type of file it is, the forms will update to match your selection.
NOTE: If you do not want a file to show up in your study, you can either leave the form blank, or click "Don't Sync" to remove the form from the page. The portal will ignore that file for now. You can add it in at a later date if you wish by re-running the 'Sync Workspace' function for this study.
If you select 'Cluster', you will be prompted to provide a name (what the cluster will be labeled), a description, labels for each axis, and optionally the domain ranges of each axis. When you are finished, click 'Sync' to save your entry.
After you sync the file, you will see a dialog notifying you that the file is being loaded into the portal, and that you will receive an email when it has completed.
If you select 'Expression Matrix', you will be prompted to provide a description and a label to be used when plotting expression scores (will be an axis label). When you are finished, click 'Sync' to save your entry. You will see the same kind of completion dialog as with cluster files regarding email notifications.
If you select 'Metadata', you will be prompted to provide a description for the file. When you are finished, click 'Sync' to save your entry. You will see the same kind of completion dialog as with cluster files regarding email notifications.
If you select 'Gene List', you will be prompted to provide a name for the list and a description. When you are finished, click 'Sync' to save your entry. You will see the same kind of completion dialog as with cluster files regarding email notifications.
If you select 'Documentation' or 'Other', you will be prompted to provide a description for the file. When you are finished, click 'Sync' to save your entry. Since these files are not parsed and loaded into the portal, there is no email notification as the synchronization is already complete.
If you have uploaded multiple fastq files to your workspace, rather than entering information about each file individually, the portal will instead group files together by their directory path inside the Google storage bucket in your workspace. You will see a form for each group of files showing you the location where they were found and the number of files found there. You will be able to provide a description that will be used for all files found at that location.
In other words, if you had 3 directories at the locations /sample_a, /sample_b, and /sample_c, you would see forms for each location. NOTE: The portal currently only supports one level of directory nesting. Any subdirectories below the first will not be listed as separate entries.
If you have previously entered information about a file in your study that the portal cannot find a matching entry for in your workspace, you will see an 'orphaned' file form instead of the normal one. You can view these by clicking the 'Orphaned Study Files' header. Here, you will have the option of designating which new file in your workspace the old entry was supposed to point at. This is useful if you upload a new version of the file directly into the workspace and simply wish to update your entry in the portal to use that new version.
You will also have the option to 'reparse' the file (re-load the data from the file back into the portal) if the contents have changed. This will remove any of the old entries that were originally loaded from the previous file and replace them with the new ones from your new version. Once you click 'Sync', the entry will be updated (and re-loaded if necessary). You will see the normal confirmation dialog once that has completed.
After you have completed entering information about all of your files, you can review the current information for each file as it is now by clicking the 'Synced Data' header. You can also use these forms to make any edits to the files (like names, descriptions, etc.) if you wish.