Gibson D. Lewis Library Libguides

HSC Fort Worth Data Repository

Submissions Checklist

Before you submit your data to the HSC Fort Worth Data Repository, carefully review and answer the questions in the following checklist:

  • Preparing Data for Submission
  • Preparing Data Documentation
  • Data Permissions and Rights
  • Is Your Data Right for the HSC Fort Worth Data Repository?
  • Data Preparation
  • Data Documentation
  • Deposit Rights
  • Sharing Permissions
  • Licensing

Preparing Data for Submission

Group Files into Meaningful Datasets

Decide how you want to structure your data. A submission should consist of a set of explicitly labeled files (up to 4 GB per file) that together make up a complete dataset. If your submission contains data from multiple, unrelated projects, consider grouping your items into separate submissions.
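As a quick pre-submission check of the per-file size limit described above, a short script can list any files that are too large. This is a minimal sketch, not a repository tool: the function name oversized_files is hypothetical, and it assumes that "4 GB" means 4 * 1024**3 bytes, which the repository may count differently.

```python
from pathlib import Path

# Assumption: "4 GB" is read as 4 * 1024**3 bytes; the repository may count differently.
MAX_FILE_BYTES = 4 * 1024**3


def oversized_files(dataset_dir, limit=MAX_FILE_BYTES):
    """Return (relative path, size in bytes) for every file larger than limit."""
    root = Path(dataset_dir)
    too_big = []
    for path in sorted(root.rglob("*")):
        if path.is_file():
            size = path.stat().st_size
            if size > limit:
                too_big.append((str(path.relative_to(root)), size))
    return too_big
```

Calling oversized_files("my_dataset") on a local folder returns an empty list when every file is within the limit.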

More repository policies: HSC Fort Worth Data Repository policies

Ensure Future Usability

To help others open and use your data files in the future, please make sure that your files are in appropriate, open formats that support long-term preservation. If your data depends on proprietary software formats, options for preserving the data long-term may be limited.
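One way to spot files that may depend on specific software is to scan file extensions before you submit. The sketch below is illustrative only: the extension list and the suggested alternatives are assumptions chosen for the example, not a list published by the repository, and the function name suggest_open_formats is hypothetical.

```python
from pathlib import Path

# Hypothetical, non-exhaustive mapping from software-specific extensions
# to open formats that are commonly used instead.
OPEN_ALTERNATIVES = {
    ".xls": ".csv",
    ".sav": ".csv",
    ".dta": ".csv",
    ".doc": ".txt or .pdf",
}


def suggest_open_formats(filenames):
    """Map each flagged filename to a suggested open format."""
    suggestions = {}
    for name in filenames:
        ext = Path(name).suffix.lower()
        if ext in OPEN_ALTERNATIVES:
            suggestions[name] = OPEN_ALTERNATIVES[ext]
    return suggestions
```

Files that are not flagged are not guaranteed to be open formats; the check only catches the extensions listed in the mapping.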

Preparing Data Documentation

Include Documentation Describing Your Data

Collect any documentation that explains what data is included in your dataset and how it is structured. Some examples of what to include in your documentation are:

  • Descriptions of any acronyms or abbreviations used (e.g., column headings, variable names, etc.)
  • The methodology used to collect and analyze the data
  • Citations to journal articles based on the data
  • Explanation of file-naming conventions
  • The names and contact information of any contributors
  • Descriptions of what is found in each file

This document should describe what data is included in your dataset and any special instructions for understanding your data files. It should give context to your data and ensure that future users of your data will be able to easily understand what is included.
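If you prefer to start from a blank outline, the documentation items listed above can be turned into a plain-text README skeleton. The sketch below is one possible layout under stated assumptions: the section titles paraphrase the list above, and the function name readme_skeleton and the TODO placeholders are hypothetical, not part of any repository template.

```python
# Section titles paraphrase the documentation items listed above.
README_SECTIONS = [
    "Acronyms and abbreviations",
    "Methodology",
    "Related publications",
    "File-naming conventions",
    "Contributors and contact information",
    "File descriptions",
]


def readme_skeleton(title, filenames):
    """Build a plain-text README outline with one placeholder per section."""
    lines = [title, "=" * len(title), ""]
    for section in README_SECTIONS:
        lines.append(section)
        lines.append("-" * len(section))
        if section == "File descriptions":
            # One placeholder line per data file in the submission.
            for name in filenames:
                lines.append(f"{name}: TODO describe contents")
        else:
            lines.append("TODO")
        lines.append("")
    return "\n".join(lines)
```

Writing the returned text to a file named README.txt gives you an outline to fill in by hand.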

Not sure where to start? Try using the README Template provided by Cornell University.

Data Permissions and Rights

Ensure that You Have the Right to Share the Data

Make sure that you have all necessary rights to deposit the data into the HSC Fort Worth Data Repository. If other individuals maintain rights to the data, you must obtain permission from them to deposit your dataset.

De-Identify Any Personally Identifiable Information 

Ensure that you have removed any data that could be used to identify subjects of your research.

Consent must specify the type and identifiability of the data to be shared.

Consent must specify that the sharing will be Open Access (allows anyone to access and use the dataset).
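As a first pass before a manual review, you can scan column headers for names that often indicate identifying information. This sketch is not a de-identification method and does not replace the policies referenced here: the hint list is a hypothetical, non-exhaustive assumption, the function name flag_identifier_columns is made up for the example, and substring matching can produce false positives (for example, a "filename" column matches "name").

```python
import csv
import io

# Hypothetical, non-exhaustive hints; adjust for your own data.
IDENTIFIER_HINTS = ("name", "email", "phone", "address", "dob", "birth", "ssn", "mrn")


def flag_identifier_columns(csv_text):
    """Return header names that contain any of the identifier hints."""
    reader = csv.reader(io.StringIO(csv_text))
    header = next(reader, [])  # empty input yields an empty header
    return [
        col for col in header
        if any(hint in col.lower() for hint in IDENTIFIER_HINTS)
    ]
```

A header row such as "subject_id,Email,age,Date_of_Birth" would flag the Email and Date_of_Birth columns for review. Identifying details can also appear inside cell values, which this check does not inspect.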

For more information, go to: HSC Protection of Human Research Subjects Policies and Procedures

Use an Open License to Share Your Data

An open license allows others to reuse your data. The HSC Fort Worth Data Repository defaults to CC0 (Public Domain). Two other possible options are CC BY (Attribution) and CC BY-NC (Attribution-NonCommercial).

To discuss more restrictive licensing options, please contact the library at datarepository@unthsc.edu.

For more detailed information on Creative Commons licensing, please refer to the Creative Commons guide.  

Is Your Data Right for the HSC Fort Worth Data Repository?

Does Your Dataset Meet the Requirements for Submission?

  • Authored by at least one HSC researcher
  • Does NOT contain any private, confidential, or other legally protected information
  • Ready for public access and reuse

Data Preparation

Plan Your Organization of Files and Datasets

  • Would it make sense to break your data into multiple submissions?
  • Are your data files grouped in a meaningful way?
  • Is your data labeled consistently (e.g., data headers, file naming, etc.)?
  • Have you avoided using proprietary software wherever possible?

Data Documentation

Do You Have Documentation for Your Data? 

  • If not, have you prepared a README file to describe the dataset?
  • Are all acronyms/abbreviations spelled out in the documentation?
  • Is your data collection methodology included in the documentation?
  • Would someone else be able to understand your dataset using the documentation?

Deposit Rights

Do You Have All the Necessary Copyright Permissions to Make the Data Publicly Available?

  • Have all collaborators, advisors, or other interested parties agreed on sharing the data publicly in the HSC Fort Worth Data Repository?
  • Are you aware of the rights you are granting the Dataverse community by depositing your data?

Sharing Permissions

Have You Considered the Questions around Sharing?

  • Do you have any specific data sharing requirements (e.g., from funding agencies)?
  • Is the data anonymized to protect any personally identifiable information?
  • Do you wish to manage access to your data (e.g., place an embargo)?
  • Have you made note of any special software that would be required to access your data?

Watch this video from Texas Data Repository on how to restrict access to certain files as a means of managing permissions.

Licensing

  • Have you considered applying an open license to your dataset? A CC0 license is applied to all uploaded datasets by default. 
  • Have you considered if a different license would work better?
  • What constraints, if any, would you like to add to the license (e.g., non-commercial use only, attribution required, etc.)?

Questions?

For further information on using the HSC Fort Worth Data Repository, please contact us at DataRepository@unthsc.edu.

For questions about the Data Management and Sharing (DMS) Policy, please contact elizabeth.speer@unthsc.edu.