Guide to Creating README-Style Metadata for GIS Data

Jan 18, 2023 · 4 mins read

Effective data documentation (metadata) is crucial for the long-term usability and understanding of your GIS data, whether you're sharing it publicly or revisiting it years later. While creating formal metadata can seem daunting, this guide offers a flexible, prompt-based approach to help you document the context of your data.

The goal is to provide a free-form, permissive way to begin documenting the flow of ideas around your data, without the pressure of strict templates or getting it "wrong." This allows you to start documenting your work now, preventing the loss of crucial context that often occurs when formal requirements feel overwhelming. These prompts are designed as thought-starters; choose to answer as much or as little as you find helpful.

Completing this guide can serve as an excellent starting point for a deeper consultation with our team, or simply as a valuable personal record for your project.

Prompts for Creating Your README-Style Metadata:

  • Originator(s): The name(s) of the organization(s) or individual(s) who developed the dataset.
  • Publication date: The date when the dataset is published or otherwise made available for release.
  • Calendar date(s) of the content: The date, dates, or date range describing the content of the dataset.
  • Title: A clear and descriptive title for the dataset.
  • Geospatial data presentation form: The mode in which the geospatial data are represented (e.g., "vector digital data," "raster digital data").
  • Abstract: A brief narrative description of the dataset, summarizing its content and purpose.
  • Motivations: The reason why the dataset was developed or intentions for its use. If you created the data yourself, or augmented existing data, how does the work add value to the current data landscape? (i.e., what are the project's successes?)
  • Data sources: If you obtained and edited data from elsewhere, the source of the original datasets and date accessed.
  • Body of work: Is this data related to any larger bodies of work or projects which could help explain the methods or content? Please include links to any relevant citations.
  • Process steps and methods: A sequential explanation of the step-by-step process for creating or editing the data.
  • Challenges: Difficult aspects of working with the data, or words-to-the-wise for someone else endeavoring to work with it.
  • Key highlights ("pride points"): If you were showing this dataset to someone for the first time, which aspects would you draw attention to, and why? Which columns do you think are the most important and valuable?
  • Relationships: If there are multiple datasets, processing scripts, or other resources in the project, describe their filenames and how the materials relate to or were used with one another.
  • Data integrity: If values are missing, or fields are blank, instructions for future users for making sense of possible reasons why. You can also provide any guidance on how you were thinking about "accuracy" for this project.
  • Codebooks: If you cleaned data, deleted, renamed, mathematically calculated or inferred fields from other fields, the filename and link to a plain-text codebook with field name definitions, including units of measurement.
  • Taxonomies: If you used any taxonomic authorities, provide the reference and any modifications.
  • Completion and maintenance: Acknowledging that projects are seldom "finished" or "perfect," what, in your estimation, marks this submission as "complete"? Speak a little about the data's current status, and anticipated maintenance frequency (e.g., "annually," "never").
  • Point of contact: Contact information for an individual or organization that is knowledgeable about the dataset (person/organization, e-mail, etc.).
  • Access constraints: Restrictions on accessing the data.
  • Use constraints: Restrictions on the use of the data (e.g., "For educational non-commercial use only. This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License, CC BY-NC-SA 4.0 DEED (https://creativecommons.org/licenses/by-nc-sa/4.0/)").

Next Steps & More Resources:

To formalize your metadata or for in-depth consultation on your GIS data project, schedule a guided interview with our team through GIS Data Management Consultation Services Guide.


This site is a project of the Harvard Map Collection. Visit our website to learn more.