Vectorizing Historic Maps for Spatial Analysis
Generating Geospatial Data from Historic Maps: A Step-by-Step Guide
This project illustrates a core methodology supported by the Harvard Map Collection: transforming static historic maps into analyzable geospatial data. Through our partnership with Joyce Chaplin's 'Re-Wilding Harvard' course, this initiative empowered students to investigate the evolving histories and changing natures of open space within Harvard's campus and its surroundings, showcasing the library's critical role in digital scholarship.
Hannah Adler '25, a student from the course, undertook an ambitious project focusing on the Charles River region of Harvard's campus, particularly the area near Memorial Drive in Cambridge.
"I was interested in the Harvard Square adjacent area near the river, because today we think of the river walkways and Memorial Drive, especially when it’s closed as a really valuable open space. I also had seen an image in HOLLIS that conveys a certain set of aesthetic ideals with how people were imagining the riverside could be.
Drawing on her foundational GIS coursework, Adler sought expert guidance from the Harvard Map Collection to incorporate advanced spatial analysis techniques into her historical inquiry. Through consultation, she aimed to investigate key questions such as:
- What did the development process look like in this area? How rapidly did this region change?
- What was Harvard's role in developing this area?
Hannah partnered with GIS Librarian Belle Lipton in a collaborative consultation to model and refine the spatial data approaches essential for addressing these questions.
Finding Map Sources
As part of this collaborative process, GIS Librarian Belle Lipton recommended Atlascope, a powerful online tool which includes fire insurance maps held by the Harvard Map Collection, which integrates over 100 historical atlases of Boston and Cambridge. Atlascope serves as a dynamic GIS viewer, enabling researchers to seamlessly compare temporal layers as live geospatial data. This resource was instrumental in revealing the precise historical changes Adler sought to investigate, providing direct access to GIS layers for her project.
Vectorizing: Transforming Historic Maps into Analyzable Data
Following the identification of key temporal snapshots (1873, 1903, 1916, and 1930) and the acquisition of their corresponding GIS layers via Atlascope, the next step involved transforming these historical map images into structured, analyzable polygon data. This meticulous process, known as vectorization, entailed digitally tracing the boundaries of individual land parcels and annotating them with relevant attributes, thereby enabling the quantitative measurement of spatial and temporal changes. This transformation turns visual information into queryable data, unlocking new research insights.
Attribute Information: Adding Meaning to Spatial Features

Orange indicates parcels that are undeveloped in 1873, meaning they have no structures on them. Purple are developed parcels, or parcels with structures on them.
Upon generating the foundational shape data for each parcel, Adler enriched the GIS table by adding new columns, or attributes, capturing specific variables derived from the historical maps for analysis:
- Development Status: A binary value indicating whether a parcel had a structure (
true) or was undeveloped (false) in 1873. - Land Ownership: The name of the landowner in 1873, extracted directly from the historical maps.

The maps highlighted in yellow indicate parcels owned by Harvard in 1873.
Adler rigorously repeated this vectorization and attribute encoding process for each selected temporal snapshot, building an historical geospatial dataset suitable for longitudinal analysis and comparison.
1873
| Development | Harvard-ownership |
|---|---|
![]() |
![]() |
1903
| Development | Harvard-ownership |
|---|---|
![]() |
![]() |
1916
| Development | Harvard-ownership |
|---|---|
![]() |
![]() |
1930
| Development | Harvard-ownership |
|---|---|
![]() |
![]() |
Area Analysis
Leveraging the quantitative power of GIS, Adler then performed precise area calculations to determine the square meterage of each parcel type. Because these vectorized shapes correspond to real-world locations and are properly projected into standard coordinate reference systems, GIS tools accurately report the true ground area they represent, enabling quantitative analysis from historical sources.
Adler used the coordinate reference system Massachusetts State Plane or EPSG: 26986, which is the same coordinate reference system Massachusetts uses to store GIS data in MassGIS, the state's open geodata portal. This generates area calculations in the unit of square meters.
Adler exported the area calculations to tabular formats (.csv), so she could analyze them further using other software, such as R. Her findings are presented in the following table.

While the total area developed (parcels with structures on them) increases only a small amount between 1873 and 1930, the area owned by Harvard increases dramatically during this period. Adler reported that there were many other variables included in the historic atlases one could choose to encode and then apply this same methodology to, whereby one calculates area based on a subsetted attribute filter of the traced parcels.
The complete project data, curated and structured for reuse, is publicly available for download through the Harvard Geospatial Library (HGL), contributing to a community of shared, analyzable datasets and methodologies for geospatial humanities scholarship.
- Select
https://doi.org/10.17605/OSF.IO/C9GV3. Filesacross the top menu bar.Download this folder.
To follow these steps on your own, use our QGIS: Vectorizing from Historic Maps Tutorial.





