{"id":1623,"date":"2018-06-13T08:16:08","date_gmt":"2018-06-13T08:16:08","guid":{"rendered":"http:\/\/www.lancaster.ac.uk\/digging-ecm\/?p=1623"},"modified":"2018-06-13T11:39:18","modified_gmt":"2018-06-13T11:39:18","slug":"location-location-location","status":"publish","type":"post","link":"https:\/\/www.lancaster.ac.uk\/digging-ecm\/2018\/06\/location-location-location\/","title":{"rendered":"Location, Location, Location"},"content":{"rendered":"
<\/div>
<\/div><\/div>
<\/div>

Following our last post, Extracting and Creating Data from the Geographic Relations of New Spain<\/a>, in which I mentioned the problems we face in automatized identification of place-names, I thought it would be worthwhile to take a look at the toponyms we are working with, and why using computational approaches will allow us to further our understanding of the Relaciones Geogr\u00e1ficas.<\/em><\/p>\n

One of our first, and ongoing, challenges with this project is the identification of thousands of place-names across Mesoamerica. The source materials for the gazetteer we are currently compiling include:<\/p>\n

Rene Acu\u00f1a\u2019s Relaciones Geogr\u00e1ficas del Siglo XVI<\/em><\/a><\/p>\n

Mercedes de la Garza\u2019s Relaciones Hist\u00f3rico-geogr\u00e1ficas de la Gobernaci\u00f3n de Yucat\u00e1n<\/em><\/a><\/p>\n

Alejandra Moreno Toscano\u2019s Geografia econ\u00f3mica de Mexico<\/em><\/a><\/p>\n

Francisco del Paso y Troncoso\u2019s Suma de visitas de pueblos de Nueva Espa\u00f1a<\/em><\/a><\/p>\n

Peter Gerhard\u2019s A Guide to the Historical Geography of New Spain<\/em><\/a> and The Southeast Frontier of New Spain<\/em><\/a><\/p>\n

Our first task was cleaning and converting each of these sources into a computer-readable format, allowing us to extract data more easily. OCR was (sometimes) our friend for this part of the process. We were then able to extract all the place-names listed in the indexes of these works (correcting OCR mistakes along the way), leaving us with a list of almost 14,500 toponyms. Of course, many of these are duplicates or alternate spellings of the same place. We are currently disambiguating these place-names to ensure we are referring to the correct location. (I described this process in our Historical GIS<\/a> post if you\u2019d like a little more detail about this.)<\/p>\n

The wordcloud below was created from the full list of toponyms listed in Rene Acu\u00f1a\u2019s editions of the Relaciones Geogr\u00e1ficas, excluding <\/em>alternate spellings for the same place. If I had included the alternate spellings, the list would have been over 6,200 names. As it was, I inputted a list of around 4,900 toponyms.<\/p>\n<\/div>

<\/div>
<\/div><\/div>
<\/div>
\"wordcloud<\/span><\/div><\/div>
<\/div>
<\/div><\/div>
<\/div>

The influence of the Spanish language is clear, though not surprising, with names of saints featuring prominently alongside common descriptors such as R\u00edo, Valle and Laguna. However, indigenous toponyms remain prevalent, with frequent mentions of specific locations such as Ac\u00e1mbaro, Tlaxcala and Ixtlahuacan. Yucu, a Mixtec word meaning \u2018hill\u2019, appears 33 times, no less frequently than Valle. The occurrence of Yucu in this source material was also exclusively within the region of Antequera (currently Oaxaca), explained by the region being home to the convergence of numerous mountain chains, known as the Complejo Oaxaque\u00f1o (Oaxaca Complex).<\/p>\n

Disambiguating the thousands of place-names which are mentioned in the Relaciones Geogr\u00e1ficas<\/em> will allow us to effectively interact with the source material using computational methods. Using techniques such as Collocation Analysis in conjunction with our gazetteer will open up opportunities for analysing the text in innovative ways, such as identifying associations between locations, entities, topics etc. For example, it should be possible to search for Tlacotepec and determine whether this place has any relationship to another place, person, or concept. Furthermore, it will be possible to search for the specific Tlacotepec which you may be interested in, and any associated alternate names\/spellings for that particular place. As the map below demonstrates, place-names are often repeated across, and within, regions. This is why disambiguating our corpus is so important!<\/p>\n<\/div>

<\/div>
<\/div><\/div>
<\/div>
\"Map<\/span><\/div><\/div>
<\/div>
<\/div>
<\/div>

At present, we have a total of 3,650 fully disambiguated place names \u2013 meaning that we have definite coordinates assigned to these names. You can see a sample of some of these locations on the Corpus and Datasets<\/a> tab of our website.<\/p>\n

We have a fair few more toponyms which are partially located (i.e. we have identified the region in which they lie), and thousands more awaiting disambiguation. We\u2019re approaching the halfway point\u2026just over the next yucu!<\/p>\n<\/div>

<\/div><\/div><\/div><\/div><\/div>\n","protected":false},"excerpt":{"rendered":"","protected":false},"author":7,"featured_media":1648,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_acf_changed":false,"footnotes":""},"categories":[82,80],"tags":[97,99,95,96,86,91,101],"class_list":["post-1623","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-datasets","category-research","tag-16th-century","tag-colonial","tag-geographic-relations-of-new-spain","tag-mexico","tag-research","tag-spatial-humanities","tag-toponyms"],"acf":[],"_links":{"self":[{"href":"https:\/\/www.lancaster.ac.uk\/digging-ecm\/wp-json\/wp\/v2\/posts\/1623","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.lancaster.ac.uk\/digging-ecm\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.lancaster.ac.uk\/digging-ecm\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/digging-ecm\/wp-json\/wp\/v2\/users\/7"}],"replies":[{"embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/digging-ecm\/wp-json\/wp\/v2\/comments?post=1623"}],"version-history":[{"count":8,"href":"https:\/\/www.lancaster.ac.uk\/digging-ecm\/wp-json\/wp\/v2\/posts\/1623\/revisions"}],"predecessor-version":[{"id":1652,"href":"https:\/\/www.lancaster.ac.uk\/digging-ecm\/wp-json\/wp\/v2\/posts\/1623\/revisions\/1652"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/digging-ecm\/wp-json\/wp\/v2\/media\/1648"}],"wp:attachment":[{"href":"https:\/\/www.lancaster.ac.uk\/digging-ecm\/wp-json\/wp\/v2\/media?parent=1623"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/digging-ecm\/wp-json\/wp\/v2\/categories?post=1623"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.lancaster.ac.uk\/digging-ecm\/wp-json\/wp\/v2\/tags?post=1623"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}