Pre-PhD Fellow: Tessa Wijckmans MA
Student Assistant: Wouter van Elburg
Supervisors: prof. dr. Lia van Gemert, prof. dr. Rens Bod
This project aims to test the attainability of the creation of a tool that can provide Dutch prose texts from the 17th century with an orthographic layer of Modern Dutch. Historical Dutch texts are characterized by its unstable orthographies, which troubles automatic processing and analyzing of the texts. Orthographical normalization might be a solution here. It can serve as a preprocessing step for more advanced text processing, like lemmatizing and part of speech tagging. It will also enable analyzing and comparing stylistic, semantic and syntactic characteristics of text from various time periods. This project will build further on the results of Hupkes’ project to improve part of speech tagging of historical Dutch texts.