What's the difference?: Textual analysis of variations in technical document structure

Abstract

The aim of this project is to analyze text variations in technical documents, provided by John Deere, to better understand how the documents are structured. Example of variations include synonymous titles and differences in content structure. The goals are to report the amount of deviations by the document texts from the mean, describe the anatomy of the documents, and identify commonalities between documents. This supports more structured standards to be drafted, allowing the data inside the documents to be more manageable. Python is the languages of choice for this project.Ope

    Similar works