A library for dividing Latin poetry into component syllables for quantitative analysis. Testing of each function and error analysis can be viewed in the jupyter notebook latinsyllabifier.ipynb.
If you would like to run the jupyter notebook yourself, try this Binder link: https://mybinder.org/v2/gh/chenmasterandrew/latinsyllabifier/master?filepath=latinsyllabifier.ipynb
The Latin source texts for this projects can be found at https://github.com/tesserae/tesserae.
This project is part of a paper on quantifying the phonetic difference between Virgil's Aeneid and it's translations for QSS 30.12 written in collaboration with Dylan Whang. A digital version of this paper can be found here: https://docs.google.com/document/d/1KECKDe_43vN8M1XoZKLlk_22Qq_ChuoQZIVAKWEiWt0/edit?usp=sharing
The syllabifier for English translations of Latin poetry used in this paper can be found here: https://github.com/dmwhang/eng-latin-syllabifier
Syllabifies a Latin passage line-by-line through:
- Dividing the passage into lines (liner function)
- Dividing each line into words, removing extra whitespace, numerals, punctuation, and tesserae file metadata (worder function)
- Dividing each word into component consonant and vowel sounds taking into account pairs of letters e.g. 'qu', 'ae', elision, and prodelision (sounder function)
- Groups the consonant sounds with vowel sounds based on syllabification rules (line_syllabifier function)
- Syllabifies whole passages (syllabifier function) as a list lists of syllables of each line
- Syllabifies passages with line numbers (syllabifier_line_numbers function) as a list of tuples (#, [syllables])