-
-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Is there a way to include Psalm superscriptions in a UsfmFileTextCorpus that uses English versification? #101
Comments
Here is the USFM from the link provided:
As per the USFM documentation, superscripts are designated by \sup, which does not appear in the text. Are you referring to footnotes or something else? Currently only the primary text (not headers, footnotes etc.) are extracted; everything else is discarded when processing the USFM. |
Superscription might be the wrong word. I'm referring to the text that precedes the main body of a Psalm which in the English versification isn't assigned a verse number but in other schemes it is. For example:
In many USFM or USX files this kind of text is part of something other than a |
Yes - we call that "non Scripture portions" and have an issue open in SILNLP here. We are planning on implementing this over the next few months (no promised timeline). We (that is @ddaspit) still need to work out exactly how we will have unique labels to reference these text before and around scripture. |
As @johnml1135 said, we don't currently support extracting non-verse text using the corpus classes in Machine. We are planning on adding support for this. In the meantime, Machine does provide an underlying USFM parser that you could use to implement custom text extraction. Here is a short tutorial on how to use the parser classes. |
Sounds good. Thank you for the example on how to customize the parser, this is helpful. |
When I load an English Bible into a
UsfmFileTextCorpus
and then look at the Psalms, I notice that the superscriptions are not included in the corpus. For example, using the Berean Standard Bible and versification from Berean Bible: Free Downloads, when printing out Psalm 3 there is no superscription.What printed:
PSA 3:1 - O LORD, how my foes have increased! How many rise up against me!
PSA 3:2 - Many say of me, “God will not deliver him.” Selah
PSA 3:3 - But You, O LORD, are a shield around me, my glory, and the One who lifts my head.
PSA 3:4 - To the LORD I cry aloud, and He answers me from His holy mountain. Selah
PSA 3:5 - I lie down and sleep; I wake again, for the LORD sustains me.
PSA 3:6 - I will not fear the myriads set against me on every side.
PSA 3:7 - Arise, O LORD! Save me, O my God! Strike all my enemies on the jaw; break the teeth of the wicked.
PSA 3:8 - Salvation belongs to the LORD; may Your blessing be on Your people. Selah
Is there a way to access or include Psalm superscriptions in a corpus that uses English versification when using machine?
The text was updated successfully, but these errors were encountered: