Replaced 'break' with 'beginning' for page|line|column|gathering in Guidelines and Specs #2634
base: dev
@sydb Might as well have a look at this while you're at it?
The majority of my suggestions for change, here, are underpinned by my strong distaste for using the noun “beginning” unless it is being used to gloss an element itself or very specifically about the beginning of a feature. Others may disagree.
@@ -426,7 +426,7 @@ dates, and predefined value lists.</note></p></div>
individual characters, the overall typesetting process also
follows specific rules of how to calculate the distance between
characters, how much whitespace occurs between words, at which
points line breaks might occur and so forth. </p>
This use of the term has nothing to do with our <lb> element, and perhaps should just be left as “break”.
How would you feel about rewriting the above to drop "break" and avoid repeating "occurs" and "occur":
"... at which points new lines might begin and so forth"
@@ -329,7 +329,7 @@ problem for text encoders. Suppose, for example, that we wish to
investigate a diachronic English corpus for occurrences of
<mentioned>tea-pot</mentioned> and <mentioned>teapot</mentioned>, to
find evidence for the point at which this compound becomes
lexicalized. Any case where the word is hyphenated across a linebreak,
This use of the term has nothing to do with our <lb> element, and perhaps should just be left as “line break”. (Note the space: 18 of the 25 occurrences of "line.?break" in the Guidelines have a space; one has a hyphen, and six have the single-word version.)
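A tally like the one above could be reproduced with a short script. What follows is only a minimal sketch, not the method actually used for the count: the function name `tally_variants` and the sample string are hypothetical, and only the pattern "line.?break" comes from the comment. Because `.` does not match a newline by default, an occurrence split across two source lines would not be counted by this version.

```python
import re
from collections import Counter

# Pattern from the review comment: "line", at most one character, then "break".
# Matching is case-insensitive so that sentence-initial "Line break" is included.
PATTERN = re.compile(r"line(.?)break", re.IGNORECASE)


def tally_variants(text: str) -> Counter:
    """Count the spellings of "line break" by the separator used (hypothetical helper)."""
    counts = Counter()
    for match in PATTERN.finditer(text):
        separator = match.group(1)
        if separator == "":
            counts["single word"] += 1
        elif separator == " ":
            counts["space"] += 1
        elif separator == "-":
            counts["hyphen"] += 1
        else:
            counts["other"] += 1
    return counts


# Illustrative input only; the real count would run over the Guidelines source files.
sample = "A line break here, a linebreak there, and one line-break."
print(tally_variants(sample))
```

Running this over the concatenated text of the Guidelines files, rather than the sample string, would give the per-variant totals needed to decide which spelling to standardize on.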
@@ -381,11 +381,11 @@ whitespace. </p>

<p> The <gi>lb</gi>, <gi>pb</gi>, and <gi>cb</gi> elements are notable
exceptions to this general rule, since their function is precisely to
represent (or replace) line, page, or column breaks, which, as noted
represent (or replace) line, page, or column beginnings, which, as noted
above, are generally considered to be equivalent to whitespace. These
elements provide a more reliable way of preserving the lineation,
pagination, etc of a source document, since the encoder should not
Missing period after “etc”. (I know that has nothing to do with the changes @trishaoconnor is making, but since I noticed it …)
@@ -3492,11 +3492,11 @@ section <ptr target="#CORS6"/> and in section <ptr target="#SACR"/>.
<p>When a text has no pre-existing associated reference system of any
kind, these Guidelines recommend as a minimum that at least the page
boundaries of the source text be marked using one of the methods
outlined in this section. Retaining page breaks in the markup is also
outlined in this section. Retaining page beginnings in the markup is also
I would use “boundaries” as opposed to either “breaks” or “beginnings”.
recommended for texts which have a detailed reference system of their
own. Line breaks in prose texts may be, but need not be, tagged.<note place="bottom">Many encoders find it convenient to retain the line
I would leave as “break”.
@@ -1250,11 +1250,11 @@ precinct at Dodona. (L.H. Jeffery Archive)</head>
is reversed, and so is their individual orientation (in fact, we see them
<soCalled>from the back</soCalled>, as it were). <gi>seg</gi> elements
have been used here because these are clearly not <soCalled>lines</soCalled>
in the sense of poetic lines; the text is continuous prose, and linebreaks
in the sense of poetic lines; the text is continuous prose, and line beginnings
Perhaps “… continuous prose, and division into separate lines is incidental.” or some such?
Updated to “… continuous prose, and the division into separate lines is incidental.”
are incidental.</p>

<p>There are obviously some unsatisfactory aspects of this manner of encoding
boustrophedon. In the inscription above, some words run across linebreaks,
boustrophedon. In the inscription above, some words run across line beginnings,
I think “some words are split across two lines,” would be better.
Agreed. I've applied the above change.
@@ -46,7 +46,7 @@ of any adjacent whitespace</desc>
</valList>
<exemplum xml:lang="en">
<p>In the following lines from the <title level="a">Dream of the Rood</title>,
linebreaks occur in the middle of the words <mentioned>lāðost</mentioned>
line beginnings occur in the middle of the words <mentioned>lāðost</mentioned>
I think something like
<p>In the following lines from the <title level="a">Dream of the Rood</title>,
the words <mentioned>lāðost</mentioned> and <mentioned>reord-berendum</mentioned>
each start on one line and continue on the next.</p>
would be better.
Thank you very much, I've updated the file with your recommended changes.
@@ -100,7 +100,7 @@
</paramList>
</valItem>
<valItem ident="break">
<desc versionDate="2015-08-21" xml:lang="en">create a line, column, or page break
IMO, this should be left as “break”.
@@ -72,7 +72,7 @@
</valItem>
<valItem ident="type">
<desc versionDate="2017-05-11" xml:lang="en">a parameter of this name can be used to
categorize the specified behaviour in any way; for example the kind of break (when used
I think this should remain “break”.
@trishaoconnor Just letting you know that I've cherry-picked a single-character commit fix from dev into this branch to make sure it didn't cause any issues. It was a typo in the boustrophedon section of WD-NonStandardCharacters.xml which just needed an added s, found by @sydb. It doesn't affect any of the other stuff in your branch.
Thank you very much @sydb for your thorough review and thank you too @martindholmes for fixing that typo and merging it to the PR. I appreciate your point @sydb and thank you very much for highlighting the passages where "beginning" isn't an appropriate substitution for "break". I confess that I still have a preference for using "beginning" instead of "break" and would be in favour of rewriting these passages along similar lines to the example that you supplied above:
Of course, if the general consensus is to revert to "break" instead, I'll happily do so.
Replaced 'break' with 'beginning' for page|line|column|gathering in the following files from the Guidelines and Specs:
There were no mentions of 'break' in the prose of <gb>, but I deleted a superfluous space in the prose.
The requested changes to <lb> are addressed in PR #2633.