Blake-Proj email
The long view (really long: get a cup of coffee) John Unsworth (jmu2m@virginia.edu) Wed, 22 Jan 1997 00:09:09 -0500 Boyz, First, some news flashes, then a couple of questions and a matter for your contemplation. [material omitted] OK, now we're past the news and the questions, and on to the matter for contemplation. This has to do with EBT presentation, the Blake DTD, and the dreaded concurrency problem. In fact, it comes down to a choice that you Eds. have to make, between the physical and the logical (thanks Daniel...). Here's the problem, in simplest form, and I realize that this expression of the problem doesn't correspond to your current use of the <plate> tag. <plate 1> <poem 1> </plate> <plate 2> </poem> <poem 2> </plate> The problem is that a logical structure--a poem--can overrun a plate division. The solution currently in force is to make <plate> an empty tag, which means it doesn't contain anything, it just functions as a kind of milestone (here's a plate beginning, here's another plate beginning). It looks like this: <plate 1> <poem 1> <plate 2> </poem> <poem 2> This solution begs the question of concurrency by making plate something that can be contained within the kind of div that is poem. In effect, what that means is that, in your markup, you have privileged the logical structure over the physical structure, the poem over the plate. Now, up to the point that we began to deal with EBT, the implications of this decision could be hidden from view by virtue of the fact that we were a) doing no searching on these structures, and b) writing custom perl programs--programs that really understood nothing about SGML structures--to convert this markup to HTML, and c) doing a certain amount of hand-made markup on top of the perl output, to make it look the way you wanted. With your plate-oriented display, for browsing, you were choosing to privilege the plate over the poem, or the physical structure over the logical one. In short, having your cake and eating it too. So, why not continue that way? Well, a couple of reasons. The handmade solution is not scalable--it can't apply across the whole archive, without lots of mucking around every time a new work presents new problems. Thel, for example, has only eight plates and can be treated one way, but Jerusalem can't be run through the same script; we'll need a new one. Why not just write those scripts for every work? Well, you still need to be able to search the whole damn thing, and as soon as you want to do that, you need a program (perl or EBT) to be able to navigate these SGML structures and present them in a programmatic, archive-wide way. So forget searching, let's just browse: still, it sucks: the problem doesn't go away, it just gets swept under the rug, at great expense of effort and sacrifice of utility. I want to stress that the problem is not raised, but rather uncovered, by EBT: although concurrence is theoretically allowable in SGML, nobody (to my knowledge) has ever written any software to search or browse SGML that could deal with it: it's a nice idea, but when you try to write code that deals with concurrence systematically, rather than ad hoc, the complexity of the problem is impossible. OK: back up and look at it this way. In order to make programmatic SGML software (EBT) present your material the way the perl-generated SGML appears, we have had to adjust the markup of the test copy of Thel to allow only one plate per div: this is to say, we have systematized the privileging of the physical over the logical. Might work OK with Thel, but what happens when you want to see a poem that contains 20 plates in a work of 100 plates, rather than seeing the whole work as a list of 100 plates? Well, in order to do that, you'll need to go the other way, and systematize the privileging of the logical over the physical: you'll need an SGML structure that looks something like this: <work> <poem 1> <plate 1> </plate> <plate 2> </plate> </poem> <poem 2> <plate 2> </plate> </poem> </work> This means that if a poem begins on plate one and crosses over to plate two, and a new poem begins on plate two, the information about plate two would need to be repeated, in its entirety, within poem 1 and within poem 2. If this were done, then you could see the logical structure, and the physical structure, something like this: Thel poem 1 plate 1 plate header plate transcription plate image plate 2 plate header plate transcription plate image poem 2 plate 2 plate header plate transcription plate image That's one choice, and it would entail a relatively major intervention into the current markup--nothing that couldn't be revised by a computer program. The point is that it's a major editorial decision, not that it's a major revision to existing markup. The latter we can handle; the former is a decision you have to make, on editorial principle. The other choice is to go further in the direction your display output has already gone, and privilege the physical over the logical. In that case, a work consists of plates, and all plates are equal in the hierarchy, and logical structures (poems) are subordinate to plates: Thel plate 1 poem 1 plate 2 poem 1 poem 2 So, it comes down to this. Is the plate more important than the poem, or is the poem more important than the plate? I have nothing invested in either answer, but it's time to fish or cut bait on this question. I realize that you didn't mean to finesse the issue--the decision was made, and then worked around, out of your sight and probably without having the issues explained. But please believe me: this is not an EBT problem; it is an SGML problem. It's time now to make this decision, once and for all, because if you don't make it, it will come back and bite you in the ass at some point, probably in the not very distant future. Tradeoffs: If you like, more or less, the current EBT display, then you want to privilege the plate over the poem. If you can't imagine looking at a longer, more complex work without being able to see the work divided into its constituent poems, rather than its constituent plates, then you want to systematically privilege the oogical structure over the physical structure. The thing to grasp here is that you have to choose: physical, or logical. Whichever you choose, we can represent; we mahy even be able to fudge a bit here and there. But whichever you choose, you will be faced with certain consequences, plus and minus, that attend that choice. And finally, there will come a point at which I say: nope, we can't do that, because you made this foundational choice. It's not, repeat not, EBT that's the problem: it's a logical and editorial problem. Have at.... John Unsworth / Director, IATH / Dept. of English ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ http://jefferson.village.virginia.edu/~jmu2m/
Next--> |