verma wrote:I have no clue about how this is coded now in Mellel, but it seems to me that one could perhaps try another strategy, instead of using another (new) "level".
Sounds o.k, I only haven’t found the hint to those strategy in your post.
verma wrote:
1. Spelling. For me, spelling is just a tag, in HTML speak <language="English">Well, you see?</language="English">. […] Combining parts of words in different languages (Mart°n's previous example of combined tags as in <english>atta<german>ch</german>ment</english>, would be technically fine iff <german>ch</german> had been marked as such; and in <english>atta<german>ch</english> we would have a tag conflict, and hence would be marked as incorrect. Means: user intervention required. Seems fair to me.
My first thoughts were the same, but I think this doesn’t work in a word processor like Mellel. If you edit your text with an HTML (text) editor or work with
LATEX the language markup is an easy thing and would work perfectly the way you’ve described. Working in an
WYSIWYG editor is quite different as you couldn’t see the tags at all and the word processor is the one that needs to set them correctly. One (not very likely) example:
You write (or copy&paste) the sentence:
John has bought a new multimedia device, he likes to use for a
specific purpose.
Later you decide to move the word “device” to the end (selection in blue):
John has bought a new multimedia
device, he likes to use for a
specific purpose.
which leads to:
John has bought a new multimedia, he likes to use for a
specific purpose device.
Now you like to delete some words you don’t need any longer (selection in blue):
John has bought a new multi
media, he likes to use for a specific_ purpose device.
which leads to:
John has bought a new multi
purpose device.
If you try this in Mellel, you’ll notice that the bold tags <bold></bold> used in this example were automatically corrected so that a case like
<english>atta<german>ch</english> couldn’t happen in Mellel. If you create a example as the one above only with two languages and replace the <bold> tag by a language tag, the result would be a bilingual word. As the user couldn’t instantly
see this word (as only the language but not the appearance is different on the first and the second half of this word), it’s likely that he has created such a monster without knowing or noticing it. When writing technical or computer related stuff, it could be easy to end up with such words, as for example with German and English, some of those words could be spelled in both languages (as they are adopted from the English version) or parts of a word could be spelled in both languages.
If a language is attached to a character style, I guess it would behave exactly the same and this behavior doesn’t prevent examples like the above one. A case where a closing tag would be missed and user interaction is required to solve the problem would not occur here. That’s why I’ve requested the “word” limit.
verma wrote:
Or if one copy/pastes a word from a larger portion of text (where the whole paragraph or section is marked for a specific language), that copied word will not be marked for language until it's inserted into another language-defined area, or explicitly marked for a specific language.
Why should a marked word lose it’s language because you’ve copied/pasted it? If you copy/paste a styled word (bold/font/size) the style also would be preserved. The “auto-clean” behavior is one of the things I hated most when I had to deal with PowerPoint some time ago.
If you use Edit › Copy Special › Plain Text it would be o.k. to delete the language tag too, but otherwise I would like to preserve it.
verma wrote:
It should however be possible to apply a language setting to whatever portion of a text/word one would like, much in the same sense as we can do that with boldface or italics, and to attach the Language Tag to specific paragraph styles as well.
Usually I would agree but as the main goal of this feature request is
multilingual spell checking and the added wagon could be called
multilingual hypenation the option to apply a language to whatever portion of text would knock out (or, even worse, lead to wrong results) both features as multilingual words couldn’t be spell checked nor hyphenated (let alone correctly).
The tricky thing about this is, that you couldn’t see the language (only if you select some crazy colors for your different language styles) as a language tag is a invisible thing in opposite to the mentioned
boldface or
italic One could workaround this by either using the color example from above or by a search/replace action, that would find multilingual words, so that you could correct them.
But the question I’ve asked myself is why should one use a workaround for something that could be prevented from the beginning. Considering that there is no use of creating multilingual words regarding spell checking and hyphenation, my personal answer were “word” limits that would prevent bilingual words.
One possible use of this was brought up by rpcameron:
rpcameron wrote:Also, just because you use Mellel for one type of work, it's very possible that other users will find it useful to break their words down into their constituent morphemes, and mark those morphemes for their language of origin. (I personally would not be opposed to this use. And, I'm pedantic enough that if I am meta–marking my document for language, then each portion ought to be correctly assigned, even if only for completeness' sake.)
but while it may be o.k. if you like to tag pieces of words with different languages, the option to spell check or hyphenate those pieces of text would be knocked out by this (for the sake of an invisible tag). If a teacher likes to create some papers where different portions of words where marked up differently, I would use a visible markup (as colors/styles…) as the students could comprehend and
see the differencies, but that’s only me.
A invisble language tag is hard to see (which may be only true for my eyes
) on a printout.