Can anybody remember why BBC BASIC for SDL 2.0 (BBCSDL) displays these characters for character codes &81 to &9F inclusive:
I have no recollection of why this set of characters was chosen, nor what the abbreviations mean. They are not the same as BB4W displays for the same codes, so compatibility is not enhanced.
Character codes &81 to &9F
Character codes &81 to &9F
You do not have the required permissions to view the files attached to this post.
Re: Character codes &81 to &9F
Nobody? If the reason has been forgotten, what are people's views on possibly changing them to be more compatible with BB4W? My expectation would have been that they are rarely used on either platform, but that's been shattered by the admin, no less, who uses at least one of them in his recent competition entry.
Re: Character codes &81 to &9F
Hi Richard,
I think what that shows is that it would be an advantage to have a character set that matches that of BB4W more closely!
I think the stray character is a single quote, or something similar... or possibly some sort of hyphen. I haven't worked out why Word rendered them into some form that the text files don't like... It certainly wasn't intentional use of one of the "special" characters...
Best wishes,
D
I think what that shows is that it would be an advantage to have a character set that matches that of BB4W more closely!
I think the stray character is a single quote, or something similar... or possibly some sort of hyphen. I haven't worked out why Word rendered them into some form that the text files don't like... It certainly wasn't intentional use of one of the "special" characters...
Best wishes,
D
Re: Character codes &81 to &9F
A Google search found this: "ANSI characters 32 to 127 correspond to those in the 7-bit ASCII character set, which forms the Basic Latin Unicode character range. Characters 160–255 correspond to those in the Latin-1 Supplement Unicode character range. Positions 128–159 in Latin-1 Supplement are reserved for controls, but most of them are used for printable characters in ANSI".
So it looks as though BBCSDL is using the 'Latin-1' character set whereas BB4W is using the 'ANSI' character set. According to that reference the two are the same for characters 32-126 and 160-255, but not for characters 129-159, which is exactly what we observe. I worry that this difference was for a good reason, but if so I can't remember now what that reason was! Alternatively it was just an accident.
Arguably you should have written your BASIC program to use Unicode (UTF-8) encoding for safety. That would have ensured that BB4W and BBCSDL rendered the same characters, and given you the peace of mind of knowing that any character that Word was likely to introduce would be rendered correctly. Even Notepad will save-to-file in UTF-8 encoding!I haven't worked out why Word rendered them into some form that the text files don't like... It certainly wasn't intentional use of one of the "special" characters...
Do you have a source of 8x8 bitmapped character shapes for those that you think should be changed? It's not easy to represent some of them with such a coarse grid, especially things like permille (‰) or the ligatures (e.g. œ)
Re: Character codes &81 to &9F
For reference the characters required are:
Code: Select all
&82 ‚ single low quotation mark
&83 ƒ small letter f with hook
&84 „ double low quotation mark
&85 … horizontal ellipsis
&86 † dagger
&87 ‡ double dagger
&88 ˆ circumflex accent
&89 ‰ per mille
&8A Š capital S caron
&8B ‹ single left-pointing angle quotation mark
&8C Œ capital OE ligature
&8E Ž capital Z caron
&91 ‘ left single quotation mark
&92 ’ right single quotation mark
&93 “ left double quotation mark
&94 ” right double quotation mark
&95 • bullet
&96 – en dash
&97 — em dash
&98 ˜ small tilde
&99 ™ trade mark sign
&9A š small S caron
&9B › single right-pointing angle quotation mark
&9C œ small OE ligature
&9E ž small Z caron
&9F Ÿ capital Y diaeresis