Replies: 14 comments 45 replies
-
Hello @tyama501, I have been thinking of unicode for ELKS also, but there is one main problem on the IBM PC, which could also result in similar problem on PC-98. That is, the text-mode character ROM only has 128 (additional) non-ASCII characters that can be displayed on the screen. Thus, any fancy input processing that resulted in a non-displayable character might not actually work very well for arbitrary input. That said, with the recent work been done to allow applications to run outside/off the console (i.e. via the serial terminal or network telnet session), this problem has been solved by converting any unicode character to UTF-8, which automatically displays properly on most modern terminal emulators (e.g. all on Linux and macOS, for instance). This is how the IBM PC ROM character set is easily displayable outside the console: there is a CP 437 -> unicode conversion table which readily allows for converting IBM PC character ROM output to unicode for display on terminal emulators. For PC-98, I am aware that the basic (text mode?) code page glyphs are different. Thus, one approach would be to use a different japanese PC-98 to unicode conversion table, which, when also including the CP 437 table, should allow for japanese and IBM PC applications to run on terminal emulators. I haven't started yet, but if we (respectively) wrote unicode input to -> IBM PC and PC-98 code page conversion routines, that would allow for unicode input to be displayed on the native console. However, as noted above, this allows for only 128 additional characters, and is potentially a lot of work for little gain, since a much simpler Latin-1 conversion table could be used while keeping the input path at 8 bits (for IBM PC). Can the PC-98 display arbitrary bitmapped glyphs in text mode? What are your thoughts on all this? Thank you! |
Beta Was this translation helpful? Give feedback.
-
Hello @ghaerr , PC-98 has kanji ROM and text console can display double byte JIS kanji but it is different from shift-jis which are used from early Windows and DOS, and it is different from unicode. So it needs conversion. I am thinking of some nano-X application that can use additional fonts. It is also interesting your idea if serial console works. Japanese need to separate alphabet to appropriate hiragana/katakana/kanji character so the input method engine is also needed. |
Beta Was this translation helpful? Give feedback.
-
If an additional font is created in one of the formats allowed in nano-X (.fnt, .bdf/.pcf), then it should be no problem to display any glyph from a nano-X application, using an arbitrary character index. However, there are severe memory limitations on this 16-bit version of nano-X, so the font code may have to use disk-caching (not written yet), depending on the size of the font file.
The idea there is that it will be a lot easier to get applications to work with unicode input and libraries, than to enhance the kernel and/or console code. As said earlier, we could likely add UTF-8 processing to consoles, but they would be quite limited in the characters that can be displayed on console. In addition, we have a compatibility issue with all characters > 127, as their high bit interferes with UTF-8 processing. |
Beta Was this translation helpful? Give feedback.
-
I don't know the size but this seems bdf fonts. |
Beta Was this translation helpful? Give feedback.
-
Hello @tyama501,
That BDF file is huge: 1.4M! Also, I didn't realize that the ELKS version of nano-X does not have support for reading .pcf files. I have a number of font conversion tools, which we could use to translate font formats, except ELKS nano-X will require all fonts to be compiled into the data segment (64K max)! Thank you! |
Beta Was this translation helpful? Give feedback.
-
Hello @ghaerr , This is very small 8x8 fonts but even this the size is more than several 100kB so it seems impossible to compile in 64KB. |
Beta Was this translation helpful? Give feedback.
-
I download some of files from the site and found that MISAKI.FNT binary in misaki_fontx_2012-06-03 is 55KB. |
Beta Was this translation helpful? Give feedback.
-
I recently added the So it looks like we could solve this problem :) |
Beta Was this translation helpful? Give feedback.
-
Happy New Year @ghaerr , I could display Japanese UTF-8 file using the serial terminal! Thank you! |
Beta Was this translation helpful? Give feedback.
-
I have released first alpha version of Simple UTF-8 Japanese text file viewer with Nano-X for ELKS. https://github.com/tyama501/nxjtxtv Thank you! |
Beta Was this translation helpful? Give feedback.
-
I found sjis to utf8 converter with c source that might be portable if far memory and separated table file is used. |
Beta Was this translation helpful? Give feedback.
-
Hello @tyama501, After looking at your screenshots above, I came up with an idea that I thought you might find interesting: creating a Japanese console that runs in graphics mode for ELKS! :) Instead of running a conversion program to display japanese text, the new console could run (using low-level drawing code pulled over from Nano-X) directly interpreting ShiftJIS characters and displaying them on the console in graphics mode. Since scrolling graphics text will be quite slow, the console could just erase and start at the top instead. If you're interested in this, I would suggest starting with getting an ELKS graphics console running (initially for PC-98, but later I could help for IBMPC) that just displays ASCII text. After that, the MISAKI.FNT file could be loaded (or compiled into the kernel) and the conversion routines implemented. A graphics console would be lots slower, but still potentially interesting also to display native UTF-8. I suppose it would mostly be a fun project, rather than being seriously useful, depending on the speed of the machine. Thank you! |
Beta Was this translation helpful? Give feedback.
-
Release & Updates First Porting of SJIS to UTF8 for ELKS Simple UTF-8 Japanese text file viewer |
Beta Was this translation helpful? Give feedback.
-
Release & Updates Added character size double hight and width mode. Simple UTF-8 Japanese text file viewer |
Beta Was this translation helpful? Give feedback.
-
Hello,
I was searching for Japanese Input Method ( IME) and found an open source project.
Anthy-unicode
https://github.com/fujiwarat/anthy-unicode/blob/main/doc/GUIDE.english
The Anthy was used on Linux and this is unicode version.
There is a long way to show and input the language other than English in ELKS but just for information.
Beta Was this translation helpful? Give feedback.
All reactions