Tillbaka till svenska Fidonet
English   Information   Debug  
ENET.SYSOP   33805
ENET.TALKS   0/32
ENGLISH_TUTOR   0/2000
EVOLUTION   0/1335
FDECHO   0/217
FDN_ANNOUNCE   0/7068
FIDONEWS   23541
FIDONEWS_OLD1   0/49742
FIDONEWS_OLD2   0/35949
FIDONEWS_OLD3   0/30874
FIDONEWS_OLD4   0/37224
FIDO_SYSOP   12847
FIDO_UTIL   0/180
FILEFIND   0/209
FILEGATE   0/212
FILM   0/18
FNEWS_PUBLISH   4193
FN_SYSOP   41525
FN_SYSOP_OLD1   71952
FTP_FIDO   0/2
FTSC_PUBLIC   0/13584
FUNNY   0/4886
GENEALOGY.EUR   0/71
GET_INFO   105
GOLDED   0/408
HAM   0/16053
HOLYSMOKE   0/6791
HOT_SITES   0/1
HTMLEDIT   0/71
HUB203   466
HUB_100   264
HUB_400   39
HUMOR   0/29
IC   0/2851
INTERNET   0/424
INTERUSER   0/3
IP_CONNECT   719
JAMNNTPD   0/233
JAMTLAND   0/47
KATTY_KORNER   0/41
LAN   0/16
LINUX-USER   0/19
LINUXHELP   0/1155
LINUX   0/22011
LINUX_BBS   0/957
mail   18.68
mail_fore_ok   249
MENSA   0/341
MODERATOR   0/102
MONTE   0/992
MOSCOW_OKLAHOMA   0/1245
MUFFIN   0/783
MUSIC   0/321
N203_STAT   900
N203_SYSCHAT   313
NET203   321
NET204   69
NET_DEV   0/10
NORD.ADMIN   0/101
NORD.CHAT   0/2572
NORD.FIDONET   189
NORD.HARDWARE   0/28
NORD.KULTUR   0/114
NORD.PROG   0/32
NORD.SOFTWARE   0/88
NORD.TEKNIK   0/58
NORD   0/453
OCCULT_CHAT   0/93
OS2BBS   0/787
OS2DOSBBS   0/580
OS2HW   0/42
OS2INET   0/37
OS2LAN   0/134
OS2PROG   0/36
OS2REXX   0/113
OS2USER-L   207
OS2   0/4785
OSDEBATE   0/18996
PASCAL   0/490
PERL   0/457
PHP   0/45
POINTS   0/405
POLITICS   0/29554
POL_INC   0/14731
PSION   103
R20_ADMIN   1117
R20_AMATORRADIO   0/2
R20_BEST_OF_FIDONET   13
R20_CHAT   0/893
R20_DEPP   0/3
R20_DEV   399
R20_ECHO2   1379
R20_ECHOPRES   0/35
R20_ESTAT   0/719
R20_FIDONETPROG...
...RAM.MYPOINT
  0/2
R20_FIDONETPROGRAM   0/22
R20_FIDONET   0/248
R20_FILEFIND   0/24
R20_FILEFOUND   0/22
R20_HIFI   0/3
R20_INFO2   2789
R20_INTERNET   0/12940
R20_INTRESSE   0/60
R20_INTR_KOM   0/99
R20_KANDIDAT.CHAT   42
R20_KANDIDAT   28
R20_KOM_DEV   112
R20_KONTROLL   0/13063
R20_KORSET   0/18
R20_LOKALTRAFIK   0/24
R20_MODERATOR   0/1852
R20_NC   76
R20_NET200   245
R20_NETWORK.OTH...
...ERNETS
  0/13
R20_OPERATIVSYS...
...TEM.LINUX
  0/44
R20_PROGRAMVAROR   0/1
R20_REC2NEC   534
R20_SFOSM   0/340
R20_SF   0/108
R20_SPRAK.ENGLISH   0/1
R20_SQUISH   107
R20_TEST   2
R20_WORST_OF_FIDONET   12
RAR   0/9
RA_MULTI   106
RA_UTIL   0/162
REGCON.EUR   0/2055
REGCON   0/13
SCIENCE   0/1206
SF   0/239
SHAREWARE_SUPPORT   0/5146
SHAREWRE   0/14
SIMPSONS   0/169
STATS_OLD1   0/2539.065
STATS_OLD2   0/2530
STATS_OLD3   0/2395.095
STATS_OLD4   0/1692.25
SURVIVOR   0/495
SYSOPS_CORNER   0/3
SYSOP   0/84
TAGLINES   0/112
TEAMOS2   0/4530
TECH   0/2617
TEST.444   0/105
TRAPDOOR   0/19
TREK   0/755
TUB   0/290
UFO   0/40
UNIX   0/1316
USA_EURLINK   0/102
USR_MODEMS   0/1
VATICAN   0/2740
VIETNAM_VETS   0/14
VIRUS   0/378
VIRUS_INFO   0/201
VISUAL_BASIC   0/473
WHITEHOUSE   0/5187
WIN2000   0/101
WIN32   0/30
WIN95   0/4277
WIN95_OLD1   0/70272
WINDOWS   0/1517
WWB_SYSOP   0/419
WWB_TECH   0/810
ZCC-PUBLIC   0/1
ZEC   4

 
4DOS   0/134
ABORTION   0/7
ALASKA_CHAT   0/506
ALLFIX_FILE   0/1313
ALLFIX_FILE_OLD1   0/7997
ALT_DOS   0/152
AMATEUR_RADIO   0/1039
AMIGASALE   0/14
AMIGA   0/331
AMIGA_INT   0/1
AMIGA_PROG   0/20
AMIGA_SYSOP   0/26
ANIME   0/15
ARGUS   0/924
ASCII_ART   0/340
ASIAN_LINK   0/651
ASTRONOMY   0/417
AUDIO   0/92
AUTOMOBILE_RACING   0/105
BABYLON5   0/17862
BAG   135
BATPOWER   0/361
BBBS.ENGLISH   0/382
BBSLAW   0/109
BBS_ADS   0/5290
BBS_INTERNET   0/507
BIBLE   0/3563
BINKD   0/1119
BINKLEY   0/215
BLUEWAVE   0/2173
CABLE_MODEMS   0/25
CBM   0/46
CDRECORD   0/66
CDROM   0/20
CLASSIC_COMPUTER   0/378
COMICS   0/15
CONSPRCY   0/899
COOKING   28498
COOKING_OLD1   0/24719
COOKING_OLD2   0/40862
COOKING_OLD3   0/37489
COOKING_OLD4   0/35496
COOKING_OLD5   9370
C_ECHO   0/189
C_PLUSPLUS   0/31
DIRTY_DOZEN   0/201
DOORGAMES   0/2014
DOS_INTERNET   0/196
duplikat   6000
ECHOLIST   0/18295
EC_SUPPORT   0/318
ELECTRONICS   0/359
ELEKTRONIK.GER   1534
ENET.LINGUISTIC   0/13
ENET.POLITICS   0/4
ENET.SOFT   0/11701
Möte FIDONEWS_OLD4, 37224 texter
 lista första sista föregående nästa
Text 36917, 112 rader
Skriven 2016-10-03 01:39:58 av FidoNews Robot (2:2/2.0)
Ärende: FidoNews 33:40 [01/06]: General Articles
================================================
=================================================================
                        GENERAL ARTICLES
=================================================================

                An UTF-8 nodelist. And now what?
                By Michiel van der Vlist, 2:280/5555


As you may or may not know, ZC2 in cooperation with some Z2 RCs,
distributes an UTF-8 encoded nodelist in addition to the weekly and
daily ASCII encoded lists. It is distributed on a daily basis in the
file area DAILYUTF.

The project started a couple of years ago and took a slow start.
Presently only R28 and R56 participate actively by submitting an UTF-8
encoded segment. The number of regions participating passively is
unknown.

The UTF-8 nodelist offers sysops the opportunity to have their names,
their loaction and the name of their system listed as spelled in the
alfabet of their native language.

I am often asked "what use is an UTF-8 encoded nodelist, when there is
hardly any FTN sofware around that can handle it?" This article is
about how I deal with the UTF-8 nodelist in my system. I hope it will
be a guideline for others.

Of the the software I use: binkd, Fmail and Golded, only Golded is
relevant. Binkd and Fmail are character encoding agnostic with regard
to the nodelist. But Golded uses the nodelist as a lookup data base
and it it looks at the names of the sysops.

Golded can not deal directly with UTF-8. Even worse, it has the nasty
habit - at least the Windows version - of disregarding the active
codepage  and always forcing the code page that was in effect at
system start up. In my case CP850. So I have no choice but - for
better or worse - to convert to CP850 whatever I offer to Golded's
nodelist compiler.

For this I use two commonly available utilities: sed and iconv. Sed is
a so called serial or stream editor. It converts streams of bytes into
something else. I this case I use it to convert characters not present
in CP850 to something else. Sed is driven bij a script in this case
called to850.scr

Here is the line in the batch file that is called  when DILYUTF.ddd
comes in:

Daynbr Sed -f to850.scr dailyutf.@### >dailyutf.998

Daynbr is the win32 version of the famous daynbr.com by Ben Baker.

Here is the content of to850.scr:

s/ßÖ·â·ßÖ·â¶/ij/g
s/ßÖ·â·ßÖ·Öà/IJ/g
s/âéü/EUR/g

Oops, that looks nasty. It looks OK when viewed in in UTF-8
environment, but unfortunately Fidonews can not deal with UTF-8 yet,
so it does no look so good. What is betweem the first slashes in the
first line is the two byte UTF-8 code for the Dutch concatenated 'ij'
as viewed in CP850. This translates the Dutch concatenated 'ij'into
the diftong 'ij'. The second line does the same for the comapnion
capital.

The third line translates the Euro currency symbol into "EUR". Those
are all the characters that I presently expect in the UTF-8 nodelist
that do not fit into CP850.

The next line in the batch file calls iconv. Iconv is a character code
conversion utility. It converts files from one character encoding
scheme to another. Of course it will only work properly if the
characters to bve converted are present in the target encoding set.

Oh, wait, I first introduce a 2 second delay. To make sure that the
next file, DAILYUTF.999 has a later time stamp then DAILYUTF.998.
That way Golded's nodelist compiler sees it as "the latest".

ping -n 2 loopback
Iconv -c -f utf-8 -t cp850 dailyutf.998 >dailyutf.999
cd \fido\golded
gncyg -f -d

The -f tells golded to do a forced compile and the -d tells it to
remove duplicate entries. There will be a lot as I compile both the
ASCII nodelist and the UTF-8 noelist

In golded's config I have this for the nodelist section:

NODEPATH d:\fido\nodelist\
NODELIST d:\fido\\nodelist\dailyutf.*
NODELIST d:\fido\nodelist\dailylst.*
NODELIST d:\fido\z2pnt\z2pnt.*


As a result I can now type either "schroeter" or "schröter" in the To:
field of a message in Golded and the nodelist lookup of Golded will
give me the node numbers of Ullrich Schroeter or Ullrich Schröter.

This is an ongoing project. When our friends in Eastern Europe join
the project, things get more complicated. I do not yet know how to
deal with that. If and when it happens and I find a workaround, that
may trigger a follow up article.


(C) 2016, Michiel van der Vlist.

-----------------------------------------------------------------

--- Azure/NewsPrep 3.0
 * Origin: Home of the Fidonews (2:2/2.0)