Tillbaka till svenska Fidonet
English   Information   Debug  
ENET.SYSOP   33806
ENET.TALKS   0/32
ENGLISH_TUTOR   0/2000
EVOLUTION   0/1335
FDECHO   0/217
FDN_ANNOUNCE   0/7068
FIDONEWS   23548
FIDONEWS_OLD1   0/49742
FIDONEWS_OLD2   0/35949
FIDONEWS_OLD3   0/30874
FIDONEWS_OLD4   0/37224
FIDO_SYSOP   12847
FIDO_UTIL   0/180
FILEFIND   0/209
FILEGATE   0/212
FILM   0/18
FNEWS_PUBLISH   4200
FN_SYSOP   41525
FN_SYSOP_OLD1   71952
FTP_FIDO   0/2
FTSC_PUBLIC   0/13586
FUNNY   0/4886
GENEALOGY.EUR   0/71
GET_INFO   105
GOLDED   0/408
HAM   0/16053
HOLYSMOKE   0/6791
HOT_SITES   0/1
HTMLEDIT   0/71
HUB203   466
HUB_100   264
HUB_400   39
HUMOR   0/29
IC   0/2851
INTERNET   0/424
INTERUSER   0/3
IP_CONNECT   719
JAMNNTPD   0/233
JAMTLAND   0/47
KATTY_KORNER   0/41
LAN   0/16
LINUX-USER   0/19
LINUXHELP   0/1155
LINUX   0/22012
LINUX_BBS   0/957
mail   18.68
mail_fore_ok   249
MENSA   0/341
MODERATOR   0/102
MONTE   0/992
MOSCOW_OKLAHOMA   0/1245
MUFFIN   0/783
MUSIC   0/321
N203_STAT   900
N203_SYSCHAT   313
NET203   321
NET204   69
NET_DEV   0/10
NORD.ADMIN   0/101
NORD.CHAT   0/2572
NORD.FIDONET   189
NORD.HARDWARE   0/28
NORD.KULTUR   0/114
NORD.PROG   0/32
NORD.SOFTWARE   0/88
NORD.TEKNIK   0/58
NORD   0/453
OCCULT_CHAT   0/93
OS2BBS   0/787
OS2DOSBBS   0/580
OS2HW   0/42
OS2INET   0/37
OS2LAN   0/134
OS2PROG   0/36
OS2REXX   0/113
OS2USER-L   207
OS2   0/4785
OSDEBATE   0/18996
PASCAL   0/490
PERL   0/457
PHP   0/45
POINTS   0/405
POLITICS   0/29554
POL_INC   0/14731
PSION   103
R20_ADMIN   1117
R20_AMATORRADIO   0/2
R20_BEST_OF_FIDONET   13
R20_CHAT   0/893
R20_DEPP   0/3
R20_DEV   399
R20_ECHO2   1379
R20_ECHOPRES   0/35
R20_ESTAT   0/719
R20_FIDONETPROG...
...RAM.MYPOINT
  0/2
R20_FIDONETPROGRAM   0/22
R20_FIDONET   0/248
R20_FILEFIND   0/24
R20_FILEFOUND   0/22
R20_HIFI   0/3
R20_INFO2   2810
R20_INTERNET   0/12940
R20_INTRESSE   0/60
R20_INTR_KOM   0/99
R20_KANDIDAT.CHAT   42
R20_KANDIDAT   28
R20_KOM_DEV   112
R20_KONTROLL   0/13068
R20_KORSET   0/18
R20_LOKALTRAFIK   0/24
R20_MODERATOR   0/1852
R20_NC   76
R20_NET200   245
R20_NETWORK.OTH...
...ERNETS
  0/13
R20_OPERATIVSYS...
...TEM.LINUX
  0/44
R20_PROGRAMVAROR   0/1
R20_REC2NEC   534
R20_SFOSM   0/340
R20_SF   0/108
R20_SPRAK.ENGLISH   0/1
R20_SQUISH   107
R20_TEST   2
R20_WORST_OF_FIDONET   12
RAR   0/9
RA_MULTI   106
RA_UTIL   0/162
REGCON.EUR   0/2055
REGCON   0/13
SCIENCE   0/1206
SF   0/239
SHAREWARE_SUPPORT   0/5146
SHAREWRE   0/14
SIMPSONS   0/169
STATS_OLD1   0/2539.065
STATS_OLD2   0/2530
STATS_OLD3   0/2395.095
STATS_OLD4   0/1692.25
SURVIVOR   0/495
SYSOPS_CORNER   0/3
SYSOP   0/84
TAGLINES   0/112
TEAMOS2   0/4530
TECH   0/2617
TEST.444   0/105
TRAPDOOR   0/19
TREK   0/755
TUB   0/290
UFO   0/40
UNIX   0/1316
USA_EURLINK   0/102
USR_MODEMS   0/1
VATICAN   0/2740
VIETNAM_VETS   0/14
VIRUS   0/378
VIRUS_INFO   0/201
VISUAL_BASIC   0/473
WHITEHOUSE   0/5187
WIN2000   0/101
WIN32   0/30
WIN95   0/4277
WIN95_OLD1   0/70272
WINDOWS   0/1517
WWB_SYSOP   0/419
WWB_TECH   0/810
ZCC-PUBLIC   0/1
ZEC   4

 
4DOS   0/134
ABORTION   0/7
ALASKA_CHAT   0/506
ALLFIX_FILE   0/1313
ALLFIX_FILE_OLD1   0/7997
ALT_DOS   0/152
AMATEUR_RADIO   0/1039
AMIGASALE   0/14
AMIGA   0/331
AMIGA_INT   0/1
AMIGA_PROG   0/20
AMIGA_SYSOP   0/26
ANIME   0/15
ARGUS   0/924
ASCII_ART   0/340
ASIAN_LINK   0/651
ASTRONOMY   0/417
AUDIO   0/92
AUTOMOBILE_RACING   0/105
BABYLON5   0/17862
BAG   135
BATPOWER   0/361
BBBS.ENGLISH   0/382
BBSLAW   0/109
BBS_ADS   0/5290
BBS_INTERNET   0/507
BIBLE   0/3563
BINKD   0/1119
BINKLEY   0/215
BLUEWAVE   0/2173
CABLE_MODEMS   0/25
CBM   0/46
CDRECORD   0/66
CDROM   0/20
CLASSIC_COMPUTER   0/378
COMICS   0/15
CONSPRCY   0/899
COOKING   28578
COOKING_OLD1   0/24719
COOKING_OLD2   0/40862
COOKING_OLD3   0/37489
COOKING_OLD4   0/35496
COOKING_OLD5   9370
C_ECHO   0/189
C_PLUSPLUS   0/31
DIRTY_DOZEN   0/201
DOORGAMES   0/2024
DOS_INTERNET   0/196
duplikat   6000
ECHOLIST   0/18295
EC_SUPPORT   0/318
ELECTRONICS   0/359
ELEKTRONIK.GER   1534
ENET.LINGUISTIC   0/13
ENET.POLITICS   0/4
ENET.SOFT   0/11701
Möte FIDONEWS_OLD4, 37224 texter
 lista första sista föregående nästa
Text 21144, 103 rader
Skriven 2015-02-16 01:18:33 av FidoNews Robot (2:2/2.0)
Ärende: FidoNews 32:07 [02/07]: General Articles
================================================
=================================================================
                        GENERAL ARTICLES
=================================================================

                         UTF-8 In The Nodelist
                         By Michiel van der Vlist, 2:280/5555

This weekend an initiative was launched in Z2 to break the "ASCII only
barrier" for the nodelist. Ever since the first Fidonet nodelist was
issued in 1984, the character set for the nodelist has been the ASCII
set. The relevant FTSC document, FTS-5000.005.005 says this:

  The nodelist is a flat text file containing any number of lines,
  using only the ASCII (7 bit) character set.

In the early days of Fidonet, this was acceptable when Fidonet was
limited to Northern America and there was no other choice anyway
because at the time that was the only standardised character set that
all Fidonet systems had in common.

But already in 1989 when Fidonet had spread all over the world the
ASCII only barrier provoked attempts to break it. In the R22 segment
of nodelist.197 of 1989, we find lines like this:

,60,Night_Service,Espoo_Finland,Jali_H{kkil{,358-0-5471235,1200,V22,XA
,98,Vallu_Opus,Valkeala_Finland,Kimmo_Kotim{ki,358-51-863364,1200,V22
,304,Kummised{n_kyytipoika,Keitele_Finland,Urpo_Heikkil{,358-78-52663,
,308,Kiveri|_BBS,Lahti_Finland,Jukka_Sorjonen,358-18-182476,1200,CT3
,86,Dataline,Sipoo_Finland,Mikael_L|nnroth,358-0-234141,2400,V22,XB

What we see here is an early attempt to accomodate names written with
"funny" characters. At the time MakeNl did not allow 8 bit characters,
they were replaced by question marks, so if a different character set
was used, it had to be a seven bit set. The character set used is
ISO 646-FI. In this set the seldom used ASCII characters { \ } { | and
} are replaced by the upper and lower case a diaresis, o diaresis and
a ring. (ä ö å)

My nodelist archive is incomplete, so I was unable to look beyond 2004
In 2004 this was still in use in the Finnish part of the nodelist. It
illustrates that there is a demand for the ability to write the
sysop's name in his own language in his own alphabet.

Using national characters sets however is impractical to use an
understatement. The above method is not suitable for more general use.
If all regions or nets started using their own encoding, it would
become quit messy.

The latest version of MakeNl allows the use of non-ASCII characters by
using the option "Allow8bit 1". Using the 8th bit gives much more
freedom than substituting not much used characters by national
characters.

An eight bit character set however is not good enough either. Even
with the nodelist shrunk to a percentage of what it once was, there
is no single eight bit set that covers all of Fidonet. Net even "most
of Fidonet. Fortunately most modern operating systems now use
unicode internally, so dealing with UTF-8 encoding is not all that
hard.

In December a test was performed by adding a season's greeting to the
prolog of the daily nodelist as distributed in Z2. The encoding was
UTF-8 and it covered all the umlauts and accents as used in Western
Europe in addition to characters with macrons as used in Eastern
Europe as well as Cyrilics as used in Russia, Ukrania and Bulgaria
and more. This test was a success in that it broke nothing. Most
likely the majority of Fidonet sysops never saw it.

As a follow up ZC2 is now starting to distribute a daily nodelist in
UTF-8 format. It is not a replacement of the existing weekly nodelist
and nodelist diff, nor does it replace the daily nodelist in ASCII.
It is an additional service with a seperate distribution. The name
of the file is DAILYUTF.ddd where ddd is the day of the year.

Presently there are contributions in UTF-8 from R28 and R56.

The following paragraph has been coordinated with the ZC2:

RC's wishing to participate in the project are invited to do so. They
should:

1) Send a message to the ZC2 to inform him. The ZC2 needs to adjust
his configuration to process UTF-8 formatted segments from an RC.

2) Submit segments named UTFRrr.ddd  where rr is the region number and
ddd is the day of the year of the coming or present Friday or
UTFRrr.Cdd where C is the compression and dd are the last two digits
of the day of the year of the coming or present Friday.

3) The character encoding for the segment must be UTF-8.

4) Particpating in the project does not relieve RC's of the
oubligation to submit REGIONrr segments in ASCII. They must continue
to submit those segments in addition to the segments in UTF-8.


Next week: How to create and edit UTF-8 text files.


-----------------------------------------------------------------

--- Azure/NewsPrep 3.0
 * Origin: Home of the Fidonews (2:2/2.0)