Tamil Discussion archive
[Date Prev][Date Next][Thread Prev][Thread Next][Date Index][Thread Index]
Re: [WMASTERS] Design goals of 8-bit Tamil encoding standard
________________________________________________
This week's sponsors -The Asia Pacific Internet Company (APIC)
@ Nothing Less Than A Tamil Digital Renaissance Now @
<http://www.apic.net> Click now<mailto:info@apic.net> for instant info
________________________________________________
govindasamy wrote:
>
> ________________________________________________
>
> This week's sponsors -The Asia Pacific Internet Company (APIC)
> @ Nothing Less Than A Tamil Digital Renaissance Now @
> <http://www.apic.net> Click now<mailto:info@apic.net> for instant info
> ________________________________________________
>
> Mani M. Manivannan wrote:
> >
> > ________________________________________________
> >
> > This week's sponsors -The Asia Pacific Internet Company (APIC)
> > @ Nothing Less Than A Tamil Digital Renaissance Now @
> > <http://www.apic.net> Click now<mailto:info@apic.net> for instant info
> > ________________________________________________
> >
> > Since September we have been discussing the original draft proposed
> > standard of Dr. Kalyan that has been revised four times as a result of this
> > discussion. Recently Dr. Selvaa proposed a serious alternative standard.
> > During the discussion of this draft standard, we have considered
> > linguistic, political, historical as well as emotional issues. The
> > discussions have at times wandered into areas that have contributed little
> > to the standards process. This is perhaps inevitable with a truly
> > multinational group very interested in the welfare of the Tamil language
> > and particularly since the design goals of the standard have not been
> > clearly spelled out.
> >
> > Standards bodies have a tough task. They should be conservative and desist
> > from any attempt to innovate, reform or otherwise tamper with an
> > established convention. At the same time they should anticipate the users
> > needs. Otherwise the standard risks getting rejected by the users of the
> > standard. We will not be discussing a new standard if we are happy with
> > UNICODE or ISCII for our current needs.
>
> Mani, thanks for pulling me in to give my comments. you posted a valid
> question. you are right. The Draft version of the proposed version of
> ISCII -1997, which is prepared by the Deparment of Electronics -
> committee, will take care of all the things that we are discussing- with
> slight modification.
> just for info, the 1997 ISCII draft is aim at to arrive at an encoding
> that satisfies the following important requirements:
> 1. An encoding scheme that is independent of the input
> process(keyboard, speech, OCR, hand writing, ect.)
> 2. An encoding scheme that is independent of the output or rendering
> process (display, print, speech, etc.)
>
> the committee, which was set up by Government of India, in November
> 1996, has taken into accounts the followings, before drafting ISCII
> -1997:
> 1. The use of Internet as a vast source of electronic information
> 2. the increasing demands on information management applications, and
> 3. the interest expressed by a number of International Computer system
> manufacturers to provide Indian language support in their system.
>
> Mani, how many of us involved in this discussion has given thought for
> the proposed ISCII-97, and the implication it will bring to UNCODE? The
> present Unicode is implemented with the recommendation of DoE and BIS
> (Bureau of Indian Standards) based on ISCII 91 version. If this Draft
> version of ISCII-97 is accepted as a standard, the next version of
> UNICODE, will have the new code. UNICODE is tied with ISCII, DoE & BIS.
> Eventhough I am in TNSCommitte, I know pretty well, that the Tamil Nadu
> government cannot overrule DoE’s decision. TNSC can recommend to Tamil
> Nadu government, to put a proposal to make slight changes in the
> ISCII-97, provided we have a strong case to argue. Tamil Nadu
> government cannot make new standard on encoding.
> On 11 Sep you posed some very good questions. (What the recommended
> process for character encoding standards creation and acceptance by
> international standard bodies? ) I hope somebody will answer you.
>
> >
> > I am glad that most of the Tamil font and software developers are active in
> > this discussion. I am not a Tamil software developer and I have no vested
> > interest in the outcome except as a Tamil software user. I hope the group
> > will not be offended with my attempt to focus our efforts by explicitly
> > stating the design goals. I have used the Unicode standard and the archive
> > of the Tamil standard discussions that Dr. Kalyan kindly provided me. I
> > expect the participants of this discussion to augment and refine these
> > statements.
> >
> > ------------------------------------------------------------------------------
> >
> > Tamil character encoding standard design goals:
> >
> > 1. Establish a consistent international Tamil character standard that can
> > be used by software publishers, Web publishers, newspaper adn book
> > publishers, bibliographic information services, and academic researchers.
> >
> > 2. Encode characters, not glyphs.
>
> on Sep 17 AnbuArasan clearly explain, that the proposed encodings should
> be based on basic characters not glyphs, and he gave the reasons. At
> this point we have to look at his mail seriously. can any one of you
> repost that mail?
> >
> > 3. Must be universal. The standard must be include all characters that are
> > likely to be used in everyday Tamil text interchange.
> >
> > 4. Must be efficient. "Plain text, composed of a sequence of fixed-width
> > characters, provides an extremely useful model because it is simple to
> > parse: software does not have to maintain state, look for special escape
> > sequences, or search forward or backward through text to identify
> > characters." From Unicode 2.0 standard. ( Note: This goal may be
> > problematic for Tamil character standard. When we violate this standard, we
> > must be very careful to consider alternatives and explicitly state the
> > reasons, if any for vioalting this goal.)
> >
> > 5. Must be Uniform. "A fixed character code allows efficient sorting,
> > searching, display, and editing of text." From Unicode 2.0. (Note: Once
> > again, Tamil standard violates this goal in some places by encoding some
> > characters with a single byte, but other characters with a modifier.
> > However, Unicode itself does exactly that for Indic languages. When
> > considering exceptions, we can use Unicode as a useful guide for
> > implementation.)
> >
> > 6. Must be Unambiguous. Any 8-bit value should always represent the same
> > character. (Note: This pretty much kills ORNL.)
> >
> > 7. Must be usable and coexist with popular software until Unicode compliant
> > software becomes available.
> >
> > 8. Must be Unicode compatible. (i.e.) Shall not use characters that cannot
> > be saved in Unicode format. (Note: The TNC/DOE-India may require an ISCII
> > compatibility as well. )
> >
> > 9. Must be in the public domain. The character encoding standard will have
> > no restrictions on its use. It can be used freely for both commercial and
> > private purposes. Enhancements, alterations and other changes to the
> > standard will be done only by the standards body.
> >
> > ------------------------------------------------------------------------------
> >
> > I will include another extract from the Unicode standard that can guide us
> > to focus on the task at hand and bring it to a reasonable conclusion:
> >
> > "The .. standard does not encode idiosyncratic, personal, novel, rarely
> > exchanged, or private-use characters, nor does it encode logos or graphics.
> > Artificial entities, whose sole function is to serve transiently in the
> > input of text, are excluded. Graphologies unrelated to text, such as
> > musical and dance notations, are outside the scope of the .. standard.
> > Font variants are explicitly not encoded. The .. standard includes a
> > private use area, which may be used to assign codes to characters not
> > included."
> >
> > It is my belief that the current draft proposed standard and Dr. Selvaa's
> > alternative are identical in essential details, except those that Dr.
> > Selvaa himself has noted. I notice that there is some consensus around
> > the proposed standard from such font creators as Dr. Kalyan, Muthu
> > Nedumaran, Dr. Srinivasan and Prof. Hart. Prof. Schiffman had expressed a
> > willingness to accept the draft proposed standard. Mr. Ravi Paul has
> > raised specific problems with the draft standard but seems to support the
> > standard otherwise. Prof. Naa. Govindasamy, a pioneer of the Tamil on the
> > Internet effort and a member of the TNC has not commented on the draft
> > proposed standard.
>
> Mani, I am not a prof. just address me as Govind.
> In fact I don’t want to participate in this discussion.. I preferred to
> be a silent observer and a silent worker. since I was dragged in, I may
> give my comments in a few days.
>
> I am eagerly waiting for his technical documentation of
> > TamilNet Font's character encoding
>
> There is a paper (INET96, Montreal, June 1996) lying in this site (
> http://aua.am/aua/auacs/inet'96/a5/ ) you will get some kind of
> technical explanation on TamilNet font encodings
Mani, the link is not correct. the URL is
http://aua.am/aua/auacs/inet'96/a5/
sorry for mistake.
Thanks
anbudan
Naa Govindasamy
>
> and his comparison of TamilNet encoding
> > with alternate encoding conventions.
>
> Mani, I have not thought of that. If there is need, I may consider it.
>
> Thanks.
> Naa Govindasamy
> >
> > Mani M. Manivannan
> > Fremont, CA, USA.
> >
> > P.S.
> > I am sorry for this very long post. I hope this nudges everybody to focus
> > on the goal and iron out differences using technical guidelines. By
> > training I am a technician not a writer. I hope I haven't offended anybody
> > with my writing style. If I did, I apologize to all in advance. And if
> > anybody wants to correct my style, please, please send me personal e-mail.
> >
> > ________________________________________________
> >
> > Sponsors/Advertisers needed - please email bala@tamil.net
> > Check out the tamil.net web site on <http://tamil.net>
> > Postings to <webmasters@tamil.net>. To unsubscribe send
> > the text - unsubscribe webmasters - to majordomo@tamil.net
> > ________________________________________________
>
> ________________________________________________
>
> Sponsors/Advertisers needed - please email bala@tamil.net
> Check out the tamil.net web site on <http://tamil.net>
> Postings to <webmasters@tamil.net>. To unsubscribe send
> the text - unsubscribe webmasters - to majordomo@tamil.net
> ________________________________________________
________________________________________________
Sponsors/Advertisers needed - please email bala@tamil.net
Check out the tamil.net web site on <http://tamil.net>
Postings to <webmasters@tamil.net>. To unsubscribe send
the text - unsubscribe webmasters - to majordomo@tamil.net
________________________________________________
Home |
Main Index |
Thread Index