Mailing List Hosted on Kabissa - Space for Change in Africa

a12n-collaboration Mailing List Archive: RE: [A12n-Collab] Re: 5 categories of African orthographies (Latin)

[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

  • Subject: RE: [A12n-Collab] Re: 5 categories of African orthographies (Latin)
  • From: Tunde Adegbola <taintransit@xxxxxxxxxxx>
  • Date: Sat, 22 Dec 2007 08:48:18 +0100
  • Importance: Normal
Hi Don,
No, I do not have any problems with your classification. I was only pointing the out the Conrad views for completeness.
Tunde

-----------------------------------------------------------------------------------------------
Tunde Adegbola (Ph.D.)
Executive Director
African Languages Technology Initiative
(Alt-I ... Inserting African issues into the agenda of the knowledge age)
President
Tiwa Systems Ltd.
 
11 Oluyole Way, New Bodija Ibadan, Nigeria.
+234 8034019398
------------------------------------------------------------------------------------------------



Date: Sat, 22 Dec 2007 00:53:03 +1100
Subject: Re: [A12n-Collab] Re: 5 categories of African orthographies (Latin)
From: andrewc@xxxxxxxxxxxxx
To: a12n-collaboration@xxxxxxxxxxxx


On Fri, December 21, 2007 4:58 pm, Don Osborn wrote:

> 1) The original question: whether the system of categorization - 5
> categories of orthographies according to how unicode etc. supports them.
> Am
> I correct in concluding that no one has a problem with this? Reason I ask
> is
> that I want to use this in some writing and would rather get criticism now
> than later.

Don, sounds like a reasonable categorization to me.

> 2) Existing information on Latin-based orthographies of African languages.
>
> 2a) Hartell's 1993 book. Yes, this is one we refer to often. I used it for
> a
> series of charts on http://www.bisharat.net/A12N/#countrytables (Lee
> Pearce
> also did some work there), and more recently Christian Chanard set up a
> database using Hartell's data at http://sumale.vjf.cnrs.fr/phono/ .
> Problem
> is that there is no update to this, and indeed that expanding it would be
> a
> challenge given the fact some orthographies are not set. Some even
> apparently are changing

There are a few sites that now have collections of the Hartell data, e.g. Christian Chanard's database and the material on the Bisharat site to name two.

AT some point, it might be worth conforming the currency and accuracy of the Hartell data set.

Although some languages are still in a flux, it may be possible to start building a repository of orthographic data for various languages, start with what we know and extend out.

One issue is that most treatments only cover alphabetic characters. They don't necessary identify punctuation required to support a language, grammatical symbols required, or even other symbols such as currency symbols.

Additionally, its useful to collect information on collation sequences, some languages may have more than one collation sequence, a traditional order, and a more recent dictionary order.

Its also useful to collect sample texts in each language.

From the point of view of developing fonts, keyboards, locales and other building blocks
 
> 2b) Documents like the one by Jim Agenbroad that Charles referred to, and
> indeed the oft-discussed research John did (time to bring that up again)
> would indeed be great to get online for greater access.

I'd be interested in seeing how much of Jim's work was shaped or influenced by the limitations imposed on MARC.

A few of us have over the years gathered information form a variety of resources and in the early stages of this mailing list there was some quite useful sharing of information and data.

Recently, I've started going through some of the material I've gathered over the years, and cross checking it with other resources that are now available.

If anyone is interested, I'd be willing to coordinate a repository of data on African languages, assuming there is interest in collaborating and contributing to such an archive.

>
> 4) With regard to support for Category 4 orthographies (if we agree on
> that
> terminology), that is orthographies that need combining diacritics and
> hence
> support for those, the question of how good that support is, and indeed
> how
> good the concept is, have been around for a while. The suggestion that
> more
> precomposed characters be added to Unicode has been discussed on this list
> -
> see for instance the thread beginning with
> http://lists.kabissa.org/lists/archives/public/a12n-collaboration/msg00182.h
> tml .
>

>
> The fact that the question keeps getting raised (I hear it from others
> occasionally) is sign enough that there is a need to either clarify the
> support issues and how those are being addressed, or clarify how the
> system
> doesn't work. Continued doubts about dynamic composition either need to be
> addressed with better explanations (and real support) or alternatively -
> again from a NPOV - with a real proposal that makes the justification and
> proposes specific precomposed characters. This so that we can move forward
> one way or another rather than recycling debates.

The question is to what extend are these debates formed and shaped by old experiences, or the struggle to handle older computer hardware, software and operating systems (esp proprietary systems?

The biggest barrier to correctly handling combining diacritics are:

1) the amount of older computers in use, for which no practical solution exists (although it may be worth investigating the possibility of cut down Linux installations for older hardware that is optimized to support combining diacritics.

2) a lack of sound advice and detailed instructions on what is needed to successfully work with combining diacritics. I think part of the problem is lack of training and awareness.

>
> That said (now I'm no longer NPOV), the system apparently works but the
> support is not yet there for African languages. Maybe what the problem is,
> and also the key to the concerns raised by Tunde and Samuel, is that there
> still work to do to support input and display of Yoruba diacritics (and
> other "category 4 orthographies") - so obviously it doesn't seem to work.
>

>
> In any event, I think this discussion is very timely and would like to
> encourage people with whatever experience or expertise with category 4
> orthographies (i.e., ones that require use of combining diacritics or even
> stacking of diacritics) to let us know what they think.
>


--
Andrew Cunningham
Research and Development Coordinator
Vicnet
State Library of Victoria
Australia

andrewc@xxxxxxxxxxxxx


Express yourself instantly with MSN Messenger! MSN Messenger
[Date Prev][Date Next][Thread Prev][Thread Next] [Date Index] [Thread Index]

Last Updated: Sat Jan 05 23:34:44 2008

a12n-collaboration is hosted on Kabissa - Space for Change in Africa

Your feedback is important. Click here to send a message to the Kabissa team.

Terms of Use | Privacy Notice | Web Site Credits © 1999-2006, Kabissa or its affiliates