[Series] Open Software, Open Content, Open Translation Part IV


h1 Posted 8 months, 2 weeks ago mid-morning by oso

In parts one, two, and three, I tried to explain 1.) just what is open source software, 2.) what is open content and 3.) why both are in search of better translation tools. Now it’s time to look at how open source software programmers and open content creators think about translation.

First off, I think it is useful if we divide ‘translation’ into three different categories.

1.) The translation (localization) of software.

This refers to making a piece of software (like Firefox, for example) available in other languages. To do this, the developers of the software program divide all of the text found within the program into “strings”. Each ’string’ is a short collection of consecutive words. An example of a string in Firefox would be ‘Save Page As …’ which is found under the File menu pulldown. In Spanish, that string is translated as ”.

Picture 2.png Picture 3.png

2.) Translation (localization) of website user interfaces.

Blogs and most websites these days have two types of content. There is the text that always stays the same (’interface’) and the text that is frequently updated (’content’). For example, this post that I’m writing now is content while the menu items above that say ‘Begin’, ‘About’, ‘Photography’, etcetera, are ‘interface‘ because they always stay the same; they are part of the website’s design. Translating these two different parts of a website (interface and content) involve two different processes. To translate the interface of a website, we use the same process as translating a piece of software like Firefox. We divide all of the text of the website’s design as ’strings’. We then need to compile a list of all the strings that need to be translated in order to make the interface available in another language. The most common type of file which lists a series of strings and their translations is called a PO file. (The other standard way of doing this is with XLIFF files.)

One of the strings from WordPress is “Last 15 Posts”. That string can be found in the WordPress POT file and can be translated into any other language. A Bengali translator, for example, would take that POT file and add the translation of “Last 15 Posts” as “সর্বশেষ ১৫টি পোস্ট”.

Picture 4.png Picture 5.png

Here is a list of various programs that can be used to create and edit PO files.

3.) Translation of content.

Notice that it doesn’t just say ‘web content’ because any type of content that can exist online can also exist offline. For example, one of the conference attendees, Tomas Krag, is the author of a book on “Wireless Networking in the Developing World” which is available both online and in printed book-form. The book has already been translated into Spanish and French, but Tomas would obviously like it to be made available in as many languages as possible.

When we talk about translatable content, we’re really talking about just two types of things: text and voice. Images and video are not translatable unless they involve text and voice (which video usually does.) When it comes to multi-lingual open content websites, unfortunately, there aren’t many success stories. Translating content like a blog post or a five minute YouTube video is often more time-consuming than writing the original post or making the original video. And while being the author of good web content can turn you into a web celebrity, translators of that content are rarely given any recognition.

There are three open content, multilingual websites that stand out as success stories. First is Cafebabel.com, which dubs itself as “a European current affairs magazine: comment and analysis from Europe in 7 languages.” The website - and just about all of its content - is available in English, French, Italian, Spanish, German, Catalan, and Polish.

The biggest open content success story, without any doubt, is Wikipedia. The open content encyclopedia is available in more than 100 languages. Some languages, such as Aymara, have less than 1,000 entries while others like Galician have over 10,000 entries. One of the really great features of the website is that when you go to any single entry - for example, Karl Marx - on the left-hand side, you see a list of all the other languages which the entry is available in on.

Picture 4.png Picture 5.png

This is the same basic feature we use on Global Voices to let someone know on the main site that a particular blog post is available in other languages. For example, when I recently published my interview with Cristina Quisbert of the Voces Bolivianas project, Global Voices’ kind volunteers translated it into French, Portuguese, German, Bengali, Spanish, and Malagasy. Note that not only was the post translated into all those languages, but the video was also subtitled in Bengali, English, Spanish, and Portuguese.

Picture 1.png

Each individual translator from each of these three open content projects (Cafebabel.com, Wikipedia, and Global Voices) has their own methodology and their own toolset to help them translate content from one language to another. Global Voices translation coordinator (Lingua Director), Alice Backer, recently sent out an email to the mailing list asking for sample workflows. The responses from some of the translators reveal just how varied the translation process is depending on which languages you’re involved with. In the next post in this series I’ll take a look at some of the tools our Global Voices’ translators are currently using and some of the obstacles they’ve encountered.



6 comments | Feed for comments | Trackback URL

  1. 1TonNetNo Gravatar from United States says:

    I am looking forward to read the next post about all those obstacles GVO translators have to work with.

  2. 2KayiwaNo Gravatar from Uganda says:

    I cant wait to read the traslantions here
    can i translate too in my local languange?

  3. 3Élizabeth RobertNo Gravatar from Canada says:

    Hello David,

    thanks a whole lot for all these infos and updates. It was a very educative and informative tour I’ve had. I also took the liberty to share your blog post with my fellow Literary Translators from the Literary Translators Association of Canada.

    You are doing an amazing job and I truely enjoy reading your blog.
    Thanks to sven for redirecting me to you too ;0)

    Feliz Navidad en Medellín y quel año 2008 sea aun mas prospero para ti y los tuyos…. salud, desarrollo y felicidad!

    élizita
    http://www.nochesdepoesia.com

  4. 4osoNo Gravatar from United States says:

    TonNet,

    Should be up by tomorrow. Thanks for stopping by. :)

    Kayiwa,

    Is your local language Luganda? I’m curious to hear how well this online English - Luganda dictionary works.

    Élizabeth,

    That’s very sweet of you. I think that Sven has convinced me to pay Montreal a visit this summer so hopefully I’ll get to check out Noches de Poesia in person. Happy holidays.

  5. 5El Oso (Pecoso) » Blog Archive » Open Software, Open Content, Open Translation Part V from United States says:

    [...] the last post of this series, we looked at how producers of open source software and open content are usually [...]

  6. 6ÉlizitaNo Gravatar from Canada says:

    4oso from says:
    Monday, December 24th, 2007 at 12:15 pm
    Safari 523.10.6 on MacOS X

    Élizabeth,

    That’s very sweet of you. I think that Sven has convinced me to pay Montreal a visit this summer so hopefully I’ll get to check out Noches de Poesia in person. Happy holidays.

    Hello Oso,

    Happy late B-Day! BONNE FETE!!! FELIZ CUMPLEANOS!!! Sorry I was unable to write on the DAY OF… I<ve been wanting to ask… when ARE you coming? I have a radioshow here in Montreal… called POESIA LIBRE and would love to interview you when you come… it airs every other Wednesday night 21h40 til 22h.
    I’ve just closed our Noches de poesia for summer break… but will reopen on September 3rd… any chance to see you in 2008?

    saludos,
    e.



Share Your Comments


h1