Wikimedia blog

News from the Wikimedia Foundation and about the Wikimedia movement

Posts Tagged ‘i18n’

Universal Language Selector coming to all wikis

The Universal Language Selector (ULS) provides a flexible way to configure and deliver language settings like interface language, fonts, and input methods (keyboard mappings). It combines the features of two earlier Mediawiki extensions Narayam and WebFonts. From June 11, 2013 on, ULS will be made available to all Wikimedia wikis in 5 phases.

In the first phase, ULS will replace the Narayam and WebFonts extensions on 84 wikis. User preferences from the replaced extensions will not be preserved. Affected communities will be notified by the Wikimedia Language Engineering team of the upcoming change.

In the 5 weeks that follow, ULS will be deployed on Wikipedias in size 11-20 (phase 2), all projects without language versions (phase 3), English language Wikipedia (phase 4) and all other wikis (phase 5).

The Universal Language Selector can be visible in two ways: In the sidebar for wikis with language versions, like Wikipedia, or in the personal toolbar at the top of wiki pages for wikis without language versions, like Wikimedia Commons and Meta-Wiki. Based on the geographic location of users, the initial set of language preferences is presented. Users can set the input methods and fonts to that they want to use. Logged-in users can also change the language for the MediaWiki menu items.

Universal Language Selector is already available on several Wikimedia wikis like Wikimedia Commons and Meta-Wiki. The appearance on wikis like Wikipedia is available in the beta installation of the English language Wikipedia on Wikimedia Labs. A cog icon is present in the “Languages” section of the sidebar menu. Clicking the icon opens the Language settings panel that can be used to set the display and input settings.

Please have a look at the Universal Language Selector feature description or the Frequently Asked Questions for more detailed information.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Volunteers and staffers teach, learn, create at Amsterdam hackathon

149 participants from 31 countries came to Amsterdam in late May to teach each other and improve Wikimedia technology.

developers near the sticky-note wall

Developers work near sticky-notes representing topics and ideas at the Amsterdam hackathon in May 2013.

Technologists taught and attended sessions on how to write and run a bot, use the new Lua templating language, how to move from Toolserver to the new Wikimedia Labs, design, Wikidata, security, and the basics of Git and Gerrit. Check out the workshops page for slides, tutorials, and other reference material; videorecordings of sessions are due for uploading to Wikimedia Commons soon.

Wikimedia Netherlands, Wikimedia Germany, and the Wikimedia Foundation subsidized travel and accommodation for dozens of participants, enabling the highest participation in this event’s history. As one subsidized participant wrote, “One of the wonderful things about the Wikipedia world is the support given to the volunteers from the different chapters and the parent Wikimedia Foundation to promote community growth and building awesome stuff that the whole world can use….It’s such surprises that makes one love contributing to open source.” Organizers also put together a social events program that included a boat cruise of Amsterdam’s canals.

Participants are still listing what they accomplished or learned during the event, but here’s a sample:

  • The Wikimaps project aims to present historical maps on Wikimedia sites, and to work together with OpenStreetMap Historic “to find a common way to model historical geodata” (more details). Maps aficionados discussed the project and made plans in Amsterdam. One volunteer, Arun Ganesh, wrote a prototype wiki atlas: an interactive SVG file that comes with automatic labelling (details).
  • Moritz Schubotz, a volunteer, worked on improving search and math functionality in MediaWiki.
  • The Foundation testing and quality assurance team improved test coverage and the test environment, and taught other participants how to do QA for Wikimedia.
  • Pau Giner, a designer at the Foundation, wrote code to use an SVG for the collapsible section arrow in MediaWiki’s Vector skin. This will make the image less fuzzy-looking.
  • two technologists at Amsterdam hackathon

    A WMF staffer holds a microphone to amplify a volunteer’s voice during the closing demo session at the Amsterdam hackathon.

    User:Ruud Koot wrote a Wikivoyage listing editor that will make it easier to improve the specific parts of a travel suggestion without having to load the whole page.

  • Several volunteers worked on the account creation tool and process for English Wikipedia, to help the ACC team deal with prospective editors who have not been able to create an account via the web interface. The improved tool (code) streamlines the workflow, helping volunteers do their work faster.
  • A group of staffers and volunteers interested in statistical data improved the User Metrics API‘s reliability and security. Another wrote a proof-of-concept MediaWiki extension enabling editors to embed Limn graphs in wiki pages via wikitext.

So far, 90 participants have submitted the post-event survey and results are largely positive, with (of course) several suggestions for improvements in the future. For instance, next year, organizers should help trainers prepare more, and help participants with common interests find and work with each other more easily.  We don’t yet know where or when next year’s developer meeting will be, but it’ll happen; subscribe to the low-traffic wikitech-announce mailing list to hear when it’s settled.

You may also wish to read the Wikipedia Signpost report on the event.

Thanks are due to staffers at the Wikimedia Foundation, Wikimedia Netherlands, and Wikimedia Germany who made the event possible, and to volunteers who ran the event, especially lead Maarten Dammers.  And thanks to all the participants who gave up their weekend to make our sites better.

Sumana Harihareswara
Engineering Community Manager, Wikimedia Foundation

Language Engineering Development Updates and Events

In the recently concluded development sprint, the Wikimedia Language Engineering team fixed critical bugs for the Universal Language Selector, participated in several events around the world and also announced the release of the latest version of the MediaWiki Language Extension Bundle.

MediaWiki Language Extension Bundle and Updates to ULS

As the date for the first phase of deployment of Universal Language Selector (ULS) draws close, the team has been fixing critical bugs and testing the fixes. These included bugs related to the behavior of the ULS activation ‘cog’ icon. Significant design changes were also made on the input settings panel. Additionally, ULS has been hidden for users who do not use JavaScript on their browsers.

These updates are also part of the latest version of MediaWiki Language Extension Bundle (MLEB). Besides ULS, miscellaneous maintenance bugs were fixed for the Translate extension editor. This further improves the stability of the Translation Editor – TUX. CLDR has been updated to version 23.1.

Amsterdam and Tel-Aviv Hackathons and Community Programs

Members of the Language Engineering team participated and also helped in organizing hackathons at Amsterdam and Tel Aviv. At the hackathon in Amsterdam, organized by Wikimedia Nederland, team members interacted with their peers. Besides attending the workshops, they also submitted and merged patches for various internationalization extensions. A session for automated browser testing with the Wikimedia QA team was particularly well-received in view of the upcoming ULS deployment.

At the hackathon organized by Wikimedia Israel, Amir Aharoni led the event and brought together more than thirty local participants to explore various aspects of contributing to MediaWiki projects. The full report of the accomplishments from the event has been documented by him.

Alolita Sharma presented a talk about Internationalization in Wikimedia projects at IMUG. The entire video of the talk and presentation slides are available online.

Google Summer of Code

The Language Engineering team also welcomed the 4 students who will be participating in Wikimedia’s Internationalization projects for this year’s Google Summer of Code (GSoC). They will be contributing to the jQuery.ime project, Language Coverage dashboard, mobile app for Translate and right-to-left support on VisualEditor.

Coming up

Preparations for deployment of ULS and extending support to the GSoC candidates during the community bonding period are important focus areas during the next 2 weeks.

For information about the Language Engineering team and our projects, please write me at runa at wikimedia dot org or find team members on our IRC channel #mediawiki-i18n on Freenode.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Getting ready for ULS everywhere

The Wikimedia Language Engineering team recently completed their latest development sprint, with a special focus on preparing for the upcoming deployment of the Universal Language Selector (ULS) extension on multiple wikis. The team also hosted a ULS-specific office hour on May 8, 2013 (logs).

ULS deployment prep

The Language Engineering team is working on refining several important features of the Universal Language Selector. This extension will provide an umbrella of services including selection of UI language, input tools and fonts. ULS will superannuate Narayam and Webfonts to provide a unified solution for configuring language settings for MediaWiki. During this development sprint, critical bugs related to positioning of ULS’ activation area and its “cog icon” label were fixed. These affected multiple MediaWiki skins and interlanguage wiki pages. The improved version will be deployed over several phases. More information about the upcoming deployment can be found in the deployment schedule.

ULS testing

ULS features are to be verified based on the test scenarios identified. These scenarios, based on the Cucumber framework, can be adapted for automatic as well as manual testing. The scenarios cover core features of ULS: triggers, language settings panel, display settings, font selection and input tools selection. These have been written in a simple “Given-When-Then” format and provide the steps for easy walkthroughs. The testing instance hosts all the latest updates that are being made. The team is looking for volunteers who can help us with testing and reporting bugs. Let us know if you would like to join and help (write to runa at wikimedia dot org or ping us on #mediawiki-i18n) .

What’s next

The team will be completing all feature changes and testing them by end of the current sprint to be ready for kicking-off the roll-out of phase 1 of ULS. Roll-out will be coordinated by Niklas Laxström with administrators of all scheduled wikis. The team will also be hosting a bug triage session on May 29, 2013 on freenode.net IRC on the #mediawiki-i18n channel.

ULS is live on Commons!

Meanwhile, based on consensus reached by the Commons community, Universal Language Selector and the Translate extensions have been enabled on Commons.

For more details about the Language Engineering projects and ways to participate, please write to me [runa at wikimedia dot org] or ping us on #mediawiki-i18n.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Updates from Language engineering: changes to the Language Selector, new Extension Bundle release

In the recently concluded development sprint, the Wikimedia Language Engineering team made a new release of the Mediawiki Language Extension Bundle (MLEB), fixed bugs related to the Page Translation feature in Translate UX (TUX) and began work on design changes for the Universal Language Selector (ULS). The team also hosted a bug triage session that was well attended.

Input Settings from the ULS Language Settings Panel

Universal Language Selector Design Changes

Development and design changes have been initiated for the Universal Language Selector. The option to position the extension’s main panel in the sidebar was added and this feature is now being polished. Changes to the layout of the Language Settings dialog have been initiated, and usability tests for the proposed design changes were also done.

Using Wikimedia’s default GeoIP locator, ULS can now infer the user’s location and suggest language preferences.

MLEB Release

The April release for the Mediawiki Language Extension Bundle (MLEB) was announced by Amir Aharoni. Starting with this release, MLEB is no longer compatible with MediaWiki 1.19. MLEB 2013.04 and its later versions can only be used with MediaWiki version 1.20.4 or above.

The notable changes include update to CLDR v.23, bug fixes to further stabilize TUX and design changes for the Universal Language Selector. An experimental feature to present a restricted translation environment for new translators was developed for TUX. This is not enabled by default. Basic support for the XLIFF file format has also been added to Translate.

Up Next

During the next development cycle, the team will complete the changes to the Universal Language Selector design and test the features. The team is also participating in Google Summer of Code (GSoC) and the Outreach Program for Women (OPW), and will be working on completing the tasks in the next stages of the programs. More information about the other open projects for internationalization can be found in the master list.

The next Language Engineering office hour will be held on 8 May 2013 at 17:00 UTC (10:00 PDT) in #wikimedia-office on Freenode IRC.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Apply for an internship with the Language engineering team

Quim Gil, the Wikimedia Foundation’s Technical Contributor Coordinator, recently wrote about internship programs that the Wikimedia tech community participates in. These programs provide a valuable platform for a diverse group of contributors and nurture deeper collaboration across open source communities. He also shared details about participating in Google Summer of Code (GSoC) and Outreach Program for Women (OPW) for Wikimedia projects.

The Wikimedia Language Engineering team welcomes students to participate in the projects listed for Google Summer of Code and those listed for the Outreach Program for Women. The projects listed aim to resolve shortcomings or enhance various language tools that the team maintains; they include:

  • improving the jQuery.ime input method library;
  • building browser extensions for stand-alone operation of input methods;
  • creating a dashboard for language coverage information;
  • converting legacy wiki content into translatable entries.

Providing support for nearly 300 languages is no easy feat. There is constant demand for enhancements of tools, and this demand is only expected to grow. The team constantly encourages volunteers including students, language community members and others, to work with them on internationalization challenges. This includes various components like Translate UX (TUX) and Project Milkshake, in which participants can:

  • increase coverage of input methods and font library;
  • improve language rules for the internationalization library;
  • test and prepare validation tools;
  • test and enhance the translation tool;
  • write documents.

They can also contribute by building extensions like SpellingApi and LocalisationUpdate, or even creating usable multi-lingual CAPTCHAs.

Open projects are also added to the master list maintained for all mentorship programs. After ascertaining the availability of mentors, participants can collaborate on a project of interest. If no mentors are listed, students can ask the team on  #mediawiki-i18n (Freenode IRC) or write to me (runa at wikimedia dot org) for more information.

We look forward to all the exciting proposals for our projects for Google Summer of Code and Outreach Program for Women. Student applications close on May 3rd and May 1st respectively.  Time is short — apply now!

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Join the Language Mavens!

Among the Wikimedia projects, Wikipedia has the highest number of individual language projects — 285. The Language Engineering team focuses on building language tools and assets that improve the ability to interact with any article on Wikipedia. Language assets like fonts and input methods are integrated into MediaWiki and its extensions, and our wikis are localized using collaborative translation with translation tools to ensure a decent user experience.

Collaboration in Language Projects and the Language Maven Program

Language Engineering community meetup during GNUnify 2013 at Pune, India

Language tools are constantly evolving to ensure support for our users. It is a slow if not impossible task to scale our small engineering team to support hundreds of languages without close collaboration with our language communities, which have many capable and technically-savvy editors and administrators.

The Wikimedia Language Engineering team has compiled a proposal for the formation of a special interest group named the Language Mavens. With members from various language communities from around the globe, we hope to learn from our users, seek advice, guidance and validation on language features. We hope that the Language Mavens will pull in participation from community members and experts who care about language support features and their adoption in the wikis they read and contribute to.

Getting started with the Maven Program

The Language Maven pilot was rolled out earlier this month on April 13 with a meeting that was well attended. Program scope and activities were discussed. One of the recommendations was to ensure that documents and handy checklists be prepared for easy reference to the language tools available to each language community. Activities that the Mavens can participate in include usability tests, bug triages, testing days and even blogging to share valuable insights about the internationalization tools in their favorite language wiki projects.

The Mavens program is aimed to focus on collecting feedback and providing support for language tools and assets being deployed by the team. This will help develop a long-term user group that will be instrumental in helping other language community members learn more about the latest language features and tools being rolled out. The Maven team expects to meet once every month and communicate through the mediawiki-i18n mailing list. To participate as a Language Maven, please fill up this form to let us know about your interest or ping me (runa at wikimedia dot org) for any questions!

Help us make your language experience better — join the Mavens!

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

A new translation home page, the Maven Program and other updates from Language Engineering

The Wikimedia Language Engineering team continued to develop additional features for Translate UX (TUX) and to fix bugs, including a number of critical bugs that affected Microsoft Internet Explorer. TUX has now been deployed on Wikimedia sites and on translatewiki.net. The team also launched the new outreach program, called Language Mavens, and conducted an office hour.

The view after the user logs in to the new home page of translatewiki.net. This is currently in development.

The view after the user logs in to the new home page of translatewiki.net. This is currently in development.

Redesigning the main page of translatewiki.net

As part of the original specifications for TUX, the translatewiki.net main page is being redesigned. The layout will now include a set of tiles for the project groups that will contain the projects’ logo and statistics. The links conveniently placed on the tiles can be used to access the project messages for translation and proofreading. Once logged in, users will be presented with summarized statistics of their recent activities.

Significant changes are also planned for the account sign-up form. Along with features that will allow newly signed-up users to familiarize themselves with the translation workflow, it’ll make the process of getting the translation permissions smoother. The page will also include artwork to depict cultural diversity from around the world. The main page is currently under preparation, but the special page can already be accessed on translatewiki.net.

Language Mavens: The new outreach program

Language Mavens, the new outreach program for the Language Engineering team, was launched to provide better support to the various language communities within Wikimedia projects. The program was announced during the monthly office-hour (read the logs). After a meeting with the initial group of participants, the pilot phase is now underway. The program aims to regularly connect with the language communities across the Wikimedia projects to get feedback about the user experience and any improvements needed in the language support features. To optimize the internationalization tools and workflows, the program will also facilitate participation in activities such as bug triages and testing days. The team will hold regular meetings with the Maven team members. The program is open for participation, and new members can sign up online. To get more details about the program, you can read the program’s description and write to runa at wikimedia dot org.

Up next

For the next development sprints, the team will continue to work on the projects mentioned above, and also devote significant efforts on redesigning the Universal Language Selector (ULS) interface based on the design review document. The Language Engineering team will host a community bug triage session on April 24th, 2013 at 17:00 UTC. The team also invites students participating in Google’s Summer of Code 2013, to work with them on several internationalization projects.

To know more about our projects and ways to participate, please find us on the IRC channel #mediawiki-i18n (Freenode) or write to runa at wikimedia dot org.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Language Engineering Sprint Update: Translation User Experience improvements, testing and coverage

The Wikimedia Language Engineering team completed its recent development sprints with a focus on feature completeness of ‘Translate User eXperience’ or ‘TUX’ for deployment and also made preparations for its new community outreach project. Below are some of the highlights from the sprint.

Work is underway on the design of the new main page for translatewiki.net

Work is underway on the design of the new main page for translatewiki.net

 

Translate Editor – TUX, has been deployed: The Language Engineering team has been blogging on improvements to the Translate Extension which has now been deployed to WMF sites and been enabled as the default editor on translatewiki.net. Issues encountered while using the new editor can be reported via Bugzilla. More details about TUX’s design features can be found in our previous blog post.

New Language Outreach Program: A new outreach and support program to facilitate feedback from our language wiki projects is also currently in development. The program would facilitate focused feedback about the tools being developed and reaching out with solutions to the larger communities working on Wikimedia projects in various languages. The pilot phase is scheduled for launch in the coming weeks and an announcement with the details of the program will be shortly made.

Updates to the MediaWiki Language Extension Bundle (MLEB): In other news, Amir Aharoni announced the release of the new version of the MediaWiki Language Extension Bundle (MLEB). Besides development updates to TUX, version 2013.3 of MLEB includes bug fixes to the Universal Language Selector (ULS) as well as new fonts for Hebrew, Javanese and Arabic. Last but not least, a Marshallese input method has been contributed to jQuery.ime by Nick Doiron.

Testing plan for language tools: A preliminary outline of a testing plan for all internationalization tools currently maintained by the team was completed. Guided by earlier discussions with the Fedora Localization Testing Group (FLTG), the initial draft includes setup of test environment, preparing test scenarios, collecting feedback and analyzing results.

Preparations for visualization of language coverage information: Work also continued on the Language Coverage Matrix, a collection of data about the availability of language tools for different languages in Wikimedia projects. Currently, the focus is to prepare a technical specification for automated presentation and access to this information.

Event participation and other news: Siebrand Mazeland represented the Wikimedia Language Engineering team at the Internationalization and Localization Conference organized by Lingoport in Santa Clara. (More details: presentation slides, and talk recording)

The ongoing development sprint will focus on creating a new design for the translatewiki.net homepage, launch of the pilot phase for Language support outreach program, and publication of the Language Engineering roadmap for the next fiscal year 2013-2014. The Language Engineering team is also looking to hire JavaScript and PHP engineers with deep experience in i18n and l10n technologies. Additionally, a few of the projects that are open for participation have been listed here.

The Language Engineering team is available to answer any technical questions you may have about the tools it develops. You can join us at our monthly office hour scheduled this month on April 10, 1700 UTC and 1000 PDT or find us on irc.freenode.net at #mediawiki-i18n. Logs from the last office hour held on March 13, 2013 can be found here.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering

Report from the Spring 2013 Open Source Language Summit

Fortuna i forti aiuta, e i timidi rifiuta — an Italian proverb

The Wikimedia Foundation and Red Hat jointly organized the Second Open Source Language Summit on February 12th and 13th, 2013. The summit was held at the Red Hat engineering center in Pune, India. Similar to the previous summit, this face-to-face work session was focused on internationalization (i18n) and localization (l10n) features, font support, input method tools, language search, i18n testing methods and standards. The sessions were work sprints, each with special focus on a key area. Participants included core contributors from the Wikimedia Foundation, Red Hat (including Fedora SIG members), KDE, FUEL, Google and C-DAC. Below is a summary of what was accomplished during these two days.

During the summit, teams from different organizations came together to discuss language-related challenges, and worked together on features and tools to address them.

During the summit, teams from different organizations came together to discuss language-related challenges, and worked together on features and tools to address them.

Input Methods

Parag Nemade and Santhosh Thottingal worked on making additional input methods available for the jQuery.IME library. 60 input methods, covering languages like Assamese, Esperanto, Russian, Greek, Hebrew were added bringing the total to 144. Also IMEs from the m17n library missing from the jQuery.IME library were identified.

Translation tools, translatewiki.net & FUEL Sprint

Siebrand Mazeland and Niklas Laxström, together with Ankit Patel, Rajesh Ranjan and Red Hat language maintainers, worked to identify more tools that could be used as Translation aids in a translation system. The FUEL project aims to standardize translations for frequently used terms, translation style and assessment methodology. Until now it has focused mostly on languages of India. The FUEL project can now be translated in translatewiki.net. Pau Giner demonstrated new designs for the translation editor and terminology usage, remotely from Spain.

Language Coverage Matrix

To better evaluate the needs for enabling support for languages, a matrix detailing the requirements and availability of basic and extended features is being drawn up. With 285 languages currently supported in Wikimedia and more than 100 in Fedora, this document will be instrumental in bridging the gaps and porting features across projects and platforms. Key areas of evaluation include input methods, fonts, translation aids like glossaries and spell-checkers, testing and validation methods, etc. A preliminary draft was created during the summit by Alolita Sharma, Runa Bhattacharjee and Amir E. Aharoni.

Fonts, WebFonts

An initiative to document the technical aspects of fonts for scripts for languages spoken in India started during the language summit. For each of the scripts, a reference font will be chosen and each font will be explained in detail to intersect with the Open Type font specification as a standard. It will aim to act as a reference document for any typographer working on Indian language fonts. Initial draft and outline of this document was prepared during the second day of the language summit, mainly by Santhosh Thottingal and Pravin Satpute.

Testing Internationalization Tools

Finding suitable methods for testing internationalized components and contents was the major focus of this sprint, with the Fedora Localization Testing Group (FLTG) and Wikimedia’s Language Engineering team sharing details of their testing methods. The FLTG conducts Test Days prior to Fedora beta releases with a test matrix targeted at specific core components, and Wikimedia uses unit tests for frequent testing of their development features. The FLTG showed its plans to integrate the screenshot comparison method for testing localized interfaces. This method will be useful for Wikimedia too. Extending the method for web-based applications and Wikimedia’s language requirements (e.g. right-to-left) were identified as areas for collaboration.

More news from the Language Summit can be found in the tweets, the session notes and the full report.

Runa Bhattacharjee, Outreach and QA coordinator, Language Engineering