Skip to main content

English to Malay Automated Translation Resources –Proverb

[updated 12 Sept 2013]
These are several resources for my reseach on Malay Proverb Treatment in Malay-English Machine Translation…
  1. Google Translate - my dream to achieve... getting nearer
  2. BabelFish - another online translator by Altavista, now own by Yahoo!
  3. OpenLogos - machine translation, I can't get it to work for me yet.
  4. Statistical Machine Translation - the research page of SMT
  5. Moses: a state-of-the-art open source SMT system - got to try this... soon.
  6. CitCat - They have done English to Malay and Malay to English online translation... but still in BETA
  7. Machine Translation - comparison of machine translation applications on wikipedia
  8. Lost In Translation - ever think about funny stuff here? here's one...
  9. My favourite Malay-English translation #
  10. Proverb resources from all over the world, including proverb collections, articles on proverbs, proverb journals, and reference works -
  11. Articles for translators and translation agencies: Translation Theory: A Model of Translation Based on Proverbs and Their Metaphors: A Cognitive Descriptive Approach -
  12. Manual Translation -
  13. Stanford Natural Language Processing and Computational Linguistics Group -
  14. Google Translation Research group -
  15. University Research Program for Google Translate - The University Research Program for Google Translate provides researchers, in the field of automatic machine translation, tools to help compare and contrast with, and build on top of, Google's statistical machine translation system.Participation in the program will allow researchers programmatic access to Google's translation service.
Proverb/Idioms and Dictionaries
  1. Peribahasa Scanner - capable of detecting Malay proverb from a sentences or a part of sentence. There are currently 3000+ entries of Malay proverbs available in the database. Also capable of suggesting Malay proverb based on Malay sentence context.
  2. Large Malay idioms/proverbs collection by -
Open Journal Directories
Some open access thesis/dissertation directories, some have fulltext documents;
  1. DORAS (Dublin City University Online research Access Service) -
  2. Ohiolink Electronik Thesis and Dissertations Center -
  3. PQDT OPEN -
  4. UDL Thesis -
  5. UWC Thesis Online -
  6. Open Thesis -
  1. 8th International Conference of the Asian Association for Lexicography -
  2. International Conference on Translation 2013, Penang, Malaysia
  3. - MALINDO, as an annual international workshop, aims to bring together researchers and practitioners, representing different perspectives, to share and to exchange their ideas on the processing of South East Asian languages. Because of the venue of the 2012 Workshop, the Organisers of MALINDO will anticipate papers related to the processing of the varieties of Borneo languages. MALINDO’s objective is to highlight the effort and promising works done on the processing of the South East Asian languages, thus attracting more students to the CL and NLP fields.
  4. Computational Linguistics / NLP Conferences – 2013
  5. International Journal of Computer Science Issues -


Parallel Corpora


  •  European Corpus Initiative Multilingual Corpus I (ECI/MCI)
    A 98 million word corpus, covering most of the major European languages, as well as many others (viz. Albanian, Bulgarian, Chinese, Czech, Dutch, English, Estonian, French, Gaelic, German, Greek, Italian, Japanese, Latin, Lithuanian, Malay, Spanish, Danish, Uzbek, Norwegian, Portuguese, Russian, Serbian, Swedish, Turkish, Tibetan). The primary focus in this effort is on textual material of all kinds, including transcriptions of spoken material. ECI/MCI has 46 subcorpora in 27 (mainly European) languages. The total size of these is roughly 92 million (lexical) words. The corpora are marked up using TEI P2 conformant SGML (to varying levels of detail), with easy access to the source text without markup. Twelve of the component corpora are multilingual parallel corpora with from two to nine sub-corpora. All the alphabetic corpora (there is some Japanese and Chinese) are encoded in the ISO LATIN family of 8-bit character sets (ISO 8859-1, -5 and -7). A complete list of the contents is available following this link.
    Unusually cheap: the ECI/MCI is available directly from ECI at a price of 95 DFl (for payments made by credit card or Eurocheque); 110 DFl (for payments by bank transfer); or 120 DFl (for payments by cheques other than Eurocheques). Need only to sign a license agreement available (Postcript or LaTex version) at this address or this other one.
    It is also available at 35$ price (or trough membership) from the LDC in a CD-ROM in High Sierra format (ISO 9660), readable on UNIX, MSDOS and Apple systems at least: cf. this page.

    Complete listing:
    Some samples:

  1. Treatment of Germany Proverb – Dimitra Anastasiou
Articles on the issue of Malay Proverb Treatment in Machine Translation
  1. Khirulnizam Abd Rahman and Norita Md Norwawi, 2013. THE CHALLENGES OF HANDLING PROVERBS IN MALAY-ENGLISH MACHINE TRANSLATION. International Conference on Translation 2013. Universiti Sains Malaysia, Penang.
  2. Khirulnizam Abd Rahman and Norita Md Norwawi, 2013. Proverb Treatment in Malay-English Machine Translation. International Conference on Machine Learning and Computer Science (IMLCS'2013). Kuala Lumpur.
  3. Khirulnizam Abd Rahman and Norita Md Norwawi, 2012. Malay Proverb Detection; Implementation on Mobile Environment. Mobile Computing, Applications and Services, September 2012, Universiti Teknikal Melaka, Malaysia.
  6. The Function of Stemming and Stop-word removal in Filtering Malay Proverb.– (not yet published)
  7. Comparison of Pattern Matching, Bags of Words, N-gram and Bayesian in Proverb Detection.– (not yet published)
  8. Implementation of Malay Proverb Filtering in Moses – (not yet published)
This page is maintained by


  1. I would like someone help me to translate this sentence into Malay:

    The analytical solutions are obtained by applying direct integrations and transformation to the modified Bessel's equation.

    1. Penyelesaian analitikal diperolehi dengan menggunakan integrasi langsung dan transformasi kepada persamaan Bessel yang diubahsuai.


Post a Comment

Popular posts from this blog

Several English proverbs and the Malay pair

Or you could download here for the Malay proverbs app – English proverbs and the Malay pair Corpus Reference: Amir Muslim, 2009. Peribahasa dan ungkapan Inggeris-Melayu. DBP, Kuala Lumpur CTRL+F to search Proverbs in English Definition in English Similar Malay Proverbs Definition in Malay 1 Where there is a country, there are people. A country must have people. Ada air adalah ikan. Ada negeri adalah rakyatnya. 2 Dry bread at home is better than roast meat home's the best hujan emas di negeri orang,hujan batu di negeri sendiri Betapa baik pun tempat orang, baik lagi tempat sendiri. 3 There's no accounting for tastes We can't assume that every people have a same feel Kepala sama hitam hati lain-lain. Dalam kehidupan ini, setiap insan berbeza cara, kesukaan, perangai, tabia

Most used STRING functions in my PHP coding

These are my favourite string manipulation functions in my daily coding life. Dedicated especially to Web Programming students. Read them and have fun. Expect a happiness after a storm , and you’ll find your “inner peace”… This post is still in draft. I’ll update and refine with more examples that I’ve personally develop. More after the break…

Bootstrap Template for PHP database system - MyCompanyHR

HTML without framework is dull. Doing hard-coded CSS and JS are quite difficult with no promising result on cross platform compatibility. So I decided to explore BootStrap as they said it is the most popular web framework. What is BootStrap? - Bootstrap is the most popular HTML, CSS, and JavaScript framework for developing responsive, mobile-first web sites. (   ) Available here - Why you need Flat-UI? Seems like a beautiful theme to make my site look professional. Anyway you could get variety of BootStrap theme out there, feel free to select here Flat-UI is from DesignModo - Web Programming MyCompanyHR – PHP & MySQL mini project (with Boostrap HTML framework) Template 1: Template for the Lab Exercise. This is a project sample of a staff record management system. It has the PHP structured co

Contact Us at

Powered by EMF HTML Contact Form

The Challenges of Handling Proverbs in Malay-English Machine Translation – a research paper

*This paper was presented in the 14th International Conference on Translation 2013 ( ). 27 – 29 August 2013, Universiti Sains Malaysia, Penang. The PDF version is here: The test data is here: Khirulnizam Abd Rahman, Faculty of Information Science & Technology, KUIS Abstract: Proverb is a unique feature of Malay language in which a message or advice is not communicated indirectly through metaphoric phrases. However, the use of proverb will cause confusion or misinterpretation if one does not familiar with the phrases since they cannot be translated literally. This paper will discuss the process of automated filtering of Malay proverb in Malay text. The next process is translation. In machine translatio