πΊοΈ 1. Scope of Dusun Used
This dictionary does not strictly follow a single Dusun dialect. It focuses on widely understood Dusun varieties from the west coast and interior regions. The goal is mutual intelligibility, not dialect purity.
This dictionary is designed to be beginner-friendly, making it ideal for kids, new learners, and anyone starting their Dusun language journey. We aim to document practical Dusun that connects speakers across regions.
π― 2. Intended Use
This dictionary is not an official school syllabus and is not intended to replace formal linguistic references. It is designed for:
- Daily learning and casual reference β Quick lookups for learners and speakers
- Translator support β Practical translation between Dusun, Malay, and English
- Language preservation β Documenting vocabulary for future generations
- Digital tools and AI training β Building corpus data for neural machine translation
- Community usage β Supporting conversations, social media, and cultural expression
π 3. Getting Started
DusunDictionary offers multiple tools to help you learn and use the Dusun language:
π Translators
Translate between Dusun, English, and Malay with our intelligent translation tools
π Lessons
Learn Dusun grammar, sentence structure, pronouns, affixes, and language fundamentals
π¬ Common Phrases
Essential Dusun phrases for everyday conversation with English and Malay translations
π€ Chat with Dusun AI
Practice Dusun conversation with Dusun chatbot characters that respond in natural Dusun language
π Audio Expression
Listen to randomized Dusun sentences with English/Malay translations for immersive pronunciation practice
πΌοΈ Visual Learning
Learn Dusun vocabulary through images and visual associations for better retention
π― Practice & Quizzes
Test your knowledge with interactive exercises and practice quizzes at different skill levels
π Dialogs
Listen to audio dialogs with comprehension quizzes to improve your listening skills naturally
π 4. Pronunciation & Spelling Conventions
4.1 Final Vowels
Words ending with vowels are not always pronounced the same way. Some final vowels may sound softer or more clipped, while others sound more open or tense depending on speaker, emphasis, or context. These differences are generally not marked in spelling to keep the dictionary readable and consistent.
4.2 Apostrophe Usage
The apostrophe (') is used in this dictionary to indicate words with a sharp or short (glottal) sound, distinguishing them from similar words without the apostrophe.
ko= you (second person pronoun)ko'= or (conjunction)
Both words would be spelled "ko" without the apostrophe. The apostrophe in ko' indicates it has a glottal/short sound that distinguishes it from ko (you), which has a different pronunciation.
4.3 No Diacritics
Diacritics (Γ², Γ‘, Γ¨) and phonetic symbols are generally avoided to prevent clutter and to remain consistent with common Dusun writing practices used in everyday communication.
4.4 Consonant Sounds
Some consonants may be pronounced slightly differently across regions:
ngβ a single sound, similar to English "sing" (not "n" + "g")nyβ similar to Spanish "Γ±" or English "canyon"rβ typically trilled or tapped, varies by speaker
π§ 5. Audio Pronunciation Note
Some audio recordings are spoken by non-Dusun speakers. As a result, certain sounds, stress patterns, or intonation may differ slightly from native pronunciation. These recordings are meant as general pronunciation guidance, not as absolute standards.
π 6. Variants and Alternative Forms
Variants in spelling and pronunciation exist because of the different dialects of Dusun in Sabah. Where possible, commonly used variants are provided. This may include dialectal differences, alternative spellings, and commonly spoken forms.
We are gradually adding more variants to cater for the many dialects across Sabah, making the dictionary more inclusive and useful for speakers from different regions.
naaii/naawiβ finished/completedosongkial/osongkiwalβ naughtykondi/kondiuβ eagle
If you notice missing variants or regional differences from your dialect, please let us know through the feedback form!
π 7. Modern Usage & Loan Influences
Many words and terms used here reflect simple, daily conversational Dusun. To ease communication and adapt to modern usage, Dusun speakers may adopt, modify, or simplify words from Malay or occasionally English. These forms are included to reflect real-world usage, not to replace traditional vocabulary.
tayar(tire) β from Malay "tayar"miting(meeting) β from English "meeting"henpon(handphone) β from English "handphone"
This approach ensures the dictionary remains relevant and useful for contemporary speakers while preserving traditional terms where they're actively used.
π¬ 8. Translation Tips
Getting the best results from our AI translator:
8.1 For Best Accuracy
- Use complete sentences
- Provide context when needed
- Keep sentences reasonably short
- Use proper punctuation
- Single words without context
- Very long paragraphs
- Mixing multiple languages in one sentence
- Overly complex nested clauses
8.2 Understanding Output
The AI translator uses advanced grammar rules and a comprehensive dictionary to provide contextually accurate translations. However, it may occasionally:
- Choose one variant over another in ambiguous cases
- Use modern loan words instead of traditional terms
- Struggle with idioms or highly figurative language
- Need clarification for words with multiple meanings
8.3 Language-Specific Input Guidelines
The quality of your translation depends heavily on the quality of your input. Here's why and how to write better source text:
Why Input Quality Matters
Our translator uses a multi-step translation pipeline. When translating from Malay to Dusun, the system first converts Malay to English, then English to Dusun. This means:
- Errors compound β Poor Malay input produces inaccurate English, which then produces worse Dusun
- The AI must understand first β Before translating, the system needs to comprehend your source language accurately
- Context gets lost β Informal abbreviations and slang can confuse the intermediate translation steps
For Malay Input
Use standard, formal Malay (Bahasa Melayu Baku) for best results:
sy x tauβ "I don't know"ape yg ko nkβ "What do you want"kt mneβ "Where"org tu dh pgβ "That person already left"mcm mne nk buatβ "How to do"
Saya tidak tahuApa yang kamu mahuDi manaOrang itu sudah pergiBagaimana cara membuat
For English Input
Use clear, grammatically correct English:
Where you go yesterday?I go market alreadyHe no come todayCan you help me or not?
Where did you go yesterday?I already went to the marketHe is not coming todayCan you help me?
Common Issues to Avoid
- SMS/Chat abbreviations β Write full words: "saya" not "sy", "tidak" not "x"
- Code-switching β Avoid mixing languages: "Saya want to pergi" should be either all English or all Malay
- Regional slang β Use widely understood terms instead of local colloquialisms
- Missing punctuation β Always use proper periods, question marks, and commas
- Incomplete sentences β Write complete thoughts with subjects and verbs
Why We Don't Auto-Correct
You might wonder why the system doesn't automatically fix informal input. Here's why:
- Meaning preservation β Auto-correction might change your intended meaning
- Context matters β The same abbreviation can mean different things in different contexts
- Accuracy priority β We prefer accurate translation of clear input over guessing unclear input
- Learning opportunity β Writing properly helps you learn both source and target languages better
Special Cases
Here are some additional guidelines for specific situations:
- Numbers β You can use numerals (5) or write them out (lima/five); both work
- Names β Proper names (people, places) should be capitalized and will typically remain unchanged
- Loanwords β Modern loanwords (teknologi, komputer) are generally understood and may pass through untranslated
- Technical terms β If translating specialized vocabulary, provide context or simpler explanations
- Idioms β Idiomatic expressions may not translate well; consider using literal descriptions instead
- Is this standard Malay/English, or informal chat language?
- Are all words spelled correctly and fully written out?
- Does my sentence have proper punctuation?
- Would someone unfamiliar with my region understand this?
β οΈ 9. Known Limitations
We believe in transparency about what our tools can and cannot do:
9.1 Dictionary Coverage
- Not all dialectal variations are included yet
- Technical and specialized vocabulary is still expanding
- Some traditional terms may be missing
- Audio is not available for all entries
9.2 AI Translator Limitations
Translation accuracy depends heavily on the AI model being used. We continuously experiment with different models to balance accuracy and affordability, as premium models can be costly for a community project like this.
- May struggle with idiomatic expressions
- Complex nested sentences might not translate perfectly
- Cultural context may not always be captured
- Formal vs. casual register distinctions are developing
9.3 Regional Variations
Dusun has significant dialectal variation across Sabah. Our focus on mutual intelligibility means some region-specific terms or pronunciations may not be represented.
π° 10. Translation Cost & Sustainability
The AI translator operates using token-based processing, where each translation consumes computational resources that incur real costs. To keep the service running sustainably:
- Funding source β Costs are covered through available project funding (monthly allocations or pay-as-you-use)
- Model selection β The system may dynamically switch between AI models based on available resources
- Quality variation β Advanced models provide higher accuracy when funding allows; affordable models maintain service during limited funding
- Consistency maintained β All outputs use the same dictionary rules and linguistic structure regardless of model
This adaptive approach prioritizes keeping the translator accessible rather than unavailable. While translation quality may vary slightly depending on the model in use, all versions produce usable results for learners and daily communication.
π 11. Help Us Improve
This is a community-driven project that grows with your input. You can help by:
- Reporting errors β Found a mistranslation or typo? Let us know!
- Suggesting additions β Missing words, phrases, or dialectal variants
- Sharing usage patterns β Tell us how words are used in your region
- Testing translations β Try the AI translator and report what works well or poorly
π± 12. How You Can Help Preserve Dusun
Language preservation is a community effort. There are many practical ways you can contribute to keeping Dusun alive for future generations.
Start at Home - The Most Effective Way
Speak Dusun with your family. If you have children, speak to them in Dusun at home. If your parents are Dusun speakers, insist on speaking Dusun with them. Daily use within families is the most powerful way to keep a language alive.
Help Train AI & Machine Translation
For Neural Machine Translation (NMT) to work, AI needs to learn from real examples. Here's how you can help:
Where to Share Parallel Translations
If you know Dusun, please share parallel translations (Dusun sentences with English/Malay translations) on public platforms where AI can find them:
- Social Media: Facebook, Twitter/X, Instagram captions
- Content Platforms: Reddit, Medium, personal blogs
- Video Platforms: YouTube (with subtitles), TikTok (with text overlays)
- Documents: Public PDFs, Google Docs, Wikipedia articles
- Forums & Communities: Language learning forums, community websites
- Educational Sites: Lesson materials, study guides, teaching resources
The more parallel translations we have across the internet, the better AI systems can learn Dusun, bringing us closer to seeing Dusun listed in Google Translate.
Our 2026 Target: 10,000 Parallel Translations
Having achieved 10,000 total entries (including single words, two-word phrases, and longer expressions) by the end of December 2025, we're setting a new ambitious target: 10,000 parallel sentence/phrase translations by the end of 2026. This means complete sentences or multi-word phrases with their translations, not just individual word entries.
Don't Worry About Dialect Differences
Dialect variations are not a problem for NMT training. What matters most is that AI learns the sentence structure and word patterns of Dusun. Once the AI understands the grammatical structure and word order, we can easily introduce dialectal word variants. The underlying patterns remain consistent across dialects.
Other Ways to Contribute
- Share this dictionary with Dusun learners and speakers in your community
- Use Dusun on social media regularly to increase its online presence
- Create Dusun content - videos, stories, songs, podcasts with transcriptions
- Teach others - help friends, family, or students learn basic Dusun
- Document traditional knowledge - record elders speaking, preserve stories and cultural practices
- Support Dusun media - follow, share, and engage with Dusun language content creators
Closing Note
This project prioritizes clarity, usability, and real spoken usage while remaining linguistically informed. Choices in spelling, categorization, and pronunciation are intentional and documented for transparency.
Our goal is not linguistic perfection, but practical preservation β ensuring Dusun remains a living, usable language for daily communication, digital spaces, and future generations.
Thank you for being part of this language preservation journey. Together, we're keeping Dusun alive and thriving.
β Back to Top π Back to Home