As promised! However, the vocabulary has expanded since I first tweeted about the custom dictionary:
Now the word count is at: 14,240 words (as of May 7, 2015)
Click here to download it.
Epically annoyed that WordPress doesn’t support the .DIC file type. Ugh. Don’t know how often I’ll update this post with a new version. Maybe on a quarterly basis?
I can’t remember how long I’ve been developing this custom dictionary (probably started around 2011). Wouldn’t necessarily call it a “development and aid” dictionary, either. Just my personal Bahasa Indonesia dictionary. Whenever I get new assignments, there will always be new terms to cross-check (I always Google them first, and do check to ensure that they are EYD-spelled and KBBI compliant) and add to the dictionary. The longer I’ve had this, the easier it gets—the less words I add into my dictionary and personal glossary (but I don’t think I’ve never really had a document over 1,000 words that didn’t include new words to add—but unlike Variety reviews, I do tend to know what they all mean with few exceptions because, last time I checked, there are ~90,000 entries in the KBBI while there are 1,030,475.3 words in the English language).
I’ve already circulated this among some friends/coworkers (will eventually send this to everyone I know… If I can find the confidence to contact everybody, that is).
Please note that a lot of what you’ll see listed/approved into the custom dictionary will also depend on my personal preferences (or the preferences of current/former employers). For instance, I will never add “monev” into my personal dictionary because I’ve always preferred “M&E” (and by the way, the correct Bahasa Indonesia term is “pemantauan & evaluasi”, not “monitoring dan evaluasi”—seriously pick one, that’s like saying “photokopi” or “fotocopy”). So don’t even try to argue with me on issues like that. Each organisation has standards and the ones I’ve worked for use those—if they don’t have existing standards, I suggest the ones I prefer.
However, if you see something I may have accidentally entered (really, all it takes is pressing Right Click+A on your keyboard), like “annalisis”, please do alert me, so I can have it removed (the correct term is ‘analisis’, by the way—never use ‘analisa’, that’s not even a word).
Also, certain words like “sumberdaya” will be listed as one word, so if you or your organisation/project prefer to spell that out as two (sumber daya), be sure to Ctrl+F and replace any “sumberdaya” with “sumber daya” as a precaution. This is just me adjusting with other people’s preferences.
Watch out for the word “pasca”. the origins of this word is Sanskrit, not English, from “paścat” which means ‘after’ (subsequent to) and is used as a prefix like the word ‘post’ in English—the letter ‘c’ pronounced like an English ‘ch’ or Dutch ‘tj’ and the Anglicised pronounciation ‘paska’ frequently used by people with colonial-hangover on television is incorrect. Anytime the word “pasca” is used, whatever word follows should never been preceeded by a space. Ex: “pascapanen” (not “pasca panen”, although hyphenating “pasca-panen” should be tolerated if that’s your organisation’s house style). The word alone has been added into the custom dictionary—so you will no longer get the red zigzag underline spell-check thing, but it’s your own responsibility to ensure that it’s used in the correct format.
Please also don’t get overly excited over the word count. I know it seems a lot, but my dictionary includes really technical terminology and obscure words, including Latin names for various sea creatures, marine flora, and stuff like fish herpes. But I’m sure you’ll make general use of some of the words that were used in certain contexts as part of legal terminology, such as “adat” (from ‘hukum adat’, customary law) although I doubt you’ll ever need “adatrecht”. The figure also includes bits from math formulae, so… No, don’t get excited.
You’ll find some that involve:
- Cross-cutting stuff (gender, human rights)
- Environmental issues (climate change, carbon trade, marine life)
- General infrastructure terminology
- Governance (common job titles, official names of organizations)
- IT terminology (daring/luring, unduh/unggah, dikinikan, peramban, berkas, tautan, antarmuka)
- Labour issues (sweatshops, domestic workers, child labour)
- Legal (types of Indonesian laws and regulation)
- Project management (in development and aid, and programmes/projects)
- Transportation and urban mobility (railways, ports, aviation, buses, roads, and safety)
- Water and sanitation
- And other random stuff…
Since I am an actress as well, you will also occasionally find industry-specific terminology, such as “douchecanoe” (or the occasional strange out-of-place word I picked up during the course of my non-fiction ghostwriting gigs—seriously, don’t ask! I don’t tell).
And while we’re on topic, let me just take the opportunity to make this very clear to my readership: I would never marry a man who doesn’t use the Oxford comma (it just goes against my life principles). That’s all.
Updates Log: January 1, 2015: Version 1.0 (12,201 words) January 27, 2015: Forgot to mention inclusion of IT terminology. January 16, 2015: Added note on "pasca". May 7, 2015: Version 1.1 (14,240 words—well, this is embarrassing: This update includes an eat-my-hat moment, in which I add the term 'monev' after I said I would never—what was I supposed to do? the Indonesian government's fond of the term).
November 13, 2016: Version 2.0 (15,823 words).