Given a text file in UTF-8 encoding, written in Chinese characters:
Look up each individual character against an external text file hosted in the same directory (dictionary file) and add a number (1-6) after each character based on the entry in the dictionary file. Do NOT add a number after punctuation (Chinese or English), non-Chinese words (there will be English words in the files; these will have a white space before and after them) or whitespaces. Place a number "7" after any Chinese characters that are not found in the dictionary file. Place a number "6" after any Chinese characters that occur twice in the dictionary file with different values.
Example dictionary file:
Hello I found a trick on how to distinguish chinese characters from english/punctuation. The rest of the requirements should be rather easy to implement. I'm making a demo as we speak. Thanks.
5 pekerja bebas membida secara purata $30 untuk pekerjaan ini
Hello Dear We phase similar type of problem for [login to view URL] this website because in this we used hebro , so i have experience to handle utf-8 character . Thanks
Hello, I think writing this script can be done pretty fast using PHP and I would appreciate you accepting my bid! I could provide you with a stand-alone PHP script within less than one day!