I have a dataset that contains 1.6 million tweets with two columns "class" and "text". I need a code that can do the following :
1- correct the spelling of each tweet
2- Replacing slang and abbreviations, like “omg” to “oh my god” taking into account that it could be written like "OMG" or "omg"
3- Replacing repetitions of question mark with “multiQuestionMark” and repetitions of exclamation mark with "multiExclamationMark"
The code should fit in the exisiting code in the attched file taking into account the order of the preprocessing steps as :
• Removing non-English characters
• Removing user names, URLs, and hashtags
• Replacing slang and abbreviations, like “omg” to “oh my god”
• Replacing contractions, like “I’m” to “I am”
• Removing numbers
• Replacing repetitions of punctuation (? and ! only)
• Replacing words with more than 2 repetitions of a character with exactly 2 repetitions. For example, “happyyyyyy” would be replaced with “happy”
• Removing extra whitespaces
• Converting text to lower case
• Correcting spelling in tweets
• Lemmatizing words in tweets
23 pekerja bebas membida secara purata $198 untuk pekerjaan ini
Hello, I can complete this project using python natural language processing. I will be looking forward to hear from you. Please contact me on PM for details.
Hi, I am a Python developer. Have plenty of time to work on this task at the moment. Will you provide a list of slang that needs changing or I should take care of that? Contact me thanks, Pandelis
Hello I'm JunMing. I am a python expert who has more than 10 years of development experiences. So i can do your project without any problems Please send message and discuss more further. Best regards
Hello! I'd like to help you with dataset preprocessing task. I'm familiar with natural language processing as well python libraries for data analysis. I can do the job blazingly fast. Please, give me a try!
Hello I am a expert python developer with over 4 years of experience. I can help you fix this simple script with a day or two. Lets discuss it over chat Regards
Hello, dear. I have read your description carefully I am Python expert and I have knowledge about NLP. Well, I wanna contact and discuss over chat. BEst regards :)