Our project is intended to model topic analyses in a German language data set. Accordingly, we are looking for an R expert with competences that are necessary for the upcoming tasks of data cleansing and data preparation.
Specifically, the task is to combine about 1,300 individual files as well as to minimize the dataset by means of predefined text modules and to clean up the character formatting (remove German umlauts etc., make it readable for R in UTF-8 format, etc.). The goal is to have an accurate data set at the end, which can be analyzed in R using LDA/Topic-Modeling. Assistance for starting the topic analysis (especially regarding the correct selection of R packets for analysis) is also welcome.
23 pekerja bebas membida secara purata €165 untuk pekerjaan ini
Hi, I am Ibrahim and I am expert in R, I may not know German, and might require your help with it, but I am a pro in statistics. Regards, ibrahim Anjum