I need help on my project about data analysis. I have dataset and I have to analyze this dataset on r studio. I've included details about the project below.
[login to view URL] the scope of data quality analysis, examining whether there are possible erroneous values, outliers / extreme values (via descriptive statistics, frequency distribution, filtering, etc.) and elimination of relevant erroneous measurements,
• Elimination of variables considered unnecessary in multivariate data sets,
Missing values, if any, should be determined. For variables with incomplete observations, it is necessary to eliminate the relevant variable or to fill in the missing values using the necessary techniques.
2. Creating Training and Test Data Sets: After the necessary operations are done on dirty data, the final data should be divided into 80% training and 20% testing. After this step, all transactions until the validity step will be carried out on the training data set.
• Defining the dummy variables, categorizing and recoding some of the quantitative variables in addition to the qualitative variables already in the data, giving information about the derived variables (for example; if the ratio variable is from which variables), standardization / normalization when necessary,
I attached my dataset. And on my dataset there is no missing [login to view URL] have to create them on r studio. İf there is something unclear you can ask me.
13 pekerja bebas membida secara purata $42 untuk pekerjaan ini
Hello I m a data analyst with more than 5 years experience I can help you in your cleaning and data preprocessing Send a message to discuss more details Regards
Hello i am Data scientist and can help you in task. Please share details ( Report Length / Deadline etc ) so i can give you exact deliverable time & cost. Best Regards. Atif
Hello, We are GraficoRplot, we work with Rstudio and ggplot2 software to generate high quality graphics, we have experience in scientific research, we can help you with your work.