Match datasets based on precalculated principal components (R programming)

This needs to be done in R, so we need R code as a result. Attached is the file "[login to view URL]"

The attached file consists of 12 columns and 1000 rows (incl. header) , The headline identifies each row with ID, cohort, PC1 to PC10. Cohort contains one of the 6 uniq values ("Control", "set_a", "set_b", "set_c", "set_d", "set_e")

Take each "set" one-by-one and find the closest match to the PCs by finding the control with the minimum of sum over i of square (PCi-PCi) where i stands for PC1, 2 ... 10 and the difference is between the value for the "Control" and "set". So one needs to calculate these for case/set pairs. Once a control is selected it needs to be removed completely so it won't be selected for another case.

Start with a case chosen from one series, and determine the best control. Then switch to another case series and find the best control for a chosen case. Continue until the end of all the cases. Then, start again finding a new control for each case until you reach controls for each case.

Thank you!

Our goal is to select 5 controls for each "set" that are closely matched.

Kemahiran: Pemprosesan Data, Bahasa Pengaturcaraan R, Statistik

Lihat lagi: match com based drupal, principal components factor matlab, principal components analysis matlab, data analytics hadoop r programming, freelance r programming, help with r programming project -- 3, help with r programming project 3, r programming and, r programming language, r programming machine learning, r programming project, R programming, r-programming language, short r programming project, short r programming, test on r programming, freelancer r programming, r programming freelance, r programming freelance job, remote source r programming

Tentang Majikan:
( 0 ulasan ) Montreal, Canada

ID Projek: #17973076

7 pekerja bebas membida secara purata $131 untuk pekerjaan ini


This is Vibrant Webtech and I was glad to see that you're looking for help for project Match datasets based on precalculated principal components. I've delivered more than 400 + projects in the last 5 years and this Lagi

$180 CAD dalam 3 hari
(183 Ulasan)

I am a data scientist by profession with more than 4 years of programming experience in R and have completed more than 35 projects in R. I can finish the task within 24 hrs

$150 CAD dalam 3 hari
(46 Ulasan)

I have Masters degree in Economics and Statistics with 7years of professional experience working as a Quantitative Analyst (in the field of Statistics). A professional statistical analyst seeking opportunity to provid Lagi

$35 CAD dalam sehari
(22 Ulasan)

I possess exceptional data and statistical analysis experience. I use Excel, STATA, R-Programming and SPSS software’s in qualitative and quantitative research and report writing. I hold MBA (Strategic Management) under Lagi

$250 CAD dalam 3 hari
(33 Ulasan)

Hello, i have read the details provided..please contact me to discuss more on the project deadline and some other few things

$120 CAD dalam 3 hari
(11 Ulasan)

Feel fee to contact me for Match datasets based on precalculated principal components .Shoot me message to discuss further more details .We provide the commments,images,videos,demos and live sessions in order to he Lagi

$150 CAD dalam 3 hari
(9 Ulasan)

I am expert in R Programming pls check my reviews and you can trust me and I will offer u the best price and the best quality work

$30 CAD dalam 3 hari
(9 Ulasan)