ALL IMPLEMENTATIONS SHOULD NOT BE COPIED OR TAKEN FROM INTERNET ! ALL SHOULD BE MADE BY FREELEANCER STARTING FROM EMPTY PROJECT FILE.
THE SOURCE SHOULD BE AS SIMPLE TO UNDERSTAND AS POSSIBLE ! NO ERROR CATCHINGS OR ANY OTHER NOT RELEVANT THINGS.
I need a very detailed tutorial how to implement a dynamic programming algorithm (of my choose) first normally on CPU, than parallel on CPU and last how to make it work faster as a praller application on a GPU. I need it to be done in CUDA - Visual Studio 2010. Plus make comparison of the speed to the CPU implementation versions.
ALGORITHM WILL COMPARE 2 LONG SEQUENCES SO IT IS ENOUGH IF COMPARISON PART IF ALGORITH IS MADE PARALLEL FOR FORST IMLEMENTATION - LATER I HOPE FOR LONG COOPERATION.
Tutorial must be very datailed, written so that person with no knowledge on programming on CUDA could learn and make both the programs workable by him/her self.
So I need a tutorial that will let me implement the algorithm normally that will work on CPU and than how to optimalize to work parallel on CPU and implement it in CUDA. Step by step from the very scratch like installation of CUDA or compiler etc. So not only codes to copy and paste should be in tutorial but also description of algorithms and optimalizations and what thay do etc. There will be a lot of comparisons in the algorithm as far as i know. The final app should first do the algorithm on CPU than on GPU and show results and compare computing times.
1> create a detailed tutorial about programming CUDA in our Project.
2> analyze the Algorithm Implementation and how to program it and optimize for GPGPU
3> implement the algorytmik on CPU.
4> implement the algorithm on GPU.
5> compare and create performance function for both the implementations and also mention optimized solutions for future.