Machine learning models are trained on large data sets. I'm looking for someone to find and catalogue all the data sets available on the web for training machine learning models. You should make a list that includes, for each data set, a link to it, details on what type of data it contains, the format of the data, and the license that governs its use.
What I need for this project is to initially build a search engine that identifies and categorizes all the data sets on the web that are useful for training machine learning models. The idea is that a user should be able to come in and browse or search by data set, type of data, amount of data, license, and format.
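To make the requirement concrete, here is a minimal sketch of what one catalogue record and a basic filter could look like. Everything in it is an assumption for illustration: the `DatasetEntry` fields, the `search` function, and the three sample entries (which are placeholders pointing at example.com, not real data sets). A real search engine would sit on top of a database or index rather than an in-memory list.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass
class DatasetEntry:
    """One catalogue record (hypothetical schema based on the fields listed above)."""
    name: str
    url: str
    data_type: str    # e.g. "image", "text", "tabular"
    data_format: str  # e.g. "CSV", "JSON", "PNG"
    license: str      # license that governs use of the data
    size_bytes: int   # amount of data


def search(
    entries: List[DatasetEntry],
    text: Optional[str] = None,
    data_type: Optional[str] = None,
    data_format: Optional[str] = None,
    license_name: Optional[str] = None,
    min_size_bytes: Optional[int] = None,
) -> List[DatasetEntry]:
    """Return the entries that match every filter that was supplied."""
    results = []
    for entry in entries:
        # Free-text match against the data set name, case-insensitive.
        if text and text.lower() not in entry.name.lower():
            continue
        if data_type and entry.data_type.lower() != data_type.lower():
            continue
        if data_format and entry.data_format.lower() != data_format.lower():
            continue
        if license_name and entry.license.lower() != license_name.lower():
            continue
        if min_size_bytes is not None and entry.size_bytes < min_size_bytes:
            continue
        results.append(entry)
    return results


# Placeholder entries for illustration only; these are not real data sets.
CATALOGUE = [
    DatasetEntry("Example Image Set", "https://example.com/images",
                 "image", "PNG", "CC-BY-4.0", 5_000_000_000),
    DatasetEntry("Example Review Corpus", "https://example.com/reviews",
                 "text", "JSON", "MIT", 200_000_000),
    DatasetEntry("Example Sales Table", "https://example.com/sales",
                 "tabular", "CSV", "CC-BY-4.0", 1_000_000),
]

if __name__ == "__main__":
    for hit in search(CATALOGUE, license_name="CC-BY-4.0"):
        print(hit.name, hit.url)
```

Filters left as `None` are ignored, so the same function covers both browsing (no filters) and narrowing by any combination of type, format, license, and size.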
17 freelancers are bidding on average $20/hour for this job
I have 2.5 years of experience working with R, data mining, and machine learning. I have built an application that integrates R and Java and implements an association rule algorithm.