In Progress

Data Mining - Scrape a database from a website

I need you to make a script that will run against either [url removed, login to view] or [url removed, login to view] and populate a MySQL database (it needs to be a script because there are probably 700,000+ entries in total). I would prefer the script to be in PHP, but if you have other, faster methods, you can use those as well.

The scraped data should go into 2 tables in MySQL.

1st table: artists

fields:

a_name (name of the artist, i.e. "Britney Spears"),

a_id (incremental artist ID 1 by 1 starting from 1)

a_alias_plain (URL field with the structure "artist-name": multiple words are separated by dashes, all words are lowercase, and all non-alphanumeric characters must be stripped out. Make sure there is only 1 dash separating each word)

a_alias_lyrics (URL field with the structure "artist-name-lyrics": multiple words are separated by dashes and "-lyrics" is appended at the end. All words are lowercase, and all non-alphanumeric characters must be stripped out. Make sure there is only 1 dash separating each word)
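A minimal sketch of the alias rule described above, written in Python for illustration only (the posting prefers PHP but allows other methods). The function name make_alias and its lyrics flag are hypothetical. It assumes that "non-numeric/non-alphabet" means anything outside a-z and 0-9 after lowercasing, so punctuation and accented letters are dropped, and that words are delimited by whitespace only.

```python
import re


def make_alias(name: str, lyrics: bool = False) -> str:
    """Build a URL alias: lowercase words joined by single dashes.

    Assumption: only ASCII letters and digits are kept; everything else
    (punctuation, accented letters) is removed from each word.
    """
    # Lowercase first, then split on any run of whitespace.
    words = name.lower().split()
    # Strip every character that is not a-z or 0-9 from each word.
    cleaned = [re.sub(r"[^a-z0-9]", "", word) for word in words]
    # Drop words that became empty so no double dashes can appear.
    slug = "-".join(word for word in cleaned if word)
    if lyrics:
        slug = f"{slug}-lyrics" if slug else "lyrics"
    return slug


print(make_alias("Britney Spears"))               # britney-spears
print(make_alias("Britney Spears", lyrics=True))  # britney-spears-lyrics
print(make_alias("Feel  The   Way!"))             # feel-the-way
```

The same helper would apply unchanged to the song alias fields, since they follow the same rule. Two different names can produce the same alias under this rule, so whoever uses the aliases as URLs may need a uniqueness check.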

2nd table: songs

fields:

s_id (id of the song, incremental 1 by 1 starting with 1)

s_name (the name of the song, i.e. "Feel The Way")

s_text (the actual text of the song, I only want the text and not any other stuff on the page)

s_artist (this is going to be the Artist's ID from a_id - this is so that I can associate which song is for which artist)

s_alias_plain (this is a URL field with the structure "song-name": each word is separated by dashes, all words are lowercase, and all non-alphanumeric characters must be stripped out. Make sure there is only 1 dash separating each word)

s_alias_lyrics (this is the 2nd URL field, just in case: each word is separated by dashes, with "-lyrics" appended at the end. All words are lowercase, and all non-alphanumeric characters must be stripped out. Make sure there is only 1 dash separating each word)

Database should have proper collation so that all special characters are displayed.
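The two tables and the s_artist link can be sketched as follows. This is only an illustration: it uses Python's built-in sqlite3 module as a stand-in for MySQL so that it runs without a server, and the column types, the NOT NULL constraints, and the helper names create_schema, add_artist, and add_song are assumptions, not part of the posting. On MySQL itself, a Unicode character set such as utf8mb4 with a matching collation is the usual way to meet the special-character requirement.

```python
import sqlite3


def create_schema(conn: sqlite3.Connection) -> None:
    """Create the artists and songs tables (SQLite stand-in for MySQL)."""
    conn.executescript(
        """
        CREATE TABLE artists (
            a_id           INTEGER PRIMARY KEY AUTOINCREMENT,
            a_name         TEXT NOT NULL,
            a_alias_plain  TEXT NOT NULL,
            a_alias_lyrics TEXT NOT NULL
        );
        CREATE TABLE songs (
            s_id           INTEGER PRIMARY KEY AUTOINCREMENT,
            s_name         TEXT NOT NULL,
            s_text         TEXT NOT NULL,
            s_artist       INTEGER NOT NULL REFERENCES artists(a_id),
            s_alias_plain  TEXT NOT NULL,
            s_alias_lyrics TEXT NOT NULL
        );
        """
    )


def add_artist(conn, name, alias_plain, alias_lyrics) -> int:
    """Insert one artist and return the generated a_id."""
    cur = conn.execute(
        "INSERT INTO artists (a_name, a_alias_plain, a_alias_lyrics) "
        "VALUES (?, ?, ?)",
        (name, alias_plain, alias_lyrics),
    )
    return cur.lastrowid


def add_song(conn, artist_id, name, text, alias_plain, alias_lyrics) -> int:
    """Insert one song linked to an artist and return the generated s_id."""
    cur = conn.execute(
        "INSERT INTO songs (s_name, s_text, s_artist, s_alias_plain, "
        "s_alias_lyrics) VALUES (?, ?, ?, ?, ?)",
        (name, text, artist_id, alias_plain, alias_lyrics),
    )
    return cur.lastrowid


# Illustrative rows only; the lyrics text is a placeholder.
conn = sqlite3.connect(":memory:")
create_schema(conn)
artist_id = add_artist(conn, "Britney Spears", "britney-spears",
                       "britney-spears-lyrics")
song_id = add_song(conn, artist_id, "Feel The Way", "placeholder text",
                   "feel-the-way", "feel-the-way-lyrics")
conn.commit()
```

Parameterized queries (the ? placeholders) matter here because scraped titles and lyrics routinely contain quotes and other characters that would break hand-built SQL strings.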

The whole database should probably have 700,000+ entries. I don't want to wait more than 5 days, so if you can complete it within that time frame, feel free to bid. I am not paying more than $100 so please don't bid higher. I need to start as soon as possible, so if you give me a good bid, you could even start working today.
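As a rough feasibility check on the numbers above, the required scraping rate can be worked out directly. This sketch assumes one page fetch per entry and a scraper that runs continuously for the whole period; it ignores index pages, retries, and any rate limiting by the source site. The function name required_rate is hypothetical.

```python
def required_rate(total_entries: int, deadline_days: float) -> float:
    """Return the average number of entries per second needed to finish on time."""
    seconds_available = deadline_days * 24 * 60 * 60
    return total_entries / seconds_available


rate = required_rate(700_000, 5)
print(round(rate, 2))      # 1.62 entries per second
print(round(1 / rate, 2))  # 0.62 seconds available per entry
```

At roughly 1.6 fetches per second sustained for 5 days, a single sequential fetcher would have little slack, so some parallelism or a shorter per-request time would likely be needed.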

Please only bid if you have read the requirements fully.

Just to clarify, I want the whole database completed, and I also want to have the script from you just in case.

Skills: Data Entry, Data Processing, Linux, PHP, Script Install

About the Employer:
(275 reviews) Brooklyn, United States

Project ID: #323328

16 freelancers are bidding on average $147 for this job

SigmaVisual

Please check PMB.

$100 USD in 5 days
(269 reviews)
8.0

jeevanoss

I am a Linux-based programmer with 5 years of experience. See PM for details.

$80 USD in 2 days
(41 reviews)
6.3

hameedkhan

Hi, kindly have a look at the PM. Thanks.

$100 USD in 0 days
(103 reviews)
6.0

phpXpertbd

Highly excited to do this job. Please check PMB.

$100 USD in 2 days
(26 reviews)
5.9

programmerAS3

Hi. Those people who bid $100 or less are all fakes or don't know what scraping is all [url removed, login to view] you want within that amount of money and time, it is really tough. I'm saying this because I've just f...

$200 USD in 5 days
(16 reviews)
5.8

andreiandrei

Hi, please check PM.

$250 USD in 2 days
(7 reviews)
4.9

zhukaster

I can help with that

$100 USD in 4 days
(22 reviews)
4.6

AKSolutions

We can do this for you. The task is interesting. Please let us know which of the two given sites you want to give priority to when fetching data. The task is not critical at all, but it should get completed with proper car...

$230 USD in 5 days
(7 reviews)
4.2

satsco

Hello, this is a placeholder bid. Please see PM for details. Regards, Satsco.

$100 USD in 5 days
(2 reviews)
1.7

parkgroup

We will deliver what you need in the specified time, with a 100% guarantee of clean data. Waiting for your PM. Thanks.

$245 USD in 10 days
(0 reviews)
0.0

waynwill

Kapow Robot is the solution to this problem. WayNwill ([url removed, login to view]), a recognized expert in Kapow Robot [url removed, login to view] having a development facility in India helps us provide extremely cost-effec...

$200 USD in 2 days
(0 reviews)
0.0

johnapop

I can do that with another, faster method. Contact me for details.

$200 USD in 1 day
(0 reviews)
0.0

andysumy

Hello, I can implement it in Perl in 2 days max.

$50 USD in 2 days
(0 reviews)
0.0

egoriy

Web scraping is my strong point. Please view PMB for an example.

$50 USD in 1 day
(0 reviews)
0.0

hijack23

Hi! I have Kapow Mashup Server to do it faster, but I need a Windows or Linux server to run the bot.

$250 USD in 7 days
(0 reviews)
0.0

Oreip

Please check your PM.

$100 USD in 4 days
(0 reviews)
0.0