In Progress

Data Mining - Scrape a database from a website

I need a script that will crawl either [url removed, login to view] or [url removed, login to view] and populate a MySQL database. It needs to be a script because there are probably 700,000+ entries in total. I would prefer the script to be in PHP, but if you have another, faster method, you can use that instead.

The scraped data should go into 2 tables in MySQL.

1st table: artists

fields:

a_name (name of the artist, e.g. "Britney Spears")

a_id (auto-incrementing artist ID, increasing by 1 and starting from 1)

a_alias_plain (URL field with the structure "artist-name": multiple words are separated by dashes, all words are lower case, and all non-alphanumeric characters are stripped out. Make sure there is only 1 dash separating each word)

a_alias_lyrics (URL field with the structure "artist-name-lyrics": multiple words are separated by dashes and "-lyrics" is appended at the end. All words are lower case, and all non-alphanumeric characters are stripped out. Make sure there is only 1 dash separating each word)
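The alias rules above can be sketched as a small helper. This is only an illustration in Python (the brief prefers PHP but allows other methods); the function name `make_alias` and the decision to fold accented letters to plain ASCII before stripping are assumptions, not part of the requirements.

```python
import re
import unicodedata


def make_alias(name: str, suffix: str = "") -> str:
    """Build a URL alias: lower case, alphanumeric words joined by single dashes."""
    # Assumption: accented letters are folded to ASCII ("é" -> "e") rather than dropped.
    folded = unicodedata.normalize("NFKD", name).encode("ascii", "ignore").decode("ascii")
    # Strip everything except lower-case letters, digits, whitespace and dashes.
    cleaned = re.sub(r"[^a-z0-9\s-]", "", folded.lower())
    # Split on any run of whitespace or dashes so that words end up one dash apart.
    words = [word for word in re.split(r"[\s-]+", cleaned) if word]
    if suffix:
        words.append(suffix)
    return "-".join(words)


print(make_alias("Britney Spears"))            # a_alias_plain
print(make_alias("Britney Spears", "lyrics"))  # a_alias_lyrics
```

Because the words are split and rejoined, repeated spaces or dashes in the source name can never produce a double dash in the alias.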

2nd table: songs

fields:

s_id (auto-incrementing song ID, increasing by 1 and starting from 1)

s_name (the name of the song, e.g. "Feel The Way")

s_text (the actual text of the song; I want only the lyrics and nothing else from the page)

s_artist (the artist's ID from a_id, so that I can associate each song with its artist)

s_alias_plain (URL field with the structure "song-name": each word is separated by dashes, all words are lower case, and all non-alphanumeric characters are stripped out. Make sure there is only 1 dash separating each word)

s_alias_lyrics (a 2nd URL field, just in case: each word is separated by dashes, with "-lyrics" appended at the end. All words are lower case, and all non-alphanumeric characters are stripped out. Make sure there is only 1 dash separating each word)
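To illustrate how s_artist ties each song to its artist, here is a rough Python sketch that turns scraped records into rows for the two tables. The function `build_rows`, the `(artist, song, text)` record shape, and the `slugify` parameter are all hypothetical; any function implementing the alias rules can be passed in, and a simplified one is used in the example.

```python
from typing import Callable, Dict, Iterable, List, Tuple

Record = Tuple[str, str, str]  # (artist name, song name, song text) - assumed shape


def build_rows(records: Iterable[Record], slugify: Callable[[str], str]):
    """Assign sequential IDs starting at 1 and link each song to its artist."""
    artists: Dict[str, dict] = {}  # artist name -> artist row, in first-seen order
    songs: List[dict] = []
    for artist_name, song_name, text in records:
        if artist_name not in artists:
            alias = slugify(artist_name)
            artists[artist_name] = {
                "a_id": len(artists) + 1,
                "a_name": artist_name,
                "a_alias_plain": alias,
                "a_alias_lyrics": alias + "-lyrics",
            }
        alias = slugify(song_name)
        songs.append({
            "s_id": len(songs) + 1,
            "s_name": song_name,
            "s_text": text,
            "s_artist": artists[artist_name]["a_id"],  # reference to a_id
            "s_alias_plain": alias,
            "s_alias_lyrics": alias + "-lyrics",
        })
    return list(artists.values()), songs


# Simplified slug function for the example only; placeholder text, not real lyrics.
simple_slug = lambda s: "-".join(s.lower().split())
artist_rows, song_rows = build_rows(
    [("Britney Spears", "Feel The Way", "(song text here)")], simple_slug
)
print(artist_rows[0]["a_id"], song_rows[0]["s_artist"])
```

With 700,000+ entries the real script would more likely insert rows in batches and let MySQL assign the IDs; the in-memory version only shows the relationship between the two tables.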

The database should use a proper collation so that all special characters are stored and displayed correctly.
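One possible reading of the table and collation requirements is sketched below as MySQL DDL held in Python strings. The column types and lengths, the index name, and the choice of `utf8mb4` with `utf8mb4_unicode_ci` are assumptions; the brief names only the fields and asks for a collation that handles special characters. `create_tables` is a hypothetical helper that accepts any DB-API cursor connected to MySQL.

```python
# MySQL DDL for the two tables. Types, lengths and collation are assumptions.
SCHEMA_STATEMENTS = [
    """
    CREATE TABLE artists (
        a_id           INT UNSIGNED NOT NULL AUTO_INCREMENT PRIMARY KEY,
        a_name         VARCHAR(255) NOT NULL,
        a_alias_plain  VARCHAR(255) NOT NULL,
        a_alias_lyrics VARCHAR(255) NOT NULL
    ) ENGINE=InnoDB DEFAULT CHARSET=utf8mb4 COLLATE=utf8mb4_unicode_ci
    """,
    """
    CREATE TABLE songs (
        s_id           INT UNSIGNED NOT NULL AUTO_INCREMENT PRIMARY KEY,
        s_name         VARCHAR(255) NOT NULL,
        s_text         MEDIUMTEXT   NOT NULL,
        s_artist       INT UNSIGNED NOT NULL,
        s_alias_plain  VARCHAR(255) NOT NULL,
        s_alias_lyrics VARCHAR(255) NOT NULL,
        KEY idx_s_artist (s_artist)
    ) ENGINE=InnoDB DEFAULT CHARSET=utf8mb4 COLLATE=utf8mb4_unicode_ci
    """,
]


def create_tables(cursor) -> None:
    """Run each CREATE TABLE statement on a DB-API cursor connected to MySQL."""
    for statement in SCHEMA_STATEMENTS:
        cursor.execute(statement)
```

The connection itself should also be opened with the same character set, otherwise special characters can still be corrupted on the way into the tables.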

The whole database should probably have 700,000+ entries. I don't want to wait more than 5 days, so if you can complete it within that time frame, feel free to bid. I am not paying more than $100 so please don't bid higher. I need to start as soon as possible, so if you give me a good bid, you could even start working today.

Please only bid if you have read the requirements fully.

Skills: Data Entry, Data Processing, Linux, PHP, Script Install

About the Employer:
(273 reviews) Brooklyn, United States

Project ID: #323328

16 freelancers are bidding an average of $147 for this job

SigmaVisual

Please check PMB.

$100 USD in 5 days
(269 reviews)
8.0
jeevanoss

I am a Linux-based programmer with 5 years of experience. See PM for details.

$80 USD in 2 days
(41 reviews)
6.3
hameedkhan

Hi, Kindly have a look at PM, Thanks.

$100 USD in 0 days
(103 reviews)
6.0
phpXpertbd

Highly excited to do this job. Please check PMB.

$100 USD in 2 days
(26 reviews)
5.9
programmerAS3

Hi. Those who bid $100 or less are all fakes or don't know what scraping is all about. And you want it within that amount of money and time, which is really tough. I'm saying this because I've just f...

$200 USD in 5 days
(16 reviews)
5.8
andreiandrei

Hi, please check PM.

$250 USD in 2 days
(7 reviews)
4.9
zhukaster

I can help with that

$100 USD in 4 days
(22 reviews)
4.6
AKSolutions

We can do this for you. The task is interesting. Please let us know which of the two given sites you want us to prioritize for fetching data. The task is not critical at all, but it should get completed with proper car...

$230 USD in 5 days
(7 reviews)
4.2
satsco

Hello, This is a placeholder bid - please see pm for details. regards, Satsco.

$100 USD in 5 days
(2 reviews)
1.7
parkgroup

We will deliver what you need in the specified time, with 100% assurance of clear data. Waiting for your PM. Thanks.

$245 USD in 10 days
(0 reviews)
0.0
waynwill

Kapow Robot is the solution to this problem. WayNwill (http://www.waynwill.com/kapow.htm) is a recognized expert in Kapow Robot development. And having a development facility in India helps us provide extremely cost-effec...

$200 USD in 2 days
(0 reviews)
0.0
johnapop

I can do that with another, faster method. Contact me for details.

$200 USD in 1 day
(0 reviews)
0.0
andysumy

Hello, I can implement it in Perl in 2 days max.

$50 USD in 2 days
(0 reviews)
0.0
egoriy

Web scraping is my strong point. Please view pmb for an example.

$50 USD in 1 day
(0 reviews)
0.0
hijack23

Hi! I have Kapow Mashup Server to do it faster, but I need a Windows or Linux server to run the bot.

$250 USD in 7 days
(0 reviews)
0.0
Oreip

Please check your PM.

$100 USD in 4 days
(0 reviews)
0.0