Find Jobs
Hire Freelancers

Website scraping(repost)

$30-5000 USD

Ditutup
Disiarkan lebih dari 12 tahun yang lalu

$30-5000 USD

Dibayar semasa penghantaran
We need Java components developed for the scraping of specific data from many different web sites. The components will be implementations/extensions of interfaces/base classes we will provide, will be deployed in a crawling system we are developing internally and will return data in the form of List of instances of classes we've already designed. So what we are asking for is just the logic of data extraction, the components will be hosted as plugins by our application. The data the be extracted is about events/happening/venues, more specifically: * detailed timing (start date, end date, scheduling) * title, description, and owner * location, city, address Components will be of 2 types: the first type (we call it Spider) will simply scan a websites returning all the urls that should be analyzed, along with some information for call those urls later (web method, parameters, etc..). The second type of component will receive the information retrieved by the first type (plus last visit date, if any) and will grab data as describerd before. So, given a website we need to grab info from, we need 2 components implemented: 1 of the first type and one of the second. Please bid with a per-site cost. **UPDATE 10/08/2011** First of all, many thanks for all the replies and bids. Some of you asked for more details and/or sample of the websites we are going to grab data from. I'm giving them in the "**Detailed Requirements**" section below. ## Deliverables Here some the sites: * In **ITALIAN **language**:** * **[login to view URL]**. The section to look is "Calendation" ([login to view URL]). For this site we provide a sample implementation see below. * **[login to view URL]**. A search page is available here: [login to view URL] * **[login to view URL]**. A search page is available here: [login to view URL] * **[login to view URL]**. A search page is available here: [login to view URL] * **[login to view URL]**. A search page is available here: [login to view URL] * **[login to view URL]**. A search page is available here: [login to view URL] * In **ENGLISH **language; * **[login to view URL]**. A search page is available here: [login to view URL] * **[login to view URL]**. A search page is available here: [login to view URL] Further more I'm attaching a sample implementation (it's a NetBeans project) of the interfaces I talked about. It's base of WebHarvest (<[login to view URL]>), a java open source framework (with a nice IDE) for web data extraction. We do **NOT **require you to build your implementations on this tool, but we think some may found it to be useful. This sample projects contains 2 packages: * [login to view URL] * [login to view URL] The first one contains base classes with functionalities we think can be shared between all WebHarvester based implementations. The second one contains the specific implementation for one of the site listed above ([login to view URL]). **A note on language/location.** As you can see from the site list, we are basically talking about Italian events. Most of the sites allow to perform a per-city search, and when this is possible we need to search in the Milan area. But please make it a parameter in your implementation, we hope to cover other cities very soon. * * *This broadcast message was sent to all bidders on Wednesday Aug 10, 2011 6:41:37 AM: dear bidders, as request by many of you, I've just added some details to the project description. take a look m.
ID Projek: 3557486

Tentang projek

4 cadangan
Projek jarak jauh
Aktif 13 tahun yang lalu

Ingin menjana wang?

Faedah membida di Freelancer

Tetapkan bajet dan garis masa anda
Dapatkan bayaran untuk kerja anda
Tuliskan cadangan anda
Ianya percuma untuk mendaftar dan membida pekerjaan
4 pekerja bebas membida secara purata $405 USD untuk pekerjaan ini
Avatar Pengguna
See private message.
$800 USD dalam 14 hari
0.0 (1 ulasan)
0.0
0.0
Avatar Pengguna
See private message.
$70 USD dalam 14 hari
0.0 (0 ulasan)
0.0
0.0
Avatar Pengguna
See private message.
$600 USD dalam 14 hari
0.0 (0 ulasan)
0.0
0.0
Avatar Pengguna
See private message.
$150 USD dalam 14 hari
0.0 (1 ulasan)
0.0
0.0

Tentang klien

Bendera ITALY
Italy
5.0
4
Ahli sejak Ogo 3, 2011

Pengesahan Klien

Terima kasih! Kami telah menghantar pautan melalui e-mel kepada anda untuk menuntut kredit percuma anda.
Sesuatu telah berlaku semasa menghantar e-mel anda. Sila cuba lagi.
Pengguna Berdaftar Jumlah Pekerjaan Disiarkan
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Memuatkan pratonton
Kebenaran diberikan untuk Geolocation.
Sesi log masuk anda telah luput dan telah dilog keluar. Sila log masuk sekali lagi.