Sedang Disiapkan

Web scraper using ASP.NET, C# & SQL Server

Web scraper using ASP.NET, C# & SQL Server

This is for my own learning. 1st time I have posted a project.

The following is also attached in the file "spec.doc"

SPECIFICATION

Three (3) tables to be used

1st_Table) URL

Field Example_Data

----- ------------

ID 1

URL [url removed, login to view]

ARGUMENT /news?source=ig&hl=en&um=1&tab=wn&q=

REGEX (?<=babout<b>bs*)[0-9]*(?=s*b</b> forb)

2nd_Table) TERM

Field Example_Data

----- ------------

ID 1

TERM radon

3rd_Table) RESULT

Field Example_Data

----- ------------

ID 1

URL [url removed, login to view]

ARGUMENT /news?source=ig&hl=en&um=1&tab=wn&q=URL_ARG[url removed, login to view];hl=en&um=1&tab=wn&q=radon

REGEX (?<=babout<b>bs*)[0-9]*(?=s*b</b> forb)

TERM radon

RESULT 553

DATETIME August 05, 2008 4:00PM

COUNT 1

Loop through combinations of 2 tables (URL & TERM), get the regular_expression and append it (and other data) into the 3rd table (RESULT)

1st_Step) Get bring the 'source' of URL into a text file

2nd_Step) Parse this text using Regex

//c# EXAMPLE

Regex regex = new Regex(@"(?<=babout<b>bs*)[0-9]*(?=s*b</b> forb)",[url removed, login to view]);

// The above is from table URL field REGEX --> ---------------------------------------------

// Run regex parsing on matches

MatchCollection matches = [url removed, login to view](text);

3rd_Step) Save findings and other info to "RESULT" table

Note:

1) The attached example code "almost" works for a single URL, I got stuck on matchcollections

2) Liberal use of comments is appreciated.

3) The "ID" field is a Identity field that self increments, but is not used for anything now

4) I would like the use of REGEX & Matchcollection, unless a better method is known.

5) A simple ASP.NET page allowing the add/edit/delete of the 3 tables will be needed

5a) two buttons one to import & on to export to a excel file is needed for all three tables

Kemahiran: .NET, ASP, Pengaturcaraan C

Lihat lebih lanjut: asp scraper, aspnet html scraper, html scraper, write web scraper, web scraper open source, regex scraper, net html scraper, vbnet web scraper, scraper net, aspnet scraper, scraper using net, asp web scraper project, quot, display data sql html table using aspnet, regular expression html scraper matchcollection, aspnet web scrape, web scraper tutorial, regex web scraper, web scraper aspnet, web scrapping aspnet, rss scraper, amp sql net web, web anything, using regex in c, sql server web

Tentang Majikan:
( 0 ulasan ) fairfield, United States

ID Projek: #299699

Dianugerahkan kepada:

Mohitkatariya

I have 2 year of experience in .net technologies and programming. As a freelancer i had done two web scrapping projects, so i have good exposure how to use Regex and other .net classes for scrapping. I had done a web s Lagi

$150 USD dalam 10 hari
(0 Ulasan)
0.0

14 pekerja bebas membida secara purata $201 untuk pekerjaan ini

exoticsolutions

Hello Sir, We are interested. Please check PMB

$250 USD dalam 10 hari
(15 Ulasan)
5.9
namrom

[url removed, login to view]

$200 USD dalam 5 hari
(0 Ulasan)
0.0
pradyuman

we are india based website and software development firm, please have a look at our website [url removed, login to view] for ore information.

$250 USD dalam 8 hari
(0 Ulasan)
0.0
cleric138

i can do it,see your PM.

$230 USD dalam 6 hari
(0 Ulasan)
0.0
titonmoy

HI, This is Tonmoy, I have seen your job posting, I may fit for this job. Please see PM for the details. Thanks. Regard, Tonmoy

$200 USD dalam 5 hari
(0 Ulasan)
0.0
Harryjassal

Hello Sir, Here this is Randhir singh from India.I am very much interested to do your work. i have 5 years of experiance in asp.net,ajax,HTML/CSS,flash communication server. i have experience in e-commerce, conte Lagi

$150 USD dalam 3 hari
(0 Ulasan)
0.0
ryannerd

This is easily done. Let me know when to start! BTW, How can something like this: (?<=baboutbs*)[0-9]*(?=s*b forb) be called Regular?!?

$250 USD dalam 20 hari
(0 Ulasan)
0.0
mrugesh4

we are interested and ready to develop this application as soon as possible..as per your requirements and convenience.

$200 USD dalam 5 hari
(0 Ulasan)
0.0
Bagdady

i am ready to start [url removed, login to view] me.

$180 USD dalam 3 hari
(0 Ulasan)
0.0
amitstech

HI, I am interested in your job posting. This may be my first project, as i m new to freelance world:) Regard, Amit

$150 USD dalam 8 hari
(0 Ulasan)
0.0
Inny

I will write this scraper for you.

$150 USD dalam 2 hari
(0 Ulasan)
0.0
sanalyst

Dear Sir, I am having 4 years of experience in Microsoft Technologies. In this span I have developed various intranet and web based projects using [url removed, login to view] framework/NHibernate and MS Sql Server as b Lagi

$202 USD dalam 30 hari
(0 Ulasan)
0.0
mianrizwanali

Please View PM.

$250 USD dalam 30 hari
(0 Ulasan)
0.0