Sedang Disiapkan

Build Database to Store & Extract Data from Text Files (Easy $$$)

NOTE: I originally hired someone for this project and he got called away. Below is the basic scope of work needed. I also included some of the comments from the original developer. You may have a better idea. Speed is important… very important.

My goal… load email list into a database and extract all of the email addresses that have the same email domain as a URL in my domain list.

Example domain

[login to view URL]

[login to view URL]

[login to view URL]

Example text string

chris,jackson,chris@[login to view URL],1234567897

billy,bob,bbob@[login to view URL],84881451

john,doe@[login to view URL],8814

Example saved results

chris,jackson,chris@[login to view URL],1234567897

john,doe@[login to view URL],8814

I am looking for any line that has an email that matches the domain. So if I have [login to view URL] it will extract every line where there is a @[login to view URL] email address. Domains are one per line.

In my list I will have [login to view URL] but when it is being searched, you will code it to search for @[login to view URL] to ensure it pulls a valid email format and not ‘email@[login to view URL]’.

IMPORTANT

Once the Source files are imported into the database there will be around 2 billion records. Many duplicates. I have a list of 400,000 domains that I want to scan against it. I do not expect this to be completed in a few minutes but I do not expect it to take days or weeks. One of the ways we can speed up the Search process is by allowing me to load a suppression list of domains that I do NOT want to import into the database (Gmail, Hotmail, Yahoo). It may be best to have this as a list that I paste into directly into the database so that it doesn’t bring this data in. This will hinder the Import speed but it will speed up the Matching phase since most will be free email accounts.

NOTES

-- The Source files are .csv files that I renamed to .txt for this purpose.

-- Some files have 1 column, and some have 3, 4, etc. Instead of trying to build a table to match the columns, we will treat each row as a single text string (as a single column) and import the entire row of data.

REQUIREMENTS

-- The results need to save frequently instead of waiting until the task is complete.

-- It must have a basic UI – no commands. I want to be able to click a button to run Import and click a button to Match.

-- When I click Match, it asks me to select the [login to view URL] file. This is the file that has the URL’s I want each email address to include.

-- Speed is very important. Speed matters in two parts – the initial import of data and the matching of Domains against the email list.

DEVELOPER NOTES

Here are some key points from the previous developer who started it.

-- Use C++

-- with some precomputation and indexing we can save a lot of time

-- your issue looks mainly like an indexing issue, if you'll index emails with domain name you -- won't have to go through all data every time you search

-- I talked with the DB Admin, and we are using nested Queries

I cannot waste any additional time on this so I must ensure that you have read the details or I’ll flag your bid. In your bid, include the following the quoted text in the first line “I reviewed the notes and understand that speed is important to you.” To help me understand that you are the right person for the job, let me know how soon you can start, when you can finish, and how you plan to develop. The more detail you provide, the more confident that I am that you are the best choice.

Make your best bid first as I will not be overpaying for this task. Do not bid the maximum budget amount as my max is lower than that.

Thanks!

Kemahiran: Pengaturcaraan C++, Pembangunan Pangkalan Data, Pengaturcaraan Pangkalan Data, MySQL, Kejuruteraan Perisian

Lihat lagi: extract data mdb files, extract data html files php, tools extract data csv files, extract data pdf files, build online store product data base, office need util extract data dot files, able extract data pdf files excel, extract data report files, perl script extract data text file, mdb database password extract data, database function extract data xml file, extract data big files, extract data pdf files java program, read data text files java linked list, extract data pdf files excel, extract data ipd files, software extract data pdf files, data text files mysql perl, script extract data text file, extract data dat files

Tentang Majikan:
( 49 ulasan ) Lexington, United States

ID Projek: #17493376

Dianugerahkan kepada:

DavidEssaadi

I reviewed the notes and understand speed is important to you. Indeed it seems you need an index of the data, so that you can search through it much faster than when using a regular database. 2 billion records might be Lagi

$144 USD dalam 7 hari
(1 Ulasan)
1.2

20 pekerja bebas membida secara purata $212 untuk pekerjaan ini

ITPyramid85

hello,how are you. i read your bid carefully. i am c/c++ expert and have full experience for 10 years. c/c++ language is my top skill and i can complete your project fully by using c/c++. i can provide most quality Lagi

$250 USD dalam 10 hari
(16 Ulasan)
6.8
mdjavedakhtar

i reviewed the notes and found speed is important to you. we can start immidiatly. i am planning to first put all data to database using some regex filter. then from database we can find it easily. also in database Lagi

$166 USD dalam 2 hari
(25 Ulasan)
6.3
sumon355

I reviewed the notes and understand that speed is important to you. Hello, As an experienced software developer and having sound knowledge in database, i am very much interested to do this work. I read your descrip Lagi

$260 USD dalam 3 hari
(118 Ulasan)
6.4
Attractionnet

“I reviewed the notes and understand that speed is important to you.” Hello. I have read the specifications and here is my proposal of how this could work. I am a vb.net programmer, so I will write a simple ui i Lagi

$200 USD dalam 3 hari
(113 Ulasan)
6.1
rajdeepa555

“I reviewed the notes and understand that speed is important to you.” Hello Sir, I am a professional software developer, my goal is efficiency of the code. I checked your requirements, and I have a great idea to Lagi

$300 USD dalam 7 hari
(39 Ulasan)
6.3
leemilun

Hi, Dear. Nice to meet you. I've read your post carefully. I have many and good experiences on C#, C++ app, web app, smartphone app and so on. We could discuss more details on chatting room. Regards. Gao M.

$233 USD dalam 10 hari
(22 Ulasan)
4.9
roshanasim

Surely, I've given expertise. I'm Microsoft Certified Professional, Senior Software Engineer and Microsoft Certified Trainer with over five Years of Experience. Skills: ASP.NET Core (MVC 6), MVC 5, WCF, N-Tier Arch Lagi

$247 USD dalam 7 hari
(16 Ulasan)
4.5
Codiggin

Greetings sir, This is a very easy task for me, I am very experienced with processing CSV data to extract data. I can deliver top quality work in a few hours. Please send me a message whenever you want to start, Lagi

$110 USD dalam 2 hari
(8 Ulasan)
4.0
mahpour1987

Hi, I read your document and noticed this is easy to do it. I am an experienced programmer with over 13 years of experience. Thanks.

$166 USD dalam 10 hari
(7 Ulasan)
3.1
shentiakov

if I understand correctly you will use the software on your local PC what operating system do you use? is it Windows or MacOS, etc. ?

$166 USD dalam 3 hari
(2 Ulasan)
2.4
daozi333

Hi, I reviewed the notes and understand that speed is important to you. I have a lot of experience in C++ and C# development and have been working on application development for the past 10 years. I had been made Sec Lagi

$200 USD dalam 10 hari
(2 Ulasan)
2.0
uzairabdullah786

I reviewed the notes and understand that speed is important to you. Hi there, I have considered your project because it seems interesting to me. I would like to work on it and have already prepared the plan how I'll Lagi

$110 USD dalam 4 hari
(8 Ulasan)
2.2
tsft

“I reviewed the notes and understand that speed is important to you.” Dear Sir/Madam: I can do you system, first sterp: linux or Windows work you, any OS I could manage to process text with Python tha's best to t Lagi

$200 USD dalam 10 hari
(3 Ulasan)
2.2
uskay

I reviewed the notes and understand that speed is important to you. I can start the day I am selected. Let me share my interpretation of the problem. We will have email lists that can be imported to the database Lagi

$288 USD dalam 2 hari
(4 Ulasan)
1.7
BlackStarGazer

Hello I like the amount of details you put on the job , so many people don't care to do that . But I still have to say I'm little confused , are you having the source files loaded in a database or you want to p Lagi

$222 USD dalam 7 hari
(1 Ulasan)
0.6
$222 USD dalam 10 hari
(0 Ulasan)
0.0
ducktaleitdev01

Hi, Hope you doing well. Understanding the aspect of the job description, I am pretty confident that I can effectively contribute my skills and time delicately. Let’s have a conversation. I believe in long Lagi

$233 USD dalam 10 hari
(0 Ulasan)
0.0
mkduga

I reviewed the notes and understand that speed is important to you. I have several years experience with c/c++ and also with MySQL databases. I can help you with this project. I believe 5 days is a reasonable tim Lagi

$250 USD dalam 5 hari
(0 Ulasan)
0.0
amitkfreelance

Hi I have the required skill set (C#, Application Development, Console Applications, MSSQL, MS Access, Scripting) and experience for the job, would be happy to assist you on this. Understood your requirement, we Lagi

$277 USD dalam 10 hari
(0 Ulasan)
0.0