
Selesai
Disiarkan
Dibayar semasa penghantaran
We are seeking a highly experienced web scraping professional for a long-term collaboration involving large-scale B2B data extraction from major public industry directories in Germany and Austria. This is not a one-time scraping task. We are building a structured, scalable data acquisition system and require a technically strong partner capable of handling full extraction, enrichment, and monthly incremental automation. --- ### Phase 1 – Initial Full Extraction (Per Source) We will provide 1–2 directory sources as a pilot. For each source, the freelancer must: • Extract the complete company inventory • Capture required fields: * Email address (highest priority) * Company name * Street address * Postal code * City • If available: Telephone, Fax, Website, Industry • Deliver structured Excel output For records without email but with website: • Crawl company website • Scan Impressum / Contact / Legal Notice pages • Extract and validate business email addresses • Enrich dataset accordingly This phase must be priced as a **fixed one-time cost per source**, covering full extraction, enrichment, formatting, and delivery. --- ### Phase 2 – Monthly Incremental Extraction After completing the full inventory for each source: • Build a system to detect and extract only newly added listings each month • Apply the same enrichment logic • Deliver monthly Excel file • Monitor structural changes in source directories This phase must be priced as a **fixed monthly rate per source**, ensuring predictable ongoing costs. --- ### Important Context We already maintain a very large internal B2B database (~5 million records). Scalability, structured architecture, and long-term reliability are essential. We are looking for a professional partner — not a short-term task worker. --- ### Proposal Requirements Please include: 1. Experience with large directory scraping projects 2. Your technical approach to enrichment via website crawling 3. Fixed price per source for full extraction 4. Monthly price per source for incremental updates 5. Estimated timeline for one full directory extraction We are building a long-term partnership with consistent monthly work — a stable win-win collaboration. --- If you'd like, I can also give you a more “strict screening” version that filters out low-quality applicants immediately.
ID Projek: 40273289
32 cadangan
Projek jarak jauh
Aktif 10 hari yang lalu
Tetapkan bajet dan garis masa anda
Dapatkan bayaran untuk kerja anda
Tuliskan cadangan anda
Ianya percuma untuk mendaftar dan membida pekerjaan

As an experienced leader of BN-Droids Digital Services, I offer you precisely what you need for your large-scale B2B web scraping project. I have successfully managed similar projects extracting over a million data entries a day across multiple geographical regions. My team even maintains an impressive database of 20 million retail data points from countries like USA, UAE, and Europe. This depth of experience ensures that we have the proficiency to handle your massive inventory and enrich it with meticulous attention to detail.
₹12,500 INR dalam 7 hari
6.9
6.9
32 pekerja bebas membida secara purata ₹21,106 INR untuk pekerjaan ini

Hello there, I have read your description carefully and I can help you with this. I understand that you need an experienced web scrapper for a long term collaboration. If I understand you well, there will be a first scrape of the data from the directory and after, monthly scrapes to get the new additions to the directory. If there are no emails but there are websites, I'll that to get the emails using python. If you check my profile you will see that I have 15+ years experience in this. I can share with you websites I have had to scraped on hourly basis to get the new ones. I put them into Google sheet. You will see when it was run. I have done this for 1 client for 4 e-commerce websites for straight 4 years. I can share these with you for confirmation. I can easily handle this for you using python. Let's chat please. Thanks Claudia
₹15,000 INR dalam 30 hari
7.3
7.3

Have over 18 years of experience in data mining/ Web scrapping/ Scraping Bots/ Chrome/Opera Extensions I have done it all. Tell us your source and we will put it in excel for you, Or we can even give you filtered results as per your requirement, In the format you want. You can also ask for data into a particular format - Excel, Json, Mysql, Databases, XMLs, you name them. Further Can help you with integrating it with ur databases, Can create json outputs. We are not only good with scraping but also with the tools that u may need after that. We can help you build you softwares round the data we have 99% Data Accuracy. We have Duplicate finder. etc., We can help with Statistics on the data We can help with creating Api's front the data We can create Softwares to manage that data We can build Sites round the data
₹15,000 INR dalam 2 hari
6.9
6.9

Having extensive experience in data entry and web scraping, I am confident that my skillset is a strong fit for your large-scale B2B data extraction project. My proficiency in data extraction, mining, processing, database management, and Excel is well-established, which directly aligns with the core requirements of your project. I understand the importance of delivering not just quantity, but quality data that you can rely on to make informed business decisions - and I excel at this task. My technical approach to enrichment via website crawling complements your project needs perfectly. Utilizing Python's powerful web scraping capabilities, I'll ensure every valid business email address is thoroughly validated and integrated with the existing dataset efficiently. This focus on generating reliable, clean, and well-formatted results extends to every part of my work - reducing any extra effort required on your part to make good use of the extracted data.
₹25,000 INR dalam 7 hari
6.7
6.7

Hi, Lets get connect over a chat. I have more than 9 years of experience in building custom platforms in python. I will walk through to my work samples as well. I am online right now. Thanks Ali
₹15,000 INR dalam 1 hari
5.3
5.3

I specialize in large-scale B2B directory extraction (millions of records), building modular Scrapy/Playwright pipelines with proxy rotation, anti-block handling, structured parsing, and automated enrichment via Impressum/Contact crawling + email validation (regex + MX + dedupe). Architecture: full crawl → structured DB storage → website enrichment layer → validation → Excel export; monthly incrementals via change-detection (hashing, last-modified checks, diff crawling). Timeline: ~5–8 days per full source depending on size.
₹12,500 INR dalam 1 hari
5.1
5.1

Hi, As per my understanding: You require a scalable, long-term B2B data acquisition system targeting major public directories in Germany and Austria. Phase 1 involves full inventory extraction per source, structured field capture (email priority), enrichment via company website crawling (Impressum/Contact), validation, and Excel delivery. Phase 2 requires automated monthly incremental extraction detecting new listings only, applying the same enrichment logic, and ensuring structural monitoring for reliability. Implementation approach: I will build a modular Python scraping framework (Scrapy + async crawling) with per-source spiders and a normalized data schema. Email enrichment will include automated website crawl, targeted parsing of legal/contact pages, regex + validation filters, and deduplication against your 5M-record DB. Data pipeline will support change detection, logging, and structural alerting. Monthly jobs will run via scheduled automation with incremental diff logic. Output: clean, structured Excel with audit logs. A few quick questions: Are anti-bot protections expected on pilot sources? Do you provide sample directory URLs? Required email validation depth (SMTP check or regex only)?
₹12,500 INR dalam 7 hari
4.6
4.6

Senior Web Scraping Expert for Large-Scale B2B Directory Extraction (Germany & Austria) I’m a full-stack software engineer with expertise in React, Node.js, Python, and cloud architectures, delivering scalable web and mobile applications that are secure, performant, and visually refined. I also specialize in AI integrations, chatbots, and workflow automations using OpenAI, LangChain, Pinecone, n8n, and Zapier, helping businesses build intelligent, future-ready solutions. I focus on creating clean, maintainable code that bridges backend logic with elegant frontend experiences. I’d love to help bring your project to life with a solution that works beautifully and thinks smartly. To review my samples and achievements, please visit:https://www.freelancer.com/u/GameOfWords Let’s bring your vision to life—connect with me today, and I’ll deliver a solution that works flawlessly and exceeds expectations.
₹25,000 INR dalam 4 hari
4.7
4.7

Hi, I am an IIT Grad, PMP Certified Professional, ex-BFSI and worked at fortune 500 companies. I will make it a reality for you. As a Web Scraping Expert, I will utilize Python libraries like BeautifulSoup, Scrapy, and Selenium to extract structured data from German and Austrian industry directories, employing techniques such as CSS selectors, XPath expressions, and anti-scraping measures to ensure efficient and reliable extraction. Kindly click on the chat button so we can discuss and get started. Will share you my prior projects done and my resume too. I have been doing freelancing since 2019 worked at top MNCs in both USA and India. Lets connect
₹12,500 INR dalam 7 hari
3.7
3.7

As a seasoned professional with demonstrated experience in web scraping, data extraction and enrichment, I am confident that I can deliver exemplary results for your project. Having taken on similar large-scale directory scraping projects before, I appreciate the intricacies and importance of a structured, scalable data acquisition system. This is why I offer my expertise and knowledge in Python, Excel, Data Extraction and Mining to ensure you gain both depth and breadth in the final dataset. Central to my approach is maximizing your desired 'Required Fields' including key fields like email addresses which we endeavor to extract even from websites via validation and enrichment. Given my technical aptitude in website crawling and keen attention to detail, there's minimal chance for any critical data loss; precisely what your large internal B2B database (~5 million records) demands. By conducting comprehensive scans of listings' Impressum/Contact/Legal Notice pages and using a combination of automation and manual validation processes, I will significantly bolster the data available for subsequent use.
₹20,000 INR dalam 7 hari
2.9
2.9

I understand you require a seasoned web scraping expert to handle large-scale B2B directory extraction from German and Austrian sources, including full initial extraction with email enrichment from company websites and ongoing monthly incremental updates. Your need for a structured, scalable system that supports long-term collaboration and reliable data delivery is clear. With over 15 years of experience and more than 200 projects completed, I specialize in API integration and web scraping using Python, combined with database design and cloud deployment to ensure scalable and maintainable solutions. My background includes building automated pipelines that extract, enrich, and deliver structured Excel outputs tailored to client specifications. For this project, I will develop a modular scraper to extract all required fields from your pilot sources, followed by a targeted crawl of company websites to capture and validate emails from Impressum or contact pages. The initial full extraction can be completed within 3–4 weeks per source, with a fixed price reflecting the complexity. Subsequent monthly incremental updates will run automatically with monitoring for source changes, billed at a fixed monthly rate per source. Let’s discuss your priorities and how I can help build this robust data acquisition system for your ongoing needs.
₹13,750 INR dalam 7 hari
2.6
2.6

Hi there, I am an experienced freelance web scraper with 2+ years of hands-on experience delivering accurate, reliable, and scalable data extraction solutions. Successfully completed 100+ projects across multiple domains, consistently achieving high client satisfaction and timely delivery. Core Expertise: Web Scraping & Data Extraction Python Development & Automation Selenium & Browser Automation XPath & HTML/XML Parsing lxml & Data Processing Your Queries: 1. My Experience with large directory scraping projects: I have worked on similar large-scale scraping projects with datasets reaching up to 3 million data points in some cases. 2. Technical approach to enrichment via website crawling: I already have enrichment scripts that scrape emails, social handles with up to 80 percent accuracy. 3. Fixed price per source for full extraction (for up to 2 million data points): usually between 15000 - 20000 INR 4. Monthly price per source for incremental updates: Usually stays between 50 - 80 USD. 5. Estimated timeline for one full directory extraction: usually takes 1 to 6 weeks. Additional note: It is difficult to estimate cost or timeline without analyzing the websites. If you can provide the website URLs, I can give a more accurate estimate. I can also offer bulk discounts if you share how many websites you want to scrape and how long you require incremental updates. I am looking forward to building a long-term partnership with your team. Thank you. Anubhab
₹20,000 INR dalam 30 hari
2.6
2.6

Hello, I can help build a reliable and scalable Python-based scraping system for extracting large B2B datasets from industry directories in Germany and Austria. The solution will capture all required company details with a strong focus on email extraction, and for listings without emails, it will automatically crawl company websites (Impressum, Contact, Legal pages) to enrich the dataset with validated business emails. The system will deliver clean, structured Excel outputs and include a modular architecture that supports monthly incremental extraction of newly added listings while monitoring directory structure changes. I have experience building large-scale scraping and enrichment pipelines, ensuring scalability and stability for long-term projects. My pricing is ₹25,000 per source for full extraction and ₹8,000 per source monthly for incremental updates, with an estimated 5–7 days for completing one full directory extraction. I’m interested in building a long-term collaboration to support continuous data acquisition and expansion.
₹25,000 INR dalam 7 hari
2.5
2.5

Hello, i read your requirement i have experience in Excel and done many projects. I give you best work on your time and budget. Thanks waiting for your response...
₹13,000 INR dalam 4 hari
1.8
1.8

Hi, Your project aligns well with my 8+ years of experience in structured web scraping and automated data extraction workflows. I focus on building scalable, maintainable scraping systems designed for long-term reliability rather than one-time data pulls. For each directory, I would implement a structured extraction process to capture all required fields, followed by automated enrichment for missing emails through targeted crawling of company websites (Impressum/Contact pages) with validation and normalization logic. For monthly updates, I would design an incremental extraction system that detects newly added listings, applies the same enrichment workflow, and monitors structural changes to ensure continuity. I’m interested in building a stable, long-term collaboration and would be happy to review one of the sources to provide a detailed technical proposal and fixed pricing. I could also provide you with sample work from my previous projects. Thanks & Regards, Priyanshu
₹20,000 INR dalam 7 hari
1.4
1.4

As a highly experienced email marketing expert, I understand the power of accurate, organized data for fruitful campaigns. Over the years, I have honed my skills in extensive data extraction and processing, including large directory scraping projects similar to yours. Through my comprehensive understanding of web scraping technologies, I can confidently and proficiently handle your full extraction needs, enlisting not only companies' names but also their pertinent details like addresses and industry. One area where my expertise particularly proves valuable is enriching datasets via website crawling. This crucial task aligns with your priority of extracting business email addresses from websites where they are not readily available. With my eagle eye and meticulousness, no essential data point goes unnoticed; a methodology that guarantees reliable enrichment of your dataset. Beyond just technical competency, I approach even the most complex projects with a comprehensive outlook – scalability and structured architecture are ingrained in each action we will take. Having maintained an expansive B2B database myself for years, I genuinely comprehend the significance of long-term reliability you emphasized on. In conclusion, choosing me for this project means selecting an adept professional who not only meets but understands your unique needs.
₹25,000 INR dalam 7 hari
0.0
0.0

Hello, With over 5 years of expertise in Python, Web Scraping, and Data Mining, I am Jure, a seasoned professional ready to tackle your large-scale B2B directory extraction project in Germany & Austria. I understand the need for a structured and scalable data acquisition system for ongoing automation. Specializing in fast-turnaround automation and data solutions, my core services include Python scripting, web scraping with tools like BeautifulSoup and Scrapy, and API integrations for seamless data extraction. I ensure clean, documented code and timely delivery within 24-48 hours. I am committed to effective communication throughout the project and am excited to discuss how we can work together to achieve your data extraction goals. Thanks, Jure
₹25,000 INR dalam 7 hari
0.0
0.0

I have done large-scale scraping projects including NSE (National Stock Exchange) datasets and multiple Indian government portals where structure changes, anti-bot protections, and incremental updates were critical. I can apply the same structured, scalable approach to German and Austrian B2B directories. For Phase 1, I will build a robust extraction pipeline that captures the full company inventory per source, including company name, street, postal code, city, and prioritised email extraction. If email is not directly available but a website is listed, the system will automatically crawl the site, scan Impressum, Kontakt, and Legal Notice pages, and extract validated business email addresses using regex + domain verification logic. All data will be cleaned, deduplicated, normalized (UTF-8, German characters handled correctly), and delivered in structured Excel format. This phase will be delivered at a fixed one-time cost per source, covering full extraction, enrichment, validation, and formatted output. For Phase 2, I will implement a scalable incremental detection system that identifies newly added listings monthly through change detection, listing ID tracking, and structured diff comparison. The same enrichment logic will automatically apply to new records. Each month, you will receive a clean Excel file containing only new entries. I will also monitor structural changes in source directories to ensure long-term reliability.
₹12,500 INR dalam 1 hari
0.0
0.0

My name is Pratham, and as a digital marketing expert with interdisciplinary skills in data management, Python, and web scraping, I am supremely equipped to excel in your large-scale and long-term project of scraping B2B data from Germany and Austria industry directories. To elaborate on my rich experience, I've spearheaded projects requiring comprehensive data management, aided by my technical skills in Python programming and web scraping. Having delivered impressive results - like generating 50,000+ qualified leads - for my clients through dedicated approaches, I'm well-versed to establish a lucrative B2B database for you. Moreover, I grasp the essence of your requirements centered around building a structured framework for data extraction and incremental updates. My practice includes maintaining well-defined systems that effectively detect new listings each month while ensuring necessary enrichment via website crawling. Outlining my competitive pricing per source for both full extraction and monthly updates will help provide predictable costs without any compromise on quality. Choosing me means not just a task worker but a stable partnership that will yield consistent growth for your business. Looking ahead, I anticipate the opportunity to discuss your project in depth. My vision is to not just scrape data but to achieve scalable architecture and long-term reliability. Let's make this project a stable win-win collaboration!
₹25,000 INR dalam 7 hari
0.0
0.0

Hello, I’m interested in supporting your long-term B2B data acquisition system and understand this requires scalable, structured architecture not one-off scraping. I’ve handled large directory extractions including full inventory scraping, data normalization, website crawling for missing contact details, and delivering clean Excel outputs ready for database import. I build structured Python scraping pipelines to extract and standardize all company data. For records missing emails, I crawl company websites, extract and validate business emails, and enrich the dataset accordingly
₹25,000 INR dalam 7 hari
0.0
0.0

Hi, I'm a Python automation specialist with hands-on experience in large-scale web scraping and B2B data extraction. Technical Approach: - Scrapy + Playwright for JS-rendered directories, BeautifulSoup for static pages - Impressum/Contact page crawling with email extraction & validation (regex + DNS MX check) - Structured Excel output with deduplication against your 5M+ database - Anti-blocking: rotating proxies, rate limiting, header rotation Phase 1 — Full Extraction (per source): - Complete inventory with all required fields (email, company, address, postal, city + optional) - Website crawling for email enrichment on records without direct email - Clean Excel delivery with data quality report - Timeline: 5-7 days per source Phase 2 — Monthly Incremental: - Automated new listing detection - Same enrichment pipeline, monthly Excel delivery - Structural change alerts if directory layout changes Experience: - Scraping systems processing 100K+ records across European directories - DACH market directory structures (WKO, Gelbe Seiten, etc.) - GDPR-aware data handling I'm looking for exactly this kind of long-term partnership. Happy to start with your pilot sources immediately. Best regards, Gabor
₹15,000 INR dalam 7 hari
0.0
0.0

Gurgaon, India
Kaedah pembayaran disahkan
Ahli sejak Jul 4, 2025
₹1500-12500 INR
₹12500-37500 INR
₹37500-75000 INR
$750-1500 USD
₹12500-37500 INR
₹750-1250 INR / jam
₹1500-12500 INR
€750-1500 EUR
€30-250 EUR
€30-250 EUR
$15-25 USD / jam
$30-250 USD
$250-750 USD
$30-250 USD
£20-250 GBP
$15-25 USD / jam
$10-30 USD
$10-30 USD
$30-250 USD
$30-250 USD
₹12500-37500 INR
₹12500-37500 INR
$20 USD