
Ditutup
Disiarkan
Dibayar semasa penghantaran
This project is a full stack system built around a vision based browser automation bot. The bot visits three target websites, fills multi-step forms on each of them using data provided by an operator, scrapes the results returned after each submission, and stores everything in a database. The operator interacts with the system through a web interface and never touches the bot or the code directly. The system has five parts that all need to be built and connected. A React frontend is where the operator submits input data and monitors what the bot is doing in real time. A FastAPI backend sits behind the frontend and handles communication between the operator, the bot, and the database. Redis acts as the message layer between the backend and the bot, passing events like run triggers, status updates, CAPTCHA alerts, and completions. PostgreSQL stores all input and output data with full traceability between them. The automation service is the bot itself, built using MSS for screen capture, OpenCV for image matching, OCR for reading text from the screen, and PyAutoGUI for simulating mouse and keyboard interactions. The bot does not interact with HTML at any point. It works entirely by looking at the screen, the same way a human would. For every section on every page it visits, it must follow a strict five step cycle. It first takes a screenshot and verifies it is in the correct section before doing anything. It then selects that section and verifies again. It then fills the required field using only mouse and keyboard inputs. It then takes another screenshot to confirm the field is no longer empty. It then validates one more time before moving forward. If any step fails, the bot goes back to a defined earlier step and retries. There are no shortcuts and no fallback methods. If the bot encounters a Cloudflare challenge or bot detection page it cannot resolve on its own, it pauses, publishes an alert through Redis, and triggers a sound notification that repeats every five minutes until a human operator manually resolves it and tells the bot to continue. The full system must run on Windows. All output data must be available for download as an Excel file through the frontend. Version control and code documentation are mandatory throughout the build. The reference images and interaction flow the bot uses for visual matching will be provided by the client. Developers do not generate or decide these themselves.
ID Projek: 40354491
45 cadangan
Projek jarak jauh
Aktif 6 hari yang lalu
Tetapkan bajet dan garis masa anda
Dapatkan bayaran untuk kerja anda
Tuliskan cadangan anda
Ianya percuma untuk mendaftar dan membida pekerjaan
45 pekerja bebas membida secara purata $458 USD untuk pekerjaan ini

Noticed that the system's core relies on a vision-based bot interacting with multi-step forms on three specific websites. I recently implemented a similar system using OpenCV for a client in the e-commerce sector, which involved complex form parsing and data storage in PostgreSQL. Curious, have you considered how the operator will handle discrepancies in form layouts across different websites? There's an effective strategy we could discuss. Can refine the approach today, let me know when you’d like to dive deeper.
$250 USD dalam 7 hari
5.6
5.6

Hello, I’m Karthik with 15+ years of experience in Python automation, FastAPI, React, PostgreSQL, Redis, and computer-vision-driven workflow systems. I can build your Windows-based full-stack vision automation platform exactly around the strict screen-driven process you described. My approach: • Build React frontend for operator input, run monitoring, alerts, and Excel export • Develop FastAPI backend for orchestration, APIs, auth, and traceability • Use Redis for bot triggers, live status, CAPTCHA/manual intervention alerts, and completion events • Store all input/output data in PostgreSQL with full run mapping • Build the bot with MSS + OpenCV + OCR + PyAutoGUI, following your required 5-step verification cycle for every field/section • Add retry logic, logging, screenshots, sound alerts every 5 minutes, and resume controls after manual intervention Deliverables: • Complete frontend, backend, bot, Redis, PostgreSQL integration • Excel export from frontend • Windows-ready setup • Version-controlled, documented codebase I understand this is pure vision automation only—no Selenium, no HTML shortcuts, no fallback methods. My focus will be reliability, traceability, and clean system integration. Warm Regards, Karthik B
$750 USD dalam 7 hari
5.0
5.0

Hi there, Most developers approach browser automation by hooking into HTML and the DOM. Because your strict requirement is a purely vision-based, human-mimicking bot, this project cannot be treated like a standard scraper; it needs to be architected as a resilient, visual state-machine. I specialize in building complex, data-driven automation pipelines and full-stack systems. I understand that the success of this bot relies entirely on strict operational boundaries: zero HTML interaction, mandatory 5-step visual verifications for every action, and a seamless human-in-the-loop system for Cloudflare challenges. Here is my execution plan for your architecture: 1. The Vision Automation Engine (Python / OpenCV / PyAutoGUI) The bot will be built as a standalone Windows service. Using your provided reference images, I will implement a strict visual verification loop. Before any mouse or keyboard event is fired via PyAutoGUI, the system will use MSS for high-speed screen capture and OpenCV/OCR to assert the current state. If the post-action screenshot validation fails, the bot will gracefully revert to the previous safe state to retry, ensuring zero rogue clicks. 2. The Message Layer & Human-in-the-Loop (Redis) I will implement Redis as the central nervous system. This is crucial for your Cloudflare requirement. If the bot's OCR detects a challenge it cannot pass, it will instantly halt execution, publish a "Challenge Event" to Redis, and trigger the localized 5-minute recurring audio alarm. 3. The Command Center (FastAPI & React) The bot will run headlessly (from an operator's perspective), controlled via a clean React frontend and a FastAPI backend. The operator will feed target data through this portal, which FastAPI will log in PostgreSQL (for full traceability) before pushing the job to the Redis queue for the bot to pick up. 4. Data Delivery & Documentation All scraped output data will be relationally mapped in Postgres and exportable to Excel directly from the React dashboard. The entire codebase will be managed via Git, with comprehensive documentation on how to update the OpenCV reference images when the target UI changes. I am ready to review the initial reference images and interaction flows. Let’s connect to discuss the specific target sites and get this built.
$700 USD dalam 14 hari
4.6
4.6

Hey, I liked your project, Vision-Based Browser Automation System Development and believe I can help you with the project. With my background in Python, PostgreSQL, Redis, I'm confident I can meet your requirements. Would be glad to go over specifics if you're interested.
$250 USD dalam 7 hari
4.4
4.4

Hi! I'm a specialist in Python and Automation. I noticed your project 'Vision-Based Browser Automation System Development' and it piqued my interest. I understand that you need a full-stack system built around a vision-based browser automation bot. I've worked on similar projects and am confident in my ability to deliver. If you have any questions or want to discuss the project's specifics, please, feel free to reach out. I'd be more than happy to contribute to your project and deliver clean, tested, production-ready code. Looking forward to potentially working together.
$600 USD dalam 14 hari
4.2
4.2

As a seasoned Full Stack Developer with over 7 years of experience, I am well-versed in designing and implementing end-to-end solutions, which is precisely what your vision-based browser automation system entails. With my exceptional skills in tying React frontends to robust FastAPI backends, I can integrate the required Redis message layer, keeping the operator, bot, and your PostgreSQL database communicating harmoniously. Furthermore, my expertise in database management and proficiency with PostgreSQL serves as a guarantee for an efficient data storage system that provides full traceability—a critical aspect of your project. A notable functionality of this system relies on the bot's ability to mimic human-like behavior on the web interfaces it interacts with. This is where my extensive knowledge in screen capture using MSS, image matching using OpenCV, OCR for text extraction, and PyAutoGUI for simulating mouse and keyboard interactions becomes pivotal. My competency in these technologies aligns neatly with your project's demands. Maintaining traceability and version control are nonnegotiable throughout the development process and that's something you can expect from me at all times. I'm certain my communicative approach and deep passion for my work would help us establish a solid professional relationship that lasts far beyond this project. Let's connect and discuss further!
$500 USD dalam 7 hari
4.0
4.0

Your vision-based automation system needs a React frontend, FastAPI backend, Redis messaging, PostgreSQL storage, and a Python bot using MSS/OpenCV/OCR for screen-based form filling. I'd build the full stack with real-time WebSocket updates so you can monitor the bot's five-step verification cycle as it works through each target site. I built similar automation systems - an AI travel planner with FastAPI/React and a full-stack invoicing platform with 81-table PostgreSQL. Also developed automated trading bots and PDF processing pipelines that handle complex multi-step workflows. You can check out my work at ffulb.com. Ready to start building this system. Want to discuss the technical details and how we'd handle the cross-component communication?
$500 USD dalam 10 hari
3.6
3.6

Hello There!!! ★★★★ ( Full-stack vision automation system with CV bot + React dashboard ) ★★★★ I understand you need a complete system with a vision-based bot (no DOM access) that fills multi-step forms, validates via screenshots, handles retries, and integrates with React, FastAPI, Redis, and PostgreSQL for full control and traceability. ⚜ Vision bot using OpenCV, OCR, PyAutoGUI, MSS ⚜ React dashboard for input & live tracking ⚜ FastAPI backend with API orchestration ⚜ Redis messaging for events & alerts ⚜ PostgreSQL structured data storage ⚜ Step-based validation & retry logic ⚜ Excel export & logging system I have 6+ years experince in CV automation and full-stack systems, built similar human-like bots with strict validation cycles and error recovery flows. My approach: modular Python bot, Redis event-driven pipeline, scalable backend, and pixel-adaptive template matching for stability. Ensures accurate automation even with UI shifts. Let’s discuss your workflow and start building this robust system. Warm Regards, Farhin B.
$500 USD dalam 7 hari
3.2
3.2

Hi, I’m very interested in your Vision-Based Browser Automation System Development project. I specialize in building robust vision-based automation systems using exactly the stack you need: Python + MSS (screen capture), OpenCV (image matching), OCR, PyAutoGUI (mouse/keyboard actions), FastAPI backend, Redis messaging, PostgreSQL and React frontend. What I will deliver: Complete 5-step verification cycle for every form section (screenshot → verify → select → fill → confirm → validate) with full retry logic on failure. Vision bot that works 100% by screen (no HTML/DOM interaction), as specified. React frontend for operator input, real-time monitoring and Excel export of all data. FastAPI + Redis communication layer (run triggers, status, CAPTCHA alerts with sound notification every 5 minutes). PostgreSQL with full traceability between inputs and outputs. Full system running on Windows, clean architecture, Git version control and detailed documentation. I can start immediately. I always deliver modular, well-commented, production-ready code that is easy to maintain. Please share the reference images and interaction flow so I can confirm the details. Looking forward to building this reliable vision-based system for you. Best regards
$500 USD dalam 7 hari
3.3
3.3

Hi there, I’ve read the Vision-Based Browser Automation requirements carefully and I’m confident I can deliver a robust, Windows-native system that meets your strict vision-first constraints. I’ve built full-stack automation systems using OpenCV for template matching, Tesseract-based OCR, MSS for reliable screen capture, and PyAutoGUI for deterministic mouse/keyboard interactions, all coordinated via FastAPI, Redis messaging, and PostgreSQL for traceable data storage. My approach is to design a deterministic state machine that enforces the five-step verification cycle per section, with visual checks after each action and retry logic tied to explicit rollback points. Redis will convey run triggers, CAPTCHA alerts and status updates; the frontend (React) will stream live bot screenshots and statuses and allow Excel export of results. I’ll include sound alerting logic and manual resume controls when human intervention is required. Full version control and inline documentation will accompany the deliverable. I can start with a detailed design and milestone plan and deliver a working prototype within the initial sprint. Which three target sites and any special form flows or timing quirks should I prioritize first? Best regards, Cindy Viorina
$250 USD dalam 6 hari
2.2
2.2

Hello, I can develop your complete full-stack system centered around a vision-based automation bot with strict workflow validation and reliable architecture. I have strong experience in building systems using React, FastAPI, Redis, PostgreSQL, and computer vision tools like OpenCV, OCR, MSS, and PyAutoGUI, which aligns perfectly with your requirements. I will create a clean React frontend where operators can input data, monitor bot activity in real time, and download outputs in Excel format. The FastAPI backend will manage all orchestration, connecting the frontend, Redis messaging layer, automation bot, and PostgreSQL database with full traceability of inputs and results. Redis will handle real-time communication such as job triggers, progress updates, CAPTCHA alerts, and completion signals. The automation bot will strictly follow your defined five-step verification cycle using screen-based interaction only—no HTML access—ensuring accuracy and consistency. I will implement robust retry logic, section validation, and controlled execution flow. In case of Cloudflare or bot-detection challenges, the system will pause, send alerts via Redis, and trigger recurring notifications until manual intervention is completed.
$750 USD dalam 7 hari
2.5
2.5

Hi, This is Jagrati. I understand you need a full-stack system built around a vision-based automation bot that interacts with websites purely through screen recognition (no DOM/Selenium), integrated with a React frontend, FastAPI backend, Redis messaging, PostgreSQL, and a Windows-based bot service. My approach is to design an event-driven architecture where the React frontend allows operators to submit inputs and monitor runs in real time. FastAPI will orchestrate communication between the UI, Redis, the bot worker, and PostgreSQL. Redis will handle triggers, status updates, alerts, and completion events, while PostgreSQL stores all input/output data with full traceability. The bot will be implemented in Python using MSS for screen capture, OpenCV for template matching, OCR for text extraction, and PyAutoGUI for mouse/keyboard actions. It will follow a strict step-by-step verification cycle (capture → verify → interact → re-verify → validate) with retries on failure. No shortcuts or DOM-based fallbacks will be used. For CAPTCHA/detection cases, the bot will pause execution, send an alert via Redis, and trigger a repeating notification until manually resumed by the operator. The frontend will display live logs, bot status, and allow Excel export of results. The system will be Windows-compatible, fully documented, and version-controlled. All visual templates and reference images will be used exactly as provided. Best regards, Jagrati
$350 USD dalam 7 hari
2.3
2.3

Developing a vision-based browser automation system that interacts with websites through visual cues, mimicking human actions, without HTML interaction. The system includes a React frontend for operator input and real-time monitoring, and a FastAPI backend for communication between the operator, bot, and database. Redis will manage event messaging, including CAPTCHA alerts and status updates. PostgreSQL will ensure data traceability, storing both input and output. The bot will utilize MSS for screen capture, OpenCV for image recognition, OCR for text reading, and PyAutoGUI for simulating interactions. It will follow a strict five-step cycle for each form section, ensuring accuracy and retrying upon failure. Cloudflare challenges will trigger alerts via Redis and sound notifications until manually resolved. The system will run on Windows, with all data downloadable as Excel files. Version control and documentation will be maintained throughout. Timeline: 5 days. Could you confirm the availability of the reference images for the bot's visual matching process?
$600 USD dalam 5 hari
1.8
1.8

⭐⭐⭐⭐⭐ Solving Your Browser Automation Needs Effortlessly Hi there, ➤ Striving to streamline your workflow by developing a vision-based browser automation system that seamlessly navigates through target websites, fills forms, scrapes data, and stores it securely. ✅ How I will help: ↪️ Implementing a React frontend for real-time monitoring, a FastAPI backend for smooth communication, utilizing Redis for event handling, PostgreSQL for data storage, and crafting a robust automation bot using MSS, OpenCV, OCR, and PyAutoGUI. ✅ Work Experience: *️⃣ Proficient in developing complex automation systems, ensuring precise execution and error handling. ⚫Quick Questions: ❔ How often do you anticipate the need for manual intervention in the automation process? ❔ Are there any specific security protocols or compliance requirements to consider? Looking forward to discussing the project further. Regards, ✍️ AB
$490 USD dalam 7 hari
1.1
1.1

Hi there, it looks like you're dealing with a complex vision-based automation workflow that needs strict validation loops, reliable communication between services, and full-stack coordination. I've spent the last 5 years building vision-driven bots using OpenCV, OCR, FastAPI, Redis, and PostgreSQL, and I can implement your multi-step screenshot-verified flow with the exact five-step cycle you described. I'll build the React dashboard for real-time monitoring, the FastAPI backend for orchestrating bot activity, the Redis messaging layer for events, and the Windows-based automation bot using MSS, PyAutoGUI, and your provided reference images. I'll also ensure Excel export, Cloudflare-interrupt handling, and full version control and documentation throughout. Best regards, Arvin Jay
$555 USD dalam 5 hari
0.0
0.0

Hello, As an experienced Full Stack Developer and AI Chatbot expert, I bring a unique set of skills perfectly aligned with the needs of your project. I have successfully built sophisticated automation systems using vision-based techniques - including image matching, OCR, and screen capture - similar to what you require for this browser automation bot. Blending Python with tools like MSS, OpenCV, and PyAutoGUI, I've developed highly efficient, error-averse bots that perform tasks like form-filling and data scraping accurately using only the information visible on-screen. Moreover, my expertise extends to every aspect of your project architecture. From the React frontend through the FastAPI backend to Redis and PostgreSQL storage, I have in-depth knowledge and hands-on experience. This also involves integration of CAPTCHA resolution mechanisms for overcoming potential obstacles during automation - something that your project specifically emphasizes on. Additionally, I understand that effective project management and thorough documentation is essential throughout development. I assure you that I am meticulous about version control and code documentation to ensure transparency, traceability, and improved collaboration. Finally, what sets me apart is not just my technical acumen but also my proactive attitude towards risk mitigation. As we develop your system together, I'll emphasize the importance of testing every defined step meticulo Thanks!
$250 USD dalam 6 hari
0.0
0.0

Hi There, I am excited about your project involving a vision-based browser automation bot. Your requirement for a seamless full-stack system that integrates React, FastAPI, Redis, PostgreSQL, and robust image processing techniques aligns perfectly with my expertise. With over 7 years of experience in Python, PostgreSQL, Redis, Image Processing, OpenCV, and Full Stack Development, I am well-equipped to build the five integrated components you need. My experience in automation and database management ensures that every detail, from user input at the frontend to data storage, is meticulously handled. You can view my relevant work here: [Portfolio Link](https://www.freelancer.com/u/KanzahAfaqAhmad) I look forward to the opportunity to discuss how I can bring your vision to life. Thank you, Regards, Kanzah Afaq
$500 USD dalam 7 hari
0.0
0.0

I can build your full-stack, vision-based browser automation system with a strong focus on reliability, traceability, and strict adherence to your defined interaction flow. The solution will include a React frontend for operator input and real-time monitoring, a FastAPI backend to orchestrate communication, Redis for event-driven messaging, PostgreSQL for structured data storage, and a Windows-based automation bot using MSS, OpenCV, OCR, and PyAutoGUI. Each component will be cleanly integrated so the operator can control and observe the entire process without directly interacting with the bot. The automation engine will strictly follow your five-step validation cycle for every interaction—screen verification, selection confirmation, input via mouse/keyboard, post-action validation, and final confirmation—ensuring human-like execution with no shortcuts. Robust retry logic will be implemented for failure handling, and in cases like CAPTCHA or Cloudflare challenges, the bot will pause, trigger Redis alerts, and notify the operator until manual resolution is completed. All actions and results will be logged and linked in PostgreSQL for full traceability, with export functionality to Excel उपलब्ध directly from the frontend.
$650 USD dalam 15 hari
0.0
0.0

⚠️No results no payment ⚠️ Your system’s strict five-step verification cycle for each form section ensures accuracy and reliability—you're in good luck. I specialize in precise automation and seamless integration, guaranteeing the bot’s flawless visual recognition and interaction without shortcuts. Deliverables: - Fully connected React frontend with real-time monitoring - Robust FastAPI backend managing communication - Reliable Redis event messaging layer - Secure PostgreSQL database with traceability - Vision-based bot using MSS, OpenCV, OCR, and PyAutoGUI - Excel export functionality and thorough code documentation I would like to discuss more about the project. You lose nothing. If you create milestones, your payment will be fully protected, and you can use this message as proof for a full refund guarantee. Kind Regards Shafeeq
$250 USD dalam 14 hari
0.0
0.0

Hi! I noticed you need help with Workflow automation. I can automate your workflow with Python/Go. Save hours of manual work with custom scripts. Clean code, fast delivery. Available to start now!
$500 USD dalam 7 hari
0.0
0.0

Dhaka, Bangladesh
Kaedah pembayaran disahkan
Ahli sejak Apr 5, 2026
$10-3500 USD
$10-300 USD
€250-750 EUR
$30-250 USD
₹600-1500 INR
₹100-400 INR / jam
₹1500-12500 INR
$30-250 USD
₹1500-12500 INR
$14-20 NZD
₹1500-12500 INR
₹12500-37500 INR
$30-250 USD
$5000-10000 USD
$250-750 USD
₹12500-37500 INR
$250-750 USD
₹12500-37500 INR
₹37500-75000 INR
$3000-5000 USD
$30-250 USD
₹750-1250 INR / jam