*~*~*~* NO COPY/PASTE PROPOSALS *~*~*~*
*~*~*~* PLEASE READ PROJECT DESCRIPTION BEFORE POSTING *~*~*~*
I have a PHP-based web crawler hosted on AWS. The crawler runs every 24 hours, scans a list of websites that I provide, and emails me if it detects a change on any of them.
There are two problems:
1) The crawler reports web page updates even when there are none. This happens for only some of the websites, not all of them.
2) The crawler is VERY expensive to run on AWS. I believe the code could be improved to make it less costly to run on an EC2 server (see screenshot).
I'd like two tasks completed:
1) Help me debug the crawler so that it emails me only when a web page has actually changed.
2) Explain why the crawler costs so much to run, and help me change the code or the AWS instance configuration to bring that cost down.
This project should be completed within 24 hours, given that all the code is already written and just needs to be improved.