I have a series of web pages with the full HTML source code of each page stored in longtext fields within a MySQL database.
Each record has a series of links in here, the majority of which are actually tracking links which will redirect you to the right page on the site which is why I need to resolve these.
For each record I need a list of all the domains that the links automatically resolve to and stored with the unique ID of the web page record in a separate table within the database.
The domain that needs to be stored is just [url removed, login to view], not [url removed, login to view] - I just want the bit after the www's and nothing after the .com/.org etc
On completing of this the script should also update a flag in the original table to show that it has been done.
This needs to be scheduled to run on a regular basis (every 30 minutes or so) and only select records that have not yet been processed.
Server is Windows 2003, PHP4/5
A PHP script and a cron job running every 30 minutes and done. Just trust me, I promise you great quality. I am equipped with relevant experience and skill to accomplish this.