I have to migrate a b2evolution blog to wordpress and need a script to help me with the process. I don't have access to the databases, so the script will have to use scraping.
Here's a rough idea of what I have in mind for the script:
1. The RSS feed of the b2evolution blog already contains all the posts of the blog. The script will have to extract the title, content and url for each post from the RSS feed.
2. The script then has to visit the url for each post and scrape the content to collect the category and all tags for the post.
3. The title,content,catgeories and tags for all posts should then be written to an xml file (or any other format that can be imported in Wordpress), so that I can import into wordpress.
In addition to this I also need the above script to create a 301 redirect script to help me redirect the old b2evolution urls to the new wordpress urls. Essentially it should take the old b2evolution url (as taken from the RSS feed) and the expected new Wordpress url (which can be devised from the post title) and build a 301 redirect for it. I'll provide more detail on this to the successful bidder, but essentially this script should build a bunch of if statements. I basically need a php script with a statement like this for each post:
if ($queryString == "b2evolution-url")
header("HTTP/1.1 301 Moved Permanently");
Lastlly, I also need to build a redirect file for the tag pages using a similar script, so each b2evolution tag url should redirect to the appropriate wordpress tag. There are 800 of these, so this step is important.
Please note - I don't particularly want the actual scripts that does the above - it will be fine if you do all this processing and just send me the resulting xml files and redirect scripts.
Please let me know if any of this is not clear.