I am URGENTLY looking for a new version (Version 9.0) of software to be written to be able to extract information from a specific website ([login to view URL]) using the search facility - [login to view URL] where q= a number which increments by 1 each time.
I want to set the start and end numbers for each search.
We have a very good piece of software already but it needs to be updated. It searches using multi-threading and brings back 10, 50 or 100 records at a time, selected by the Entries Per Round value on the dropdown (depending on which we choose) and then writes them directly to SQL. Please see screenshot of existing software.
A dropdown box is needed for England & Wales where the Company numbers are an 8-digit number from 00000001 upwards (the leading zeros are important), Scotland where the Company numbers begin SC and then have 6 digits from SC600000 upwards e.g. SC600600 and Northern Ireland where the Company numbers begin NI and then have 6 digits from NI600000 upwards. The program needs to update the record in the database if it already exists – it must not delete and re-add the record, it must just amend.
Currently searching is undertaken with numeric values only and the NI and SC are added after.
I need a tick box along with two value boxes which will allow me to choose how many records to search for before stopping e.g. 400 and then the time in minutes to wait until continuing the search with the next 400 – this is because Companies House now stop bulk searches from a single IP address. If the fields are blank, then these fields should be ignored.
A manual lookup option is needed to just allow a list of numbers to be looked up which are manually entered in the command line below:
The extracted data needs to be written directly to our MySQL database in the correct fields. Screen shot of what the existing software looks like attached and SQL schema also attached (although not all fields are written to by the program).
I would like an option where I can tick a box and the search will only add records between the Start Value and End value where the record does NOT currently exist.
The extracted data which is written to SQL is as below:
• Company Number
• Company Name
• Incorporation Date
• Address (split across 4 fields)
• First Directors Name
Directors’ names need to be formatted as “Firstname Surname”
You must make sure that postcode goes in the postcode field
We will provide a new SQL database table for you to write to to test, so once completed you will just be able to write to the live database.