Hi,
I would like to do it,
For testing i ran a test to fetch 10000 urls to see if they do any kind of blocking with 10 threads fetching the page in python and it completed in 20 mins on my connection and apparently they haven blocked, so are you sure that they block
If you really want to use proxies , you can use rotating proxies that change ip in a minute since you don't have to do anything state full in the website , but i think normal proxies will work fine too
I prefer python because for scrapping i have done most of the things it, I did data scraping in pure JavaScript too but you can't save data anywhere with it, node.js is an option however i am not sure about multiple requests there so
For data storage we can discuss on chat with what you prefer i mean if we have data from website and what you want to store is is about plain text, then its just mater of using API calls to the database you prefer
If you are interested i will be available on chat , you can provide me with the details you like and i can do some test runs to see if its working according to your requirements or not
Thanks