I need an expert in Node.JS, Amazon AWS, and Database management to help with a data collection project.
We have created a script which queries a third-party API, using a specific keyword, and returns JSON data related to that keyword.
We need that script to run once every 60 seconds.
We need a Database set up to store the JSON data. From speaking with experts, it sounds like this should be a relational database such as MySQL.
Task #3 (hard part):
We need to duplicate this automated script for approximately 5,000 different keywords, all running queries every 60 seconds and storing their data in the main Database.
This presents a few issues, on which we need advice.
Problem #1: The third-party API has a rate limit of 60 seconds per request. Presumably, this means we need to create 5,000 different scripts, all running from different IP addresses (need advice). From our research, some people have had success doing this on a smaller scale using Amazon’s Lambdas service. We need advice on how to run all of these API queries concurrently, efficiently, and at the lowest cost. Recommendations on services and solutions are welcome. Note: This is not an attempt to avoid paying for API access - it is a public API that has a rate limit.
Problem #2: These queries are very quickly going to create a very large database. We need advice on the best Database service / solution to use, for maximum performance at the lowest cost.
Problem #3: The goal for this data is to send it to a Front-End which will be able to present the Data in real time. We are still building that Front-End, but have a general idea of what it will look like and how the data will be presented. We need advice on the best way to Collect, Process, Analyze, Store, and Present this data in a real-time environment. From our research, there are various services and solutions for this type of problem. For example, Amazon Kinesis seems like it could be a solution? Need advice.
If you are interested, please get in touch and we can chat further. Thank you for your time.