I wanna get trending URLs from log file.
Please check attachment file:
- Input data is on Google Cloud Storage in JSON lines format
- We'll parse user-agent strings and use the free GeoLite database
- We'll use a Jupyter notebook and git with GitHub
Questions for you:
- What is your experience with these technologies?
- What similar projects have you worked on?
- What is your GitHub username?
- What trending algorithm might be best?