The tasks are as follows:
1. Given the IMDB 5000 Movie dataset, create a network. You should decide which metric defines the connections in the network. For example, in a co-play network the nodes are actors, and two actors are connected if they appeared in the same movie(s).
2. Find two subnetworks. You are free to choose any two subnetworks of the network you have created, but each subnetwork must meet the following two requirements:
• It includes at least 20 nodes.
• No node in the subnetwork is isolated, so each node has at least one connection to another node.
3. Determine similarity metrics. You need to define a way of computing the similarity between the two subnetworks using the information given in this dataset. For example, based on overall budget and movie genres, parts of the two subnetworks are similar, and their computed similarity score is 0.78.
4. Interactively show the similarity. You should design an interactive way of displaying the computed similarity between the two subnetworks. For example, when a user selects a few nodes in subnetwork 1, the similar component(s) in subnetwork 2 are highlighted, and a table pops up with the information needed to explain how the similarity was determined.
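
As a rough illustration of tasks 1 to 3, the sketch below builds a co-play network, checks the two subnetwork requirements, and scores two subnetworks by the Jaccard overlap of their movie genres. It is only one possible approach, not a required design: the Movie shape, the function names, and the choice of Jaccard similarity on genres are all assumptions made for this example, and the real dataset rows would first have to be parsed into this shape.

```typescript
// Hypothetical, simplified row shape; the real dataset has more columns.
interface Movie {
  title: string;
  actors: string[]; // names of the credited actors (assumed already parsed)
  genres: string[]; // genre labels (assumed already split into a list)
}

// Adjacency map: actor -> (co-actor -> number of shared movies).
type Network = Record<string, Record<string, number>>;

function addEdge(net: Network, a: string, b: string): void {
  if (!net[a]) net[a] = {};
  net[a][b] = (net[a][b] || 0) + 1;
}

// Task 1: connect every pair of actors who appear in the same movie.
function buildCoPlayNetwork(movies: Movie[]): Network {
  const net: Network = {};
  movies.forEach((movie) => {
    const cast = movie.actors;
    for (let i = 0; i < cast.length; i++) {
      for (let j = i + 1; j < cast.length; j++) {
        addEdge(net, cast[i], cast[j]);
        addEdge(net, cast[j], cast[i]);
      }
    }
  });
  return net;
}

// Task 2: at least minSize nodes, and every node has at least one
// neighbor that is also inside the chosen subnetwork.
function isValidSubnetwork(net: Network, nodes: string[], minSize: number = 20): boolean {
  if (nodes.length < minSize) return false;
  return nodes.every((n) =>
    Object.keys(net[n] || {}).some((x) => x !== n && nodes.indexOf(x) !== -1)
  );
}

// Unique genres of all movies featuring at least one actor from the subnetwork.
function genreSet(movies: Movie[], nodes: string[]): string[] {
  const seen: Record<string, boolean> = {};
  movies.forEach((movie) => {
    if (movie.actors.some((a) => nodes.indexOf(a) !== -1)) {
      movie.genres.forEach((g) => { seen[g] = true; });
    }
  });
  return Object.keys(seen);
}

// Task 3 (assumed metric): Jaccard similarity = |A ∩ B| / |A ∪ B|.
// Both inputs are expected to contain no duplicates.
function jaccard(a: string[], b: string[]): number {
  const shared = a.filter((x) => b.indexOf(x) !== -1).length;
  const union = a.length + b.length - shared;
  return union === 0 ? 0 : shared / union;
}
```

With these helpers, a score for two subnetworks could be obtained as jaccard(genreSet(movies, nodesA), genreSet(movies, nodesB)). Other attributes mentioned in the tasks, such as budget, would need their own comparison and a rule for combining the partial scores; that choice is left open here.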

The design should run in a web browser. You may use any web-based library, but make sure you reference it in the written document.

