Create a MapReduce program and run it on Amazon AWS Linux nodes

Build your own Hadoop AMI, starting from the Amazon Linux AMI. You must use the latest stable Hadoop release. You are required to store this AMI in S3, and its name must include your last name. This AMI will be tested with the application built for task 2. However, if your AMI doesn’t work, you are allowed to use one of the pre-built Hadoop AMIs for task 2.

Write a Hadoop/YARN MapReduce application that takes as input the 50 Wikipedia web pages dedicated to the US states (we will provide these files for consistency) and:

Computes how many times the words “education”, “politics”, “sports”, and “agriculture” appear in each file. Then, the program outputs the number of states for which each of these words is dominant (i.e., appears more times than the other three words).
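
The counting and dominance step can be sketched in plain Java, without the Hadoop API, to pin down the logic a mapper and reducer would wrap. This is a minimal sketch under stated assumptions, not the required implementation: the class and method names (`DominantWord`, `countKeywords`, `dominant`, `dominantTally`) are hypothetical, matching is case-insensitive on whole words, and a tie for the highest count is treated as "no dominant word" because the task asks for a word that appears more times than the other three.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Optional;

class DominantWord {
    // The four keywords named in the task description.
    static final List<String> KEYWORDS =
            List.of("education", "politics", "sports", "agriculture");

    // Counts whole-word, case-insensitive occurrences of each keyword.
    // Assumption: tokens are runs of ASCII letters, so "sport" is not counted as "sports".
    static Map<String, Integer> countKeywords(String text) {
        Map<String, Integer> counts = new LinkedHashMap<>();
        for (String keyword : KEYWORDS) {
            counts.put(keyword, 0);
        }
        for (String token : text.toLowerCase(Locale.ROOT).split("[^a-z]+")) {
            if (counts.containsKey(token)) {
                counts.merge(token, 1, Integer::sum);
            }
        }
        return counts;
    }

    // Returns the keyword whose count is strictly higher than the other three,
    // or empty when the top count is shared (assumption: a tie means no dominant word).
    static Optional<String> dominant(Map<String, Integer> counts) {
        String best = null;
        int bestCount = -1;
        boolean tied = false;
        for (Map.Entry<String, Integer> entry : counts.entrySet()) {
            int count = entry.getValue();
            if (count > bestCount) {
                best = entry.getKey();
                bestCount = count;
                tied = false;
            } else if (count == bestCount) {
                tied = true;
            }
        }
        return tied ? Optional.empty() : Optional.ofNullable(best);
    }

    // Tallies, for each keyword, the number of states in which it is dominant.
    // The input maps a state name to the text of its page (hypothetical layout).
    static Map<String, Integer> dominantTally(Map<String, String> pageTextByState) {
        Map<String, Integer> tally = new LinkedHashMap<>();
        for (String keyword : KEYWORDS) {
            tally.put(keyword, 0);
        }
        for (String text : pageTextByState.values()) {
            dominant(countKeywords(text)).ifPresent(w -> tally.merge(w, 1, Integer::sum));
        }
        return tally;
    }
}
```

In a Hadoop job, one plausible split is for the map phase to produce the per-file counts and for the reduce phase to build the tally; the sketch only fixes what each phase has to compute.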

Identifies all states that have the same ranking of these four words. For example, NY, NJ, and PA may have the ranking 1. Politics; 2. Sports; 3. Agriculture; 4. Education (meaning “politics” appears more times than “sports” in the Wikipedia file of the state, “sports” appears more times than “agriculture”, etc.).
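
Grouping states by ranking comes down to turning each state's four counts into a canonical ranking string and collecting the states that share it. The plain-Java sketch below illustrates this without the Hadoop API. The names `RankingGroups`, `rankingKey`, and `groupByRanking` are hypothetical, and breaking equal counts alphabetically is an assumption, since the task does not say how ties should be ranked.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class RankingGroups {
    // Builds a key such as "politics > sports > agriculture > education"
    // by sorting the words by descending count.
    // Assumption: equal counts are ordered alphabetically.
    static String rankingKey(Map<String, Integer> counts) {
        List<String> words = new ArrayList<>(counts.keySet());
        words.sort((a, b) -> {
            int byCount = Integer.compare(counts.get(b), counts.get(a));
            return byCount != 0 ? byCount : a.compareTo(b);
        });
        return String.join(" > ", words);
    }

    // Groups state names by their ranking key. The input maps a state name
    // to its per-word counts (hypothetical layout).
    static Map<String, List<String>> groupByRanking(
            Map<String, Map<String, Integer>> countsByState) {
        Map<String, List<String>> groups = new TreeMap<>();
        for (Map.Entry<String, Map<String, Integer>> entry : countsByState.entrySet()) {
            groups.computeIfAbsent(rankingKey(entry.getValue()), k -> new ArrayList<>())
                  .add(entry.getKey());
        }
        return groups;
    }
}
```

In MapReduce terms, the ranking string is a natural reduce key: a mapper could emit (ranking, state) pairs and a reducer would then receive all states that share a ranking.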

The input file is provided.

Skills: Amazon Web Services, Big Data Sales, Hadoop, Java, Linux


About the Employer:
(0 reviews) United States

Project ID: #16354070

12 freelancers are bidding an average of $297 for this job

$882 USD in 3 days
(17 reviews)

Dear Customer, My name is Yuriy Tumakha. I am interested in your AWS Hadoop project. I am a Senior Scala/Java Developer with 14 years of experience. You can see my code examples on GitHub.

$350 USD in 7 days
(16 reviews)

Hello there, We are a team of expert Big Data developers with more than 10 years of rich industry experience and have successfully delivered multiple projects in the past, like a) recipe recommendation b) movie recom...

$277 USD in 3 days
(7 reviews)

Hello, I have worked extensively on MapReduce programs in Python, Scala, and Java. Can we talk directly in the chat? Thanks!

$200 USD in 3 days
(24 reviews)

Hi, I have more than 3 years of experience in Hadoop technologies like MapReduce, HDFS, Spark, Sqoop, etc. I can write the MapReduce program according to your requirements and deploy it on Amazon AWS. Contact me fo...

$250 USD in 3 days
(7 reviews)

Hi, I am an IT specialist and data scientist and thus have the skills required for this job. I have experience building AWS EC2 machines from AMIs as well as installing and configuring Hadoop. Finally, I have experi...

$222 USD in 5 days
(7 reviews)

NOTE: Most of the requirements in your project scope have already been completed by us, and we have a demo for you as well. We are Amazon MWS / eBay API experts and have completed many projects using these APIs. I have ready to use...

$155 USD in 3 days
(1 review)

Hi, We are a team of Amazon-certified Solutions Architects. We have more than 3 years of experience with Amazon AWS and more than 5 years as Linux SysOps. We can help you with this. Please let me know if you need...

$250 USD in 5 days
(1 review)

Hey, can we start the project ASAP? We will complete it within 3 working days, so kindly suggest a time to connect.

$222 USD in 3 days
(0 reviews)

Languages: Java. Java/J2EE: Core Java, JavaFX, Advanced Java, Servlet, JSP, JSTL, EJB, JDBC, JUnit, Web Services, XML, XSD, JAX-RS, DOM, SAX, Multithreading, JTA, Custom Tags, JPA APIs. Web Technologies: HTML, DHTML...

$255 USD in 7 days
(0 reviews)

A proposal has not yet been provided

$277 USD in 3 days
(0 reviews)

I've been working in a big data company for 2 years. I'm very good at Hadoop/Spark. I know how to optimize difficult MapReduce jobs. What's more, I'm good at system-level optimization. I know how to analyze IO/network...

$222 USD in 1 day
(0 reviews)