Skip to content

Latest commit

 

History

History
74 lines (39 loc) · 2.01 KB

README.md

File metadata and controls

74 lines (39 loc) · 2.01 KB

Kata-HowTo

Download links :

SSH Terminal with SFTP client : https://mobaxterm.mobatek.net/download-home-edition.html

IDE : https://www.jetbrains.com/fr-fr/idea/download

Map the hostname of the sandbox to the IP adress :

In this file :

c:\Windows\System32\Drivers\etc\hosts (windows)

/etc/hosts (unix)

add the following line :

172.16.1.120 sandbox-hdp.hortonworks.com

Connection

After being connected to the VPN, You can connect to the the Sandbox VM using MobaXTerm : 172.16.1.120:2222 with the login / password you received

Files required

Find attached in this repository all the files you could use during the kata

Kata

You will find Kata project for the Spark, Kafka and Hive. Once you import project into IntelliJ it should be up and running. You will find the correction in the correction branch.

Building project

Clone the project in Intellij : File -> New -> Project from Version Control [Enter git address]

Once you cloned the project, directly try to build with maven install. Maven Tab (Upper Right) -> [project] -> Lifecycle -> install

IntelliJ will ask you to set SDK: please directly download JDK 1.8 provided by IntelliJ.

kata-spark

You will only execute program from your own computer. Please download the Scala plugin for IntelliJ in File -> Settings -> Plugin and search for "Scala". Once you restarded the IDE, please Setup Scala SDK to a version 2.12.X

If you cannot run the KataX program, please ensure your scala directory is properly marked as Sources in File -> Project Structure -> Modules

kata-hive and kata-kafka

Once you builded a project with maven install, you will find the jar generated in target directory. Now you can upload it via MobaXTerm to your home directory.

For kata-hive, you just have to specify the jar in the creation of your function inside the hive client (you will need to push it first to HDFS). See explanations provided in the slides.

For kata-kafka, just launch java -jar kata-kafka-1.0-SNAPSHOT-shaded.jar