Amazon EMR is an Amazon data science analysis box.
You can ssh to its linux interface using putty. And you will see this:
And itself has spark. You type "spark-shell" , and go to spark interface.
It itself does not have sbt. Its system is Redhat. So you use yum to install sbt.
First type "sudo yum update"
Here is how on a webpage.
http://www.scala-sbt.org/0.13/docs/Installing-sbt-on-Linux.html
Then type
curl https://bintray.com/sbt/rpm/rpm | sudo tee /etc/yum.repos.d/bintray-sbt-rpm.repo
Then type:
sudo yum install sbt
And you will install sbt on your EMR. You do not need to install spark additionally, it already has spark on EMR.
And you can use "sbt package" to package your scala code to jar files.
No comments:
Post a Comment