ezoic

Monday, April 24, 2017

math and computer science

I have a Phd in statistics, a mathematical subject. I was not very outstanding in statistics though. That kind of subject is so hard that after you finish studying it, you kinda do not have the effort to do further a lot of things on it.

But, grandually, I started learning computer science, or more speicfically, programming, I found it was easier than math. Or that's my illusion? because the beginning of math is sought of easier too, and gradually it becomes more and more difficult. But how ahout computer science or i.e. programming?  I feel it is easier now, is that because I am at the beginning of computer science? though I have had 5 years' experience on cs/it etc industry.

I feel CS is more practical, with it we can "build" things, not like math, more abstract, tricks.

Tuesday, April 18, 2017

If you want to use assembly sbt, how to do it and the layout and the error fix

The way to build sbt package on linux, is first, to make some directories,

myProject/src/main/scala

And under scala, you put sbt file and scala files, and you make a "project" directory under scala, and put build.sbt under "project" directory  which has content:


- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
resolvers += Resolver.url("artifactory", url("http://scalasbt.artifactoryonline.com/scalasbt/sbt-plugin-releases"))(Resolver.ivyStylePatterns)

resolvers += "Typesafe Repository" at "http://repo.typesafe.com/typesafe/releases/"

resolvers += "Spray Repository" at "http://repo.spray.cc/"

addSbtPlugin("com.eed3si9n" %% "sbt-assembly" % "0.11.2")

resolvers += "jitpack" at "https://jitpack.io"


- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -

And I found the error after I "sbt package" under "scala" directory



- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 
module not found: com.eed3si9n#sbt-assembly;0.11.2

- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - 

Some ppl said that it was because, you did not put correct scalaVersion there. 

But I found that if I "sudo yum update", I resolved the problem. 

So when you use the linux, first thing to do is to "sudo yum update",  this is for red hat. for other linux, try 
"sudo apt-get update". 






Wednesday, April 12, 2017

Amazon EMR is an Amazon data science analysis box

Amazon EMR is an Amazon data science analysis box.

You can ssh to its linux interface using putty. And you will see this:






And itself has spark. You type "spark-shell" , and go to spark interface.

It itself does not have sbt. Its system is Redhat. So you use yum to install sbt.

First type "sudo yum update"


Here is how on a webpage. 

http://www.scala-sbt.org/0.13/docs/Installing-sbt-on-Linux.html



Then type 

curl https://bintray.com/sbt/rpm/rpm | sudo tee /etc/yum.repos.d/bintray-sbt-rpm.repo


Then  type:

 sudo yum install sbt

And you will install sbt on your EMR. You do not need to install spark additionally, it already has spark on EMR.

And you can use "sbt package" to package your scala code to jar files.

Monday, April 10, 2017

Spark data type convert, change

val text=sc.textFile("README.md")

text is an RDD. README.md is a Spark system file.

change rdd to dataframe:

val text1=text.toDF

convert list to RDD:

 val textfile1=List(("Ale",1259,278,6782))

textfile1 is a list.

 change list to RDD.
  val textfile2=sc.parallelize(textfile1)


convert list to dataframe:

val aa=List(("a","k","u","t"))

val ak=aa.map{case (a, b, c, d) => (a, b, c, d)}

val au=ak.toDF

println(au.show)



val aa=List(List("a","k","u","t"))

val ak=aa.map{case List(a, b, c, d) => (a, b, c, d)}

val au=ak.toDF

println(au.show)

change RDD to list:

 val text1=sc.textFile("README.md")

text1 is an RDD

change RDD to array:

val text2=text1.collect()











Spark scala, change the data type of the RDD

code:

val aa=sc.parallelize(List(("Ale",12,2783,89277)))

val ak = aa.map ( {case(w,k,r,u)=>(w,k,r,u.toLong)})

print(ak.toDF.show())


Change the last integer's type to Long.

looking for a man

 I am a mid aged woman. I live in southern california.  I was born in 1980. I do not have any kid. no compliacted dating.  I am looking for ...