support - SparkR

Description

SparkR needs to be supported in Zeppelin.
To run the existing Zeppelin Tutorial/R (SparkR) notebook, you should run the following on Ubuntu:
apt-get install r-base
R -e "install.packages('knitr', repos = 'http://cran.us.r-project.org')"
sudo R -e "install.packages('data.table', type = 'source',repos = 'http://Rdatatable.github.io/data.table')"

Currently there is no way to connect to the grid from SparkR. We should support the same operations that we already support for pySpark:

  1. save DataFrame to the grid
    jsonDf.write.format("org.apache.spark.sql.insightedge").mode("overwrite").save("salaries")

  2. load DataFrame from the grid
    gridDf = spark.read.format("org.apache.spark.sql.insightedge").option("collection", "salaries").load()
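
For reference, the two pySpark calls above can be wrapped as a small round-trip helper. This is only a sketch of the behaviour the SparkR support is expected to mirror: the helper names `save_to_grid` and `load_from_grid` and the `GRID_FORMAT` constant are hypothetical and not part of InsightEdge. It assumes `df` is a pySpark DataFrame and `spark` is an active SparkSession with the InsightEdge connector available.

```python
# Data source name taken from the pySpark snippets in this ticket.
GRID_FORMAT = "org.apache.spark.sql.insightedge"


def save_to_grid(df, collection, mode="overwrite"):
    """Write a DataFrame to the grid under the given collection name.

    Hypothetical helper: wraps the save call shown above.
    """
    df.write.format(GRID_FORMAT).mode(mode).save(collection)


def load_from_grid(spark, collection):
    """Read a grid collection back as a DataFrame.

    Hypothetical helper: wraps the load call shown above.
    """
    return (
        spark.read.format(GRID_FORMAT)
        .option("collection", collection)
        .load()
    )
```

With a live session this would be used as `save_to_grid(jsonDf, "salaries")` followed by `gridDf = load_from_grid(spark, "salaries")`.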

We also need to supply suitable shell scripts in the InsightEdge bin directory:
insightedge-sparkR
insightedge-sparkR .cm

Workaround

None

Acceptance Test

None

Status

Assignee

Unassigned

Reporter

Aharon Moll

Labels

None

Priority

Medium

SalesForce Case ID

None

Fix versions

None

Commitment Version/s

None

Due date

None

Product

InsightEdge

Edition

Enterprise

Platform

All