Data Science

SparkR with OpenShift

Posted on

Let’s set up a data science workbench on OpenShift (docker/kubernetes). The components will be an RStudio Server executing SparkR instructions to a remote Apache Spark instance, all hosted within a local OpenShift cluster instance. Examples include processing AWS S3 Bucket hosted data from Spark. So saddle up and ride ole’ Roxy all the way! This lab is another in […]