Exploring and Querying Datasets
Extracting Data From a Dataset With Spark
Initiating a Spark Session
import pyspark
sc = pyspark.SparkContext()
spark = pyspark.sql.SparkSession(sc)install.packages("sparklyr")
library(sparklyr)
port <- Sys.getenv("SPARK_MASTER_PORT")
master <- paste("spark://master:", port, sep = '')
sc = spark_connect(master)Executing SQL Queries
Query to Extract Data From Database Using extract_dataset
extract_datasetQuery to Filter and Extract Data from Database Using extract_assay germline
extract_assay germlineRun SQL Query to Extract Data
Best Practices
Last updated
Was this helpful?