Creating DataFrames

A Spark DataFrame is an immutable collection of data distributed within a cluster. The data inside a DataFrame is organized into named columns that can be compared to tables in a relational database.

In this recipe, we will learn how to create Spark DataFrames.