Input/Output

Data Generator

range(start[, end, step, num_partitions])

Create a DataFrame with some range of numbers.

Spark Metastore Table

read_table(name[, index_col])

Read a Spark table and return a DataFrame.

DataFrame.to_table(name[, format, mode, …])

Write the DataFrame into a Spark table.

Delta Lake

read_delta(path[, version, timestamp, index_col])

Read a Delta Lake table on some file system and return a DataFrame.

DataFrame.to_delta(path[, mode, …])

Write the DataFrame out as a Delta Lake table.

Parquet

read_parquet(path[, columns, index_col, …])

Load a parquet object from the file path, returning a DataFrame.

DataFrame.to_parquet(path[, mode, …])

Write the DataFrame out as a Parquet file or directory.

ORC

read_orc(path[, columns, index_col])

Load an ORC object from the file path, returning a DataFrame.

DataFrame.to_orc(path[, mode, …])

Write a DataFrame to the ORC format.

Generic Spark I/O

read_spark_io([path, format, schema, index_col])

Load a DataFrame from a Spark data source.

DataFrame.to_spark_io([path, format, mode, …])

Write the DataFrame out to a Spark data source.

Flat File / CSV

read_csv(path[, sep, header, names, …])

Read a CSV (comma-separated values) file into a DataFrame or Series.

DataFrame.to_csv([path, sep, na_rep, …])

Write object to a comma-separated values (csv) file.

Clipboard

read_clipboard([sep])

Read text from clipboard and pass to read_csv.

DataFrame.to_clipboard([excel, sep])

Copy object to the system clipboard.

Excel

read_excel(io[, sheet_name, header, names, …])

Read an Excel file into a pandas-on-Spark DataFrame or Series.

DataFrame.to_excel(excel_writer[, …])

Write object to an Excel sheet.

JSON

read_json(path[, lines, index_col])

Read JSON data from a file path into a DataFrame.

DataFrame.to_json([path, compression, …])

Convert the object to a JSON string.

HTML

read_html(io[, match, flavor, header, …])

Read HTML tables into a list of DataFrame objects.

DataFrame.to_html([buf, columns, col_space, …])

Render a DataFrame as an HTML table.

SQL

read_sql_table(table_name, con[, schema, …])

Read a SQL database table into a DataFrame.

read_sql_query(sql, con[, index_col])

Read the result of a SQL query into a DataFrame.

read_sql(sql, con[, index_col, columns])

Read a SQL query or database table into a DataFrame.