Modules🔗
Overview🔗
There are 12 modules used in this package, which covers 41 functions
Module Descriptions🔗
Module | Description |
---|---|
constants |
The constants module is used to hold the definitions of all constant values used across the package. |
io |
The io module is used for reading and writing tables to/from directories. |
checks |
The checks module is used to check and validate various attributed about a given pyspark dataframe. |
types |
The types module is used to get, check, and change a datafames column data types. |
keys |
The keys module is used for creating new columns to act as keys (primary and foreign), to be used for joins with other tables, or to create relationships within downstream applications, like PowerBI. |
scale |
The scale module is used for rounding a column (or columns) to a given rounding accuracy. |
dimensions |
The dimensions module is used for checking the dimensions of pyspark dataframe 's. |
columns |
The columns module is used to fetch columns from a given DataFrame using convenient syntax. |
datetime |
The datetime module is used for fixing column names that contain datetime data, adding conversions to local datetimes, and for splitting a column in to their date and time components. |
info |
The info module is used to provide utility functions for retrieving information from pyspark dataframes. |
formatting |
The formatting module provides functions for formatting and displaying. |
cleaning |
The cleaning module is used to clean, fix, and fetch various aspects on a given DataFrame. |
duplication |
The duplication module is used for duplicating data from an existing dataframe , or unioning multiple dataframe 's together. |
schema |
The schema module is used for checking, validating, and viewing any schema differences between two different tables, either from in-memory variables, or pointing to locations on disk. |
delta |
The delta module is for various processes related to Delta Lake tables. Including optimising tables, merging tables, retrieving table history, and transferring between locations. |
Functions by Module🔗
Testing🔗
This package is fully tested against:
- Unit tests
- Lint tests
- MyPy tests
- Build tests